ETRI-Knowledge Sharing Plaform



논문 검색
구분 SCI
연도 ~ 키워드


학술대회 A Comparison of the Effects of Data Imputation Methods on Model Performance
Cited 6 time in scopus Download 2 time Share share facebook twitter linkedin kakaostory
김우영, 조원웅, 최장호, 김지용, 박천복, 주재걸
International Conference on Advanced Communications Technology (ICACT) 2019, pp.592-599
18HB2700, 직독식 수질복합센서 및 초분광영상 기반 시공간 복합 인공지능 녹조 예측 기술, 권용환
Missing values cause critical problems on training a prediction model. Various missing data imputation methods have been introduced to settle down the problem. However, the imputation accuracy obtained by the methods is insufficient to validate performance of prediction models. Thus, in this study, we compare (1) imputation accuracy from various imputation methods as well as (2) the effects of imputation methods on prediction accuracy, investigating a relationship between imputation accuracy and prediction accuracy. For the comparison, we use water quality data composed of the latest actual observational multi-sensor data from Daecheong Lake. We conduct several experiments to compare seven imputation methods including a state of the art method, and their effects on three distinct prediction models. Through quantitative comparison and analysis, we proved that it is necessary to consider both imputation accuracy and model prediction accuracy when choosing an imputation method.
KSP 제안 키워드
Critical problems, Imputation methods, Missing Data Imputation, Missing values, Model performance, Multi-Sensor, Performance of prediction, Prediction accuracy, Quality data, Quantitative comparison, Water quality