SCI和EI收录∣中国化工学会会刊

中国化学工程学报 ›› 2023, Vol. 53 ›› Issue (1): 201-210.DOI: 10.1016/j.cjche.2022.01.033

• Full Length Article • 上一篇    下一篇

Univariate imputation method for recovering missing data in wastewater treatment process

Honggui Han, Meiting Sun, Huayun Han, Xiaolong Wu, Junfei Qiao   

  1. af005. Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China;
    af010. Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100124, China;
    af015. Engineering Research Center of Digital Community, Ministry of Education, Beijing 100124, China
  • 收稿日期:2021-09-13 修回日期:2022-01-05 出版日期:2023-01-28 发布日期:2023-04-08
  • 通讯作者: Honggui Han,E-mail:Rechardhan@sina.com
  • 基金资助:
    The authors are thankful to the National Key Research and Development Project (No. 2018YFC1900800-5), the National Natural Science Foundation of China (Nos. 61890930-5, 61903010, 6202100), the Beijing Outstanding Young Scientist Program (No. BJJWZYJH01201910005020) and the Beijing Natural Science Foundation (No. KZ202110005009).

Univariate imputation method for recovering missing data in wastewater treatment process

Honggui Han, Meiting Sun, Huayun Han, Xiaolong Wu, Junfei Qiao   

  1. af005. Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China;
    af010. Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100124, China;
    af015. Engineering Research Center of Digital Community, Ministry of Education, Beijing 100124, China
  • Received:2021-09-13 Revised:2022-01-05 Online:2023-01-28 Published:2023-04-08
  • Contact: Honggui Han,E-mail:Rechardhan@sina.com
  • Supported by:
    The authors are thankful to the National Key Research and Development Project (No. 2018YFC1900800-5), the National Natural Science Foundation of China (Nos. 61890930-5, 61903010, 6202100), the Beijing Outstanding Young Scientist Program (No. BJJWZYJH01201910005020) and the Beijing Natural Science Foundation (No. KZ202110005009).

摘要: High-quality data play a paramount role in monitoring, control, and prediction of wastewater treatment process (WWTP) and can effectively ensure the efficient and stable operation of system. Missing values seriously degrade the accuracy, reliability and completeness of the data quality due to network collapses, connection errors and data transformation failures. In these cases, it is infeasible to recover missing data depending on the correlation with other variables. To tackle this issue, a univariate imputation method (UIM) is proposed for WWTP integrating decomposition method and imputation algorithms. First, the seasonal-trend decomposition based on loess method is utilized to decompose the original time series into the seasonal, trend and remainder components to deal with the nonstationary characteristics of WWTP data. Second, the support vector regression is used to approximate the nonlinearity of the trend and remainder components respectively to provide estimates of its missing values. A self-similarity decomposition is conducted to fill the seasonal component based on its periodic pattern. Third, all the imputed results are merged to obtain the imputation result. Finally, six time series of WWTP are used to evaluate the imputation performance of the proposed UIM by comparing with existing seven methods based on two indicators. The experimental results illustrate that the proposed UIM is effective for WWTP time series under different missing ratios. Therefore, the proposed UIM is a promising method to impute WWTP time series.

关键词: Univariate, Self-similarity, Waste water, Algorithm, Integration

Abstract: High-quality data play a paramount role in monitoring, control, and prediction of wastewater treatment process (WWTP) and can effectively ensure the efficient and stable operation of system. Missing values seriously degrade the accuracy, reliability and completeness of the data quality due to network collapses, connection errors and data transformation failures. In these cases, it is infeasible to recover missing data depending on the correlation with other variables. To tackle this issue, a univariate imputation method (UIM) is proposed for WWTP integrating decomposition method and imputation algorithms. First, the seasonal-trend decomposition based on loess method is utilized to decompose the original time series into the seasonal, trend and remainder components to deal with the nonstationary characteristics of WWTP data. Second, the support vector regression is used to approximate the nonlinearity of the trend and remainder components respectively to provide estimates of its missing values. A self-similarity decomposition is conducted to fill the seasonal component based on its periodic pattern. Third, all the imputed results are merged to obtain the imputation result. Finally, six time series of WWTP are used to evaluate the imputation performance of the proposed UIM by comparing with existing seven methods based on two indicators. The experimental results illustrate that the proposed UIM is effective for WWTP time series under different missing ratios. Therefore, the proposed UIM is a promising method to impute WWTP time series.

Key words: Univariate, Self-similarity, Waste water, Algorithm, Integration