SCI和EI收录∣中国化工学会会刊

中国化学工程学报 ›› 2025, Vol. 84 ›› Issue (8): 254-265.DOI: 10.1016/j.cjche.2025.05.019

• Full Length Article • 上一篇    下一篇

A data-driven predictive model for solubility: A case study of the NaCl-Na2SO4-H2O system

Yuan Wang1, Mengyue Chen1, Jingwei Tian1, Weidong Zhang1,3,4, Dahuan Liu1,2,3,4,5   

  1. 1. College of Chemical Engineering, Qinghai University, Xining 810016, China;
    2. State Key Laboratory of Organic-Inorganic Composites, Beijing University of Chemical Technology, Beijing 100029, China;
    3. Key Laboratory of Salt Lake Chemical Material of Qinghai Province, Qinghai University, Xining 810016, China;
    4. Salt Lake Chemical Engineering Research Complex, Qinghai University, Xining 810016, China;
    5. Qinghai Provincial Laboratory for Intelligent Computing and Application, Xining 810016, China
  • 收稿日期:2024-12-26 修回日期:2025-04-14 接受日期:2025-05-28 出版日期:2025-08-28 发布日期:2025-06-14
  • 通讯作者: Weidong Zhang,E-mail:weidzhang1208@126.com;Dahuan Liu,E-mail:liudh@mail.buct.edu.cn
  • 基金资助:
    The financial support of the Natural Science Foundation of Qinghai Province of China (2024-ZJ-940) and Qinghai University Research Ability Enhancement Project (2025KTST02) are greatly appreciated.

A data-driven predictive model for solubility: A case study of the NaCl-Na2SO4-H2O system

Yuan Wang1, Mengyue Chen1, Jingwei Tian1, Weidong Zhang1,3,4, Dahuan Liu1,2,3,4,5   

  1. 1. College of Chemical Engineering, Qinghai University, Xining 810016, China;
    2. State Key Laboratory of Organic-Inorganic Composites, Beijing University of Chemical Technology, Beijing 100029, China;
    3. Key Laboratory of Salt Lake Chemical Material of Qinghai Province, Qinghai University, Xining 810016, China;
    4. Salt Lake Chemical Engineering Research Complex, Qinghai University, Xining 810016, China;
    5. Qinghai Provincial Laboratory for Intelligent Computing and Application, Xining 810016, China
  • Received:2024-12-26 Revised:2025-04-14 Accepted:2025-05-28 Online:2025-08-28 Published:2025-06-14
  • Contact: Weidong Zhang,E-mail:weidzhang1208@126.com;Dahuan Liu,E-mail:liudh@mail.buct.edu.cn
  • Supported by:
    The financial support of the Natural Science Foundation of Qinghai Province of China (2024-ZJ-940) and Qinghai University Research Ability Enhancement Project (2025KTST02) are greatly appreciated.

摘要: Accurate prediction of solubility data in the Sodium Chloride-Sodium Sulfate-Water system is essential. It provides theoretical support for salt lake resource development and wastewater treatment technologies. This study proposes an innovative solubility prediction approach. It addresses the limitations of traditional thermodynamic models. This is particularly important when experimental data from various sources contain inconsistencies. Our approach combines the Weighted Local Outlier Factor technique for anomaly detection with a Deep Ensemble Neural Network architecture. This methodology effectively removes local outliers while preserving data distribution integrity, and integrates multiple neural network sub-models to comprehensively capture system features while minimizing individual model biases. Experimental validation demonstrates exceptional prediction performance across temperatures from -20 °C to 150 °C, achieving a coefficient of determination of 0.989 after Bayesian hyperparameter optimization. This data-driven approach provides more accurate and universally applicable solubility predictions than conventional thermodynamic models, offering theoretical guidance for industrial applications in salt lake resource utilization, separation process optimization, and environmental salt management systems.

关键词: Weighted local outlier factor, Deep ensemble neural network, Solubility prediction, Optimization algorithm, Outlier detection

Abstract: Accurate prediction of solubility data in the Sodium Chloride-Sodium Sulfate-Water system is essential. It provides theoretical support for salt lake resource development and wastewater treatment technologies. This study proposes an innovative solubility prediction approach. It addresses the limitations of traditional thermodynamic models. This is particularly important when experimental data from various sources contain inconsistencies. Our approach combines the Weighted Local Outlier Factor technique for anomaly detection with a Deep Ensemble Neural Network architecture. This methodology effectively removes local outliers while preserving data distribution integrity, and integrates multiple neural network sub-models to comprehensively capture system features while minimizing individual model biases. Experimental validation demonstrates exceptional prediction performance across temperatures from -20 °C to 150 °C, achieving a coefficient of determination of 0.989 after Bayesian hyperparameter optimization. This data-driven approach provides more accurate and universally applicable solubility predictions than conventional thermodynamic models, offering theoretical guidance for industrial applications in salt lake resource utilization, separation process optimization, and environmental salt management systems.

Key words: Weighted local outlier factor, Deep ensemble neural network, Solubility prediction, Optimization algorithm, Outlier detection