SCI和EI收录∣中国化工学会会刊

中国化学工程学报 ›› 2023, Vol. 61 ›› Issue (9): 237-247.DOI: 10.1016/j.cjche.2023.03.020

• Full Length Article • 上一篇    下一篇

Data-driven intelligent modeling framework for the steam cracking process

Qiming Zhao1,2, Kexin Bi3,4, Tong Qiu1,2   

  1. 1. Department of Chemical Engineering, Tsinghua University, Beijing 100084, China;
    2. Beijing Key Laboratory of Industrial Big Data Systems and Applications, Tsinghua University, Beijing 100084, China;
    3. School of Chemical Engineering, Sichuan University, Sichuan 610065, China;
    4. Department of Bioprocess Engineering, Institute of Biotechnology, Technische Universität Berlin, Berlin 10623, Germany
  • 收稿日期:2022-11-17 修回日期:2023-03-01 出版日期:2023-09-28 发布日期:2023-12-14
  • 通讯作者: Kexin Bi,E-mail:bkx21@scu.edu.cn;Tong Qiu,E-mail:qiutong@tsinghua.edu.cn
  • 基金资助:
    The research work reported here was supported by the National Key Research and Development Program of China(2021 YFB 4000500, 2021 YFB 4000501, and 2021 YFB 4000502).

Data-driven intelligent modeling framework for the steam cracking process

Qiming Zhao1,2, Kexin Bi3,4, Tong Qiu1,2   

  1. 1. Department of Chemical Engineering, Tsinghua University, Beijing 100084, China;
    2. Beijing Key Laboratory of Industrial Big Data Systems and Applications, Tsinghua University, Beijing 100084, China;
    3. School of Chemical Engineering, Sichuan University, Sichuan 610065, China;
    4. Department of Bioprocess Engineering, Institute of Biotechnology, Technische Universität Berlin, Berlin 10623, Germany
  • Received:2022-11-17 Revised:2023-03-01 Online:2023-09-28 Published:2023-12-14
  • Contact: Kexin Bi,E-mail:bkx21@scu.edu.cn;Tong Qiu,E-mail:qiutong@tsinghua.edu.cn
  • Supported by:
    The research work reported here was supported by the National Key Research and Development Program of China(2021 YFB 4000500, 2021 YFB 4000501, and 2021 YFB 4000502).

摘要: Steam cracking is the dominant technology for producing light olefins, which are believed to be the foundation of the chemical industry. Predictive models of the cracking process can boost production efficiency and profit margin. Rapid advancements in machine learning research have recently enabled data-driven solutions to usher in a new era of process modeling. Meanwhile, its practical application to steam cracking is still hindered by the trade-off between prediction accuracy and computational speed. This research presents a framework for data-driven intelligent modeling of the steam cracking process. Industrial data preparation and feature engineering techniques provide computational-ready datasets for the framework, and feedstock similarities are exploited using k-means clustering. We propose LArge-Residuals-Deletion Multivariate Adaptive Regression Spline (LARD-MARS), a modeling approach that explicitly generates output formulas and eliminates potentially outlying instances. The framework is validated further by the presentation of clustering results, the explanation of variable importance, and the testing and comparison of model performance.

关键词: Mathematical modeling, Data-driven modeling, Process systems, Steam cracking, Clustering, Multivariate adaptive regression spline

Abstract: Steam cracking is the dominant technology for producing light olefins, which are believed to be the foundation of the chemical industry. Predictive models of the cracking process can boost production efficiency and profit margin. Rapid advancements in machine learning research have recently enabled data-driven solutions to usher in a new era of process modeling. Meanwhile, its practical application to steam cracking is still hindered by the trade-off between prediction accuracy and computational speed. This research presents a framework for data-driven intelligent modeling of the steam cracking process. Industrial data preparation and feature engineering techniques provide computational-ready datasets for the framework, and feedstock similarities are exploited using k-means clustering. We propose LArge-Residuals-Deletion Multivariate Adaptive Regression Spline (LARD-MARS), a modeling approach that explicitly generates output formulas and eliminates potentially outlying instances. The framework is validated further by the presentation of clustering results, the explanation of variable importance, and the testing and comparison of model performance.

Key words: Mathematical modeling, Data-driven modeling, Process systems, Steam cracking, Clustering, Multivariate adaptive regression spline