SCI和EI收录∣中国化工学会会刊

中国化学工程学报 ›› 2024, Vol. 70 ›› Issue (6): 104-117.DOI: 10.1016/j.cjche.2024.02.005

• • 上一篇    下一篇

Hierarchical multihead self-attention for time-series-based fault diagnosis

Chengtian Wang, Hongbo Shi, Bing Song, Yang Tao   

  1. Key Laboratory of Smart Manufacturing in Energy Chemical Process of the Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
  • 收稿日期:2023-10-30 修回日期:2024-02-26 出版日期:2024-06-28 发布日期:2024-08-05
  • 通讯作者: Hongbo Shi,E-mail:hbshi@ecust.edu.cn;Bing Song,E-mail:songbing@ecust.edu.cn
  • 基金资助:
    This work is supported by the National Natural Science Foundation of China (62073140, 62073141) and the Shanghai Rising-Star Program (21QA1401800).

Hierarchical multihead self-attention for time-series-based fault diagnosis

Chengtian Wang, Hongbo Shi, Bing Song, Yang Tao   

  1. Key Laboratory of Smart Manufacturing in Energy Chemical Process of the Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
  • Received:2023-10-30 Revised:2024-02-26 Online:2024-06-28 Published:2024-08-05
  • Contact: Hongbo Shi,E-mail:hbshi@ecust.edu.cn;Bing Song,E-mail:songbing@ecust.edu.cn
  • Supported by:
    This work is supported by the National Natural Science Foundation of China (62073140, 62073141) and the Shanghai Rising-Star Program (21QA1401800).

摘要: Fault diagnosis is important for maintaining the safety and effectiveness of chemical process. Considering the multivariate, nonlinear, and dynamic characteristic of chemical process, many time-series-based data-driven fault diagnosis methods have been developed in recent years. However, the existing methods have the problem of long-term dependency and are difficult to train due to the sequential way of training. To overcome these problems, a novel fault diagnosis method based on time-series and the hierarchical multihead self-attention (HMSAN) is proposed for chemical process. First, a sliding window strategy is adopted to construct the normalized time-series dataset. Second, the HMSAN is developed to extract the time-relevant features from the time-series process data. It improves the basic self-attention model in both width and depth. With the multihead structure, the HMSAN can pay attention to different aspects of the complicated chemical process and obtain the global dynamic features. However, the multiple heads in parallel lead to redundant information, which cannot improve the diagnosis performance. With the hierarchical structure, the redundant information is reduced and the deep local time-related features are further extracted. Besides, a novel many-to-one training strategy is introduced for HMSAN to simplify the training procedure and capture the long-term dependency. Finally, the effectiveness of the proposed method is demonstrated by two chemical cases. The experimental results show that the proposed method achieves a great performance on time-series industrial data and outperforms the state-of-the-art approaches.

关键词: Self-attention mechanism, Deep learning, Chemical process, Time-series, Fault diagnosis

Abstract: Fault diagnosis is important for maintaining the safety and effectiveness of chemical process. Considering the multivariate, nonlinear, and dynamic characteristic of chemical process, many time-series-based data-driven fault diagnosis methods have been developed in recent years. However, the existing methods have the problem of long-term dependency and are difficult to train due to the sequential way of training. To overcome these problems, a novel fault diagnosis method based on time-series and the hierarchical multihead self-attention (HMSAN) is proposed for chemical process. First, a sliding window strategy is adopted to construct the normalized time-series dataset. Second, the HMSAN is developed to extract the time-relevant features from the time-series process data. It improves the basic self-attention model in both width and depth. With the multihead structure, the HMSAN can pay attention to different aspects of the complicated chemical process and obtain the global dynamic features. However, the multiple heads in parallel lead to redundant information, which cannot improve the diagnosis performance. With the hierarchical structure, the redundant information is reduced and the deep local time-related features are further extracted. Besides, a novel many-to-one training strategy is introduced for HMSAN to simplify the training procedure and capture the long-term dependency. Finally, the effectiveness of the proposed method is demonstrated by two chemical cases. The experimental results show that the proposed method achieves a great performance on time-series industrial data and outperforms the state-of-the-art approaches.

Key words: Self-attention mechanism, Deep learning, Chemical process, Time-series, Fault diagnosis