Journal of System Simulation ›› 2023, Vol. 35 ›› Issue (5): 1020-1033. doi: 10.16182/j.issn1004731x.joss.22-0047


Outlier Detection During Thermal Processes Based on Improved Gaussian Mixture Model

Zheng Wu1,2, Yue Zhang1,2, Ze Dong1,2

  1. School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China
  2. Hebei Technology Innovation Center of Simulation & Optimized Control for Power Generation, North China Electric Power University, Baoding 071003, China
  • Received: 2022-01-16 Revised: 2022-02-18 Online: 2023-05-30 Published: 2023-05-22
  • Contact: Ze Dong E-mail: wuzhengncepu@ncepu.edu.cn; dongze@ncepu.edu.cn
  • About the author: Zheng Wu (born 1996), male, Manchu, Ph.D. candidate. Research interest: modeling theory and methods for thermal power units. E-mail: wuzhengncepu@ncepu.edu.cn
  • Funding:
    Fundamental Research Funds for the Central Universities (2018QN096); Natural Science Foundation of Hebei Province (E2018502111)

Abstract:

Detecting abnormal data in thermal processes underpins system modeling, control, and optimization, and is an important part of data processing. This paper proposes an unsupervised outlier detection algorithm for thermal processes based on an improved Gaussian mixture model. Each Gaussian component captures the cluster of data belonging to one specific operating condition. Penalty constraint factors are added to correct the posterior probability density of the traditional model, penalizing false detections and missed detections, and abnormal data are identified by how their correlation with the clusters differs. Simulation results show that the model accurately locates abnormal data under a variety of error conditions and generalizes well. Compared with the traditional Gaussian mixture model, detection performance on false detections and missed detections improves overall by 37.8% and 15%, respectively, which indicates that the modification is effective.

Key words: outlier detection, Gaussian mixture model, penalty constraint, thermal processes, unsupervised
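
As a rough illustration of the traditional baseline that the abstract compares against, the sketch below fits a conventional one-dimensional Gaussian mixture by EM and flags the samples with the lowest mixture likelihood. It does not implement the paper's penalty constraint factors, whose form the abstract does not give. The function names, the `contamination` fraction, and the synthetic two-condition data are all assumptions made for illustration only.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D normal distribution with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(data, k, n_iter=100, min_var=1e-6):
    """Fit a k-component 1-D Gaussian mixture with plain EM (no penalty terms)."""
    ordered = sorted(data)
    n = len(ordered)
    # Deterministic initialisation: means at evenly spaced quantiles,
    # every variance set to the global variance, equal weights.
    means = [ordered[int((j + 0.5) * n / k)] for j in range(k)]
    mean_all = sum(data) / n
    var_all = sum((x - mean_all) ** 2 for x in data) / n
    variances = [max(var_all, min_var)] * k
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in data:
            dens = [w * gaussian_pdf(x, m, v)
                    for w, m, v in zip(weights, means, variances)]
            total = sum(dens) or 1e-300  # guard against underflow to zero
            resp.append([d / total for d in dens])
        # M-step: re-estimate weight, mean, and variance of each component.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-12:
                continue  # leave an empty component unchanged
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, min_var)
    return weights, means, variances


def log_likelihood(x, weights, means, variances):
    """Log of the mixture density at x, floored to avoid log(0)."""
    dens = sum(w * gaussian_pdf(x, m, v)
               for w, m, v in zip(weights, means, variances))
    return math.log(max(dens, 1e-300))


def detect_outliers(data, k=2, contamination=0.02):
    """Return indices of the samples with the lowest mixture log-likelihood."""
    params = fit_gmm_1d(data, k)
    scores = [log_likelihood(x, *params) for x in data]
    n_flag = max(1, int(round(contamination * len(data))))
    threshold = sorted(scores)[n_flag - 1]
    return [i for i, s in enumerate(scores) if s <= threshold]


def make_demo_data(seed=0):
    """Hypothetical signal: two operating conditions plus two injected outliers."""
    rng = random.Random(seed)
    data = [rng.gauss(300.0, 2.0) for _ in range(100)]
    data += [rng.gauss(540.0, 2.0) for _ in range(100)]
    data += [420.0, 650.0]  # injected abnormal points at indices 200 and 201
    return data
```

Calling `detect_outliers(make_demo_data(), k=2, contamination=0.01)` ranks the samples by likelihood and returns the indices of the lowest-scoring 1%. A plain likelihood threshold like this is exactly the kind of rule that can produce false and missed detections when the outliers themselves inflate a component's variance, which is the weakness the paper's penalty factors are described as targeting.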

CLC Number: