系统仿真学报 ›› 2017, Vol. 29 ›› Issue (12): 3035-3041.doi: 10.16182/j.issn1004731x.joss.201712014

• 仿真应用工程 • 上一篇    下一篇

分布式聚类在农场环境数据异常检测中的应用

邓丽1,2, 庞洪霖1,2, 王灵1,2, 费敏锐1,2   

  1. 1.上海大学机电工程与自动化学院,上海 200072;
    2.上海市电站自动化技术重点实验室,上海 200072
  • 收稿日期:2015-10-16 发布日期:2020-06-06
  • 作者简介:邓丽(1978-),女,安徽,博士后,副教授,研究方向为机器学习、智能优化算法等;庞洪霖(1991-),男,江苏,硕士生,研究方向为机器学习与分布式计算。
  • 基金资助:
    上海市科委重点项目(14DZ1206302)

Application of Distributed Clustering in Anomaly Detection of Farm Environment Data

Deng Li1,2, Pang Honglin1,2, Ling Wang1,2, Minrui Fei1,2   

  1. 1. School of Mechatronics Engineering and Automation, Shanghai University, Shanghai 200072, China;
    2. Shanghai Key Laboratory of Power Station Automation Technology, Shanghai 200072, China
  • Received:2015-10-16 Published:2020-06-06

摘要: 为了处理大量分布式存储的农场环境数据,为作物增产提供异常环境参考并制定预防策略,本文结合农场环境数据的特点,在Hadoop平台中实现了对农场环境数据的Dirichlet过程混合模型聚类,并提出了基于聚类分析的农场环境异常检测方法。在MapReduce框架下,Map阶段完成样本点到模型的分配;Reduce阶段对模型与类簇个数进行更新。通过实验验证了分布式Dirichlet聚类的性能,分析结果表明该方法可以应用于大量农场环境数据的异常检测。

关键词: Dirichlet过程混合模型, 分布式聚类, 农场环境数据, 异常检测

Abstract: The massive farm environment data stored in the distributed system should be dealt with so as to provide abnormal environment reference and make preventive strategies for crop yield. Considering the characteristics of the farm environment data, the Dirichlet Process Mixture Model (DPMM) clustering is implemented with the farm environment data on Hadoop and the anomaly detection method of the farm environment is proposed based on clustering analysis. Under the framework of MapReduce, Map stage implements the distribution of the sample points to the models; Reduce stage completes the update of models and the number of clusters. The performance has been verified by experiments. The results of clustering and the index of suitable environment for tomato are compared to implement the anomaly detection. The analysis results show that the method can be applied to anomaly detection of large number of farm environment data.

Key words: DPMM, distributed clustering, farm environment data, anomaly detection

中图分类号: