Journal of System Simulation ›› 2017, Vol. 29 ›› Issue (12): 3035-3041.doi: 10.16182/j.issn1004731x.joss.201712014

Previous Articles     Next Articles

Application of Distributed Clustering in Anomaly Detection of Farm Environment Data

Deng Li1,2, Pang Honglin1,2, Ling Wang1,2, Minrui Fei1,2   

  1. 1. School of Mechatronics Engineering and Automation, Shanghai University, Shanghai 200072, China;
    2. Shanghai Key Laboratory of Power Station Automation Technology, Shanghai 200072, China
  • Received:2015-10-16 Published:2020-06-06

Abstract: The massive farm environment data stored in the distributed system should be dealt with so as to provide abnormal environment reference and make preventive strategies for crop yield. Considering the characteristics of the farm environment data, the Dirichlet Process Mixture Model (DPMM) clustering is implemented with the farm environment data on Hadoop and the anomaly detection method of the farm environment is proposed based on clustering analysis. Under the framework of MapReduce, Map stage implements the distribution of the sample points to the models; Reduce stage completes the update of models and the number of clusters. The performance has been verified by experiments. The results of clustering and the index of suitable environment for tomato are compared to implement the anomaly detection. The analysis results show that the method can be applied to anomaly detection of large number of farm environment data.

Key words: DPMM, distributed clustering, farm environment data, anomaly detection

CLC Number: