Journal of System Simulation ›› 2018, Vol. 30 ›› Issue (4): 1473-1481.doi: 10.16182/j.issn1004731x.joss.201804032

Previous Articles     Next Articles

Parallel Pattern Recognition of Leak Current Data Using Spark-KNN

Li Li, Zhu Yongli, Song Yaqi   

  1. School of Control and Computer Engineering, North China Electric Power University, Baoding 071003, China
  • Received:2016-05-11 Revised:2016-07-15 Online:2018-04-08 Published:2019-01-04

Abstract: With the rapid development of smart grid, the status monitoring data of power grid equipment increase exponentially and gradually form the big data. Traditional computing architectures are no longer to meet the demand of computing performance. This paper explores how Spark and Cloud computing can accelerate performance of missive insulator leak current data pattern recognition. The Parallel KNN (k-Nearest Neighbor) algorithm is designed and implemented by using Spark and Aliyun E-MapReduce cloud computing platform. The results from experiments show that the performance of Spark-KNN is 2.97 times of MapReduce-KNN and gains acceleration of 8.8 times. The experimental results confirm that Spark is more suitable for real time data processing tasks than MapReduce.

Key words: power grid equipment, online monitoring, big data, Spark

CLC Number: