MATEC Web of Conferences (Jan 2018)

A kind of entity recognition algorithm based on Hadoop for power big data

  • Qi Jun,
  • Ge Weichun,
  • Li Zhao,
  • Li Wei,
  • Zhang Hongyu,
  • Zhao Jinghong,
  • Jin Chengming,
  • Yu Liangliang,
  • Chen Shuo,
  • Liu Biqi,
  • Yang Mingyu

DOI
https://doi.org/10.1051/matecconf/201818903005
Journal volume & issue
Vol. 189
p. 03005

Abstract

Read online

With the coming of the era of big data, traditional entity recognition technologies have been unable to effectively finish data preprocessing due to large scale of power grid data and complex volume type features. The rising of Hadoop technologies in these years can deal with big data processings better. Therefore, this paper proposes a power big data entity recognition algorithm based on Hadoop. It applies the discretization algorithm to select higher information accuracy discrete points and put forward a discretization evaluation indicator. In the end, we finish entity recognition of the monitoring data of wind turbines on Hadoop platform.Experimental results show that the proposed algorithm performs well in terms of correctness and breakpoint number experiments and it has a good speed-up ratio. The proposed algorithm can apply to power large data entity recognition processing.