Data Mining Algorithm for Cloud Network Information Based on Artificial Intelligence Decision Mechanism

Yuan Huang; Zhe Cheng; Qianyu Zhou; Yuxing Xiang; Ruixiao Zhao

doi:10.1109/ACCESS.2020.2981632

IEEE Access (Jan 2020)

Data Mining Algorithm for Cloud Network Information Based on Artificial Intelligence Decision Mechanism

Yuan Huang,
Zhe Cheng,
Qianyu Zhou,
Yuxing Xiang,
Ruixiao Zhao

Affiliations

Yuan Huang: ORCiD; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
Zhe Cheng: ORCiD; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
Qianyu Zhou: ORCiD; School of Earth Science and Engineering, Hebei University of Engineering, Handan, China
Yuxing Xiang: ORCiD; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
Ruixiao Zhao: ORCiD; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China

DOI: https://doi.org/10.1109/ACCESS.2020.2981632
Journal volume & issue: Vol. 8
pp. 53394 – 53407

Abstract

Read online

Due to the rapid development of information technology and network technology, there is a lot of data, but the phenomenon of lack of knowledge is becoming more and more serious. Data mining technology has developed vigorously in this environment, and it has shown more and more vitality. Based on Spark programming model, this paper designs the parallel extension of fuzzy c-means. In order to enhance the performance of fuzzy c-means parallel expansion, the improvement strategy of k-means during the initialization phase is borrowed, and k-means// is extended to fuzzy c-means to obtain better clustering performance. Combined with Spark's programming model, this paper can obtain extended parallel fuzzy c-means algorithm. Several experiments on the data set of the algorithm proposed in this paper have shown good scalability and parallelism, effectively expanding fuzzy c-means clustering to distributed applications, greatly increasing the scale of the data processed by the algorithm. This improves the robustness of the algorithm and the adaptability of the algorithm to the shape and structure of the data, so that the parallel and scalable clustering algorithm can more effectively perform cluster analysis on big data. Three algorithms were simulated on MATLAB platform. We use simple data sets and complex two-dimensional data sets, and compare with the traditional fuzzy c-means algorithm and fuzzy c-means algorithm based on fuzzy entropy. Experiments show that the scalable parallel fuzzy c-means algorithm not only greatly improves the anti-noise performance, but also improves the convergence speed, and it can automatically determine the optimal number of clusters.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords