Tushuguanxue yu Zixun Kexue (Oct 2016)

Distributed Framework of Artificial Neural Network for Big Data Analysis

  • 何善豪 Shan Hao Ho,
  • 張景堯 Jiing-Yao Chang,
  • 劉文卿 Wen-Ching Liou

Journal volume & issue
Vol. 42, no. 2
pp. 45 – 64

Abstract

Read online

In this research, we introduce a distributed framework of artificial neural network (ANN) to deal with the big data real‐time analysis and return proper outcomes in very short delay. The result of our experiment shows that training the distributed ANN model could be converged in 17 seconds on 24‐core clustering platform and learns that multi‐model with stratification strategy would obtain most true positive predictions with nearly 70% precision at voting threshold value equal to 0.7.In our system, ANNs are used in the data mining process for identifying patterns in financial time series. We implement a framework for training ANNs on a distributed computing platform. We adopt Apache Spark to build the base computing cluster because it is capable of high performance in‐memory computing. We investigate a number of distributed back propagation algorithms and techniques, especially ones for time series prediction, and incorporate them into our framework with some modifications. With various options for the details, we provide the user with flexibility in neural network modeling.

Keywords