
Scalable Neural Network Using Hadoop and MapReduce

Chit Thu Shine

Abstract


With the advent of digital technology, a large amount of digital data is generated every day, and the sheer volume of that data is a challenge in itself. As data sizes grow, effective analysis with traditional techniques becomes difficult. The Artificial Neural Network (ANN) is a widely used algorithm in classification, pattern recognition, and prediction. However, ANNs are notably slow to compute, especially when the data is large. To fulfill the potential of ANNs for big data applications, the computation must be sped up. The objective of this paper is to classify big data using a Back-propagation Neural Network with the MapReduce paradigm, which has become a major computing model for data-intensive applications. Performance is measured by execution time on two different datasets. The results show that the proposed algorithm runs faster than a single-node Java implementation that does not use Hadoop and MapReduce.
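The abstract does not say how the back-propagation work is divided between map and reduce tasks. One common arrangement, assumed here purely for illustration, is batch training in which each map task back-propagates over its own shard of the data and a reduce task sums the partial gradients before a single weight update. The sketch below simulates that arrangement in plain Python without Hadoop; the names `map_gradients`, `reduce_gradients`, and `train_step`, the one-hidden-layer sigmoid network, and the squared-error loss are all hypothetical choices, not details taken from the paper.

```python
import math
import random
from functools import reduce


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def init_weights(n_in, n_hidden, seed=0):
    """Small random weights for a one-hidden-layer network (assumed architecture)."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
    return w1, w2


def forward(w1, w2, x):
    """Return hidden activations and the scalar output for one input vector."""
    h = [sigmoid(sum(w * xi for w, xi in zip(row, x))) for row in w1]
    y = sigmoid(sum(w * hi for w, hi in zip(w2, h)))
    return h, y


def map_gradients(shard, w1, w2):
    """Map step: back-propagate over one shard and emit summed gradients."""
    g1 = [[0.0] * len(row) for row in w1]
    g2 = [0.0] * len(w2)
    for x, t in shard:
        h, y = forward(w1, w2, x)
        # Output delta for loss 0.5 * (y - t)^2 with a sigmoid output unit.
        d_out = (y - t) * y * (1.0 - y)
        for j, hj in enumerate(h):
            g2[j] += d_out * hj
            # Hidden delta: output delta propagated back through w2[j].
            d_hid = d_out * w2[j] * hj * (1.0 - hj)
            for i, xi in enumerate(x):
                g1[j][i] += d_hid * xi
    return g1, g2, len(shard)


def reduce_gradients(a, b):
    """Reduce step: add two partial results (gradients and sample counts)."""
    g1 = [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(a[0], b[0])]
    g2 = [p + q for p, q in zip(a[1], b[1])]
    return g1, g2, a[2] + b[2]


def train_step(shards, w1, w2, lr=0.5):
    """One batch update: map over shards, reduce, then apply the mean gradient."""
    g1, g2, n = reduce(reduce_gradients, (map_gradients(s, w1, w2) for s in shards))
    new_w1 = [[w - lr * g / n for w, g in zip(rw, rg)] for rw, rg in zip(w1, g1)]
    new_w2 = [w - lr * g / n for w, g in zip(w2, g2)]
    return new_w1, new_w2
```

Because the batch gradient is a sum over samples, the reduced result matches what a single node would compute over the whole dataset, up to floating-point rounding. That is why this kind of split can shorten execution time without changing the update itself. On a real cluster, each call to `map_gradients` would run as a separate map task over a block of the input stored in HDFS.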




