Comparative Study of Classification Techniques For Large Scale Data - Case Study

Nigar M.Shafiq Surameery; Dana Lattef Hussein

doi:10.24017/science.2017.3.2

Kurdistan Journal of Applied Research (Aug 2017)

Affiliations

Nigar M.Shafiq Surameery: Building and Construction Engineering Dept, College of Engineering, University of Garmian, Kalar, Sulaimani, Iraq
Dana Lattef Hussein: Database Dept, Computer Science Institute, Sulaimani Polytechnic University, Sulaimani, Iraq

DOI: https://doi.org/10.24017/science.2017.3.2
Journal volume & issue: Vol. 2, no. 3
pp. 56 – 61

Abstract

The massive datasets generated in many applications present both opportunities and challenges. In particular, scalable mining of such large-scale datasets is a challenging problem that has attracted recent research. This study analyses classification techniques using the WEKA machine learning workbench on a large-scale dataset from the protein structure prediction field. The dataset had already been partitioned into training and test sets following the ten-fold cross-validation methodology. Nine different methods were tested. The results show that it is not practical to test more than one classifier from the tree family in the same experiment. In addition, using the NaiveBayes classifier with the default properties of the attribute selection filter is very time-consuming. Finally, varying the parameters of attribute selection should be prioritized to obtain more accurate results.
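
As an illustration of the ten-fold cross-validation partitioning mentioned in the abstract, the following is a minimal sketch in plain Python. It is not the authors' procedure and not WEKA's implementation; the function name `ten_fold_splits`, the seed, and the instance count of 25 are hypothetical choices made only for this example.

```python
import random


def ten_fold_splits(n_instances, k=10, seed=0):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation.

    Hypothetical helper for illustration; the names and defaults here are
    assumptions, not taken from the paper or from WEKA.
    """
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at offset i,
    # so fold sizes differ by at most one instance.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        # Training set is the union of all folds except the held-out one.
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# Example with a small, made-up number of instances.
splits = list(ten_fold_splits(25))
```

Each instance appears in exactly one test fold, and each classifier would be trained on the remaining nine folds before being evaluated on the held-out one.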

Keywords: classification techniques, WEKA, data mining, bioinformatics, knowledge discovery, large-scale data

Published in Kurdistan Journal of Applied Research

ISSN: 2411-7684 (Print); 2411-7706 (Online)
Publisher: Sulaimani Polytechnic University
Country of publisher: Iraq
LCC subjects: Technology: Technology (General); Science
Website: https://www.kjar.spu.edu.iq/index.php/kjar/index
