International Journal of Biomedical Data Mining

Open Access

ISSN: 2090-4924

Abstract

Implementation of Decision Tree Using Hadoop MapReduce

Tianyi Yang and Anne Hee Hiong Ngu

Hadoop is one of the most popular general-purpose computing platforms for the distributed processing of big data. HDFS is Hadoop's implementation of a distributed file system, designed to store huge amounts of data reliably while serving that data to Hadoop's processing components. MapReduce is the main processing engine of Hadoop. In this study, we used HDFS and MapReduce to implement a well-known learning algorithm, the decision tree, in a fashion that scales to large input problem sizes. Computational performance is evaluated as a function of node count and problem size.
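
The abstract does not describe how the tree construction is split into map and reduce steps. As a rough illustration only, the sketch below assumes one common decomposition: the map step emits a count of 1 for each (attribute index, attribute value, class label) triple, the reduce step sums those counts, and the best split attribute is then chosen by lowest weighted entropy. It runs in a single Python process rather than on Hadoop, and every function name and the toy data set are hypothetical, not taken from the paper.

```python
import math
from collections import defaultdict


def map_record(record):
    """Map step (assumed): emit ((attribute index, value, label), 1) per attribute."""
    *features, label = record
    for index, value in enumerate(features):
        yield (index, value, label), 1


def reduce_counts(pairs):
    """Reduce step (assumed): sum the emitted counts for each key."""
    totals = defaultdict(int)
    for key, count in pairs:
        totals[key] += count
    return dict(totals)


def entropy(class_counts):
    """Shannon entropy (base 2) of a list of class counts."""
    total = sum(class_counts)
    return -sum((c / total) * math.log2(c / total) for c in class_counts if c)


def best_split(counts):
    """Return the attribute index with the lowest weighted entropy after splitting."""
    by_attr = defaultdict(lambda: defaultdict(lambda: defaultdict(int)))
    for (attr, value, label), n in counts.items():
        by_attr[attr][value][label] += n
    scores = {}
    for attr, values in by_attr.items():
        total = sum(sum(labels.values()) for labels in values.values())
        scores[attr] = sum(
            sum(labels.values()) / total * entropy(list(labels.values()))
            for labels in values.values()
        )
    return min(scores, key=scores.get)


# Hypothetical toy data: (attribute 0, attribute 1, class label).
records = [
    ("sunny", "hot", "no"),
    ("sunny", "mild", "no"),
    ("rainy", "hot", "yes"),
    ("rainy", "mild", "yes"),
]

# Simulate the shuffle by collecting all mapper output before reducing.
pairs = [kv for record in records for kv in map_record(record)]
counts = reduce_counts(pairs)
best_attribute = best_split(counts)
```

In this toy data set attribute 0 separates the two classes perfectly, so it is the one selected. On a real cluster the same counting logic would live in Mapper and Reducer classes reading from HDFS, and the job would be repeated for each level of the tree.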
