The Application of high-dimensional Data Classification by Random Forest based on Hadoop Cloud Computing Platform

High-dimensional data carries a number of uncertain factors, such as sparse features, repeated features and high computational complexity. The random forest algorithm is an ensemble classification method composed of numerous weak classifiers. It can overcome a number of practical problems, such as the sm...

Bibliographic Details
Main Author: C. Li
Format: Article
Language: English
Published: AIDIC Servizi S.r.l. 2016-08-01
Series: Chemical Engineering Transactions
Online Access: https://www.cetjournal.it/index.php/cet/article/view/3929