The Application of high-dimensional Data Classification by Random Forest based on Hadoop Cloud Computing Platform
The high-dimensional data has a number of uncertain factors, such as sparse features, repeated features and computational complexity. The random forest algorithm is a ensemble classifier method, and composed of numerous weak classifiers. It can overcome a number of practical problems, such as the sm...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
AIDIC Servizi S.r.l.
2016-08-01
|
Series: | Chemical Engineering Transactions |
Online Access: | https://www.cetjournal.it/index.php/cet/article/view/3929 |