Research on parallel data processing of data mining platform in the background of cloud computing

Efficient processing of large-scale data has significant practical value. In this study, a data mining platform based on the Hadoop Distributed File System was designed, and the K-means algorithm was then improved using the max-min distance idea. On the Hadoop Distributed File System platform, the para...

Bibliographic Details
Main Authors: Bu Lingrui, Zhang Hui, Xing Haiyan, Wu Lijun
Format: Article
Language: English
Published: De Gruyter 2021-02-01
Series: Journal of Intelligent Systems
Subjects:
Online Access: https://doi.org/10.1515/jisys-2020-0113