A graphical heuristic for reduction and partitioning of large datasets for scalable supervised training

Abstract A scalable graphical method is presented for selecting and partitioning datasets for the training phase of a classification task. For the heuristic, a clustering algorithm is required to get its computation cost in a reasonable proportion to the task itself. This step is succeeded by constr...

Full description

Bibliographic Details
Main Authors: Sumedh Yadav, Mathis Bode
Format: Article
Language:English
Published: SpringerOpen 2019-10-01
Series:Journal of Big Data
Subjects:
Online Access:http://link.springer.com/article/10.1186/s40537-019-0259-3