Head-to-head comparison of clustering methods for heterogeneous data: a simulation-driven benchmark

Abstract The choice of the most appropriate unsupervised machine-learning method for “heterogeneous” or “mixed” data, i.e. with both continuous and categorical variables, can be challenging. Our aim was to examine the performance of various clustering strategies for mixed data using both simulated a...

Full description

Bibliographic Details
Main Authors: Gregoire Preud’homme, Kevin Duarte, Kevin Dalleau, Claire Lacomblez, Emmanuel Bresso, Malika Smaïl-Tabbone, Miguel Couceiro, Marie-Dominique Devignes, Masatake Kobayashi, Olivier Huttin, João Pedro Ferreira, Faiez Zannad, Patrick Rossignol, Nicolas Girerd
Format: Article
Language:English
Published: Nature Publishing Group 2021-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-83340-8