Generalizing Local Density for Density-Based Clustering

Discovering densely-populated regions in a dataset of data points is an essential task for density-based clustering. To do so, it is often necessary to calculate each data point’s local density in the dataset. Various definitions for the local density have been proposed in the literature. These defi...

Full description

Bibliographic Details
Main Author: Jun-Lin Lin
Format: Article
Language:English
Published: MDPI AG 2021-01-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/13/2/185
Description
Summary:Discovering densely-populated regions in a dataset of data points is an essential task for density-based clustering. To do so, it is often necessary to calculate each data point’s local density in the dataset. Various definitions for the local density have been proposed in the literature. These definitions can be divided into two categories: Radius-based and <i>k</i> Nearest Neighbors-based. In this study, we find the commonality between these two types of definitions and propose a canonical form for the local density. With the canonical form, the pros and cons of the existing definitions can be better explored, and new definitions for the local density can be derived and investigated.
ISSN:2073-8994