Principal points, principal curves and principal surfaces

The idea of approximating a distribution is a prominent problem in statistics. This dissertation explores the theory of principal points and principal curves as approximation methods to a distribution. Principal points of a distribution have been initially introduced by Flury (1990) who tackled the...

Full description

Bibliographic Details
Main Author: Ganey, Raeesa
Other Authors: Lubbe, Sugnet
Format: Dissertation
Language:English
Published: University of Cape Town 2015
Subjects:
Online Access:http://hdl.handle.net/11427/15515
id ndltd-netd.ac.za-oai-union.ndltd.org-uct-oai-localhost-11427-15515
record_format oai_dc
spelling ndltd-netd.ac.za-oai-union.ndltd.org-uct-oai-localhost-11427-155152020-10-06T05:11:43Z Principal points, principal curves and principal surfaces Ganey, Raeesa Lubbe, Sugnet Statistical Sciences Principal points k-means algorithm computational methods machine learning The idea of approximating a distribution is a prominent problem in statistics. This dissertation explores the theory of principal points and principal curves as approximation methods to a distribution. Principal points of a distribution have been initially introduced by Flury (1990) who tackled the problem of optimal grouping in multivariate data. In essence, principal points are the theoretical counterparts of cluster means obtained by the k-means algorithm. Principal curves defined by Hastie (1984), are smooth one-dimensional curves that pass through the middle of a p-dimensional data set, providing a nonlinear summary of the data. In this dissertation, details on the usefulness of principal points and principal curves are reviewed. The application of principal points and principal curves are then extended beyond its original purpose to well-known computational methods like Support Vector Machines in machine learning. 2015-12-02T12:04:56Z 2015-12-02T12:04:56Z 2015 Master Thesis Masters MSc http://hdl.handle.net/11427/15515 eng application/pdf University of Cape Town Faculty of Science Department of Statistical Sciences
collection NDLTD
language English
format Dissertation
sources NDLTD
topic Statistical Sciences
Principal points
k-means algorithm
computational methods
machine learning
spellingShingle Statistical Sciences
Principal points
k-means algorithm
computational methods
machine learning
Ganey, Raeesa
Principal points, principal curves and principal surfaces
description The idea of approximating a distribution is a prominent problem in statistics. This dissertation explores the theory of principal points and principal curves as approximation methods to a distribution. Principal points of a distribution have been initially introduced by Flury (1990) who tackled the problem of optimal grouping in multivariate data. In essence, principal points are the theoretical counterparts of cluster means obtained by the k-means algorithm. Principal curves defined by Hastie (1984), are smooth one-dimensional curves that pass through the middle of a p-dimensional data set, providing a nonlinear summary of the data. In this dissertation, details on the usefulness of principal points and principal curves are reviewed. The application of principal points and principal curves are then extended beyond its original purpose to well-known computational methods like Support Vector Machines in machine learning.
author2 Lubbe, Sugnet
author_facet Lubbe, Sugnet
Ganey, Raeesa
author Ganey, Raeesa
author_sort Ganey, Raeesa
title Principal points, principal curves and principal surfaces
title_short Principal points, principal curves and principal surfaces
title_full Principal points, principal curves and principal surfaces
title_fullStr Principal points, principal curves and principal surfaces
title_full_unstemmed Principal points, principal curves and principal surfaces
title_sort principal points, principal curves and principal surfaces
publisher University of Cape Town
publishDate 2015
url http://hdl.handle.net/11427/15515
work_keys_str_mv AT ganeyraeesa principalpointsprincipalcurvesandprincipalsurfaces
_version_ 1719350372726734848