Some theoretical essays on functional data classification

Functional data analysis is a fast-growing research area in statistics, dealing with statistical analysis of infinite-dimensional (functional) data. For many pattern recognition problems with finite-dimensional data there usually exists a solid theoretical foundation, for example, it is known under...

Full description

Bibliographic Details
Main Author: Kazakeviciute, Agne
Other Authors: Xue, Jinghao ; Olivo, Malini
Published: University College London (University of London) 2017
Online Access:https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.746749
id ndltd-bl.uk-oai-ethos.bl.uk-746749
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-7467492019-01-08T03:19:36ZSome theoretical essays on functional data classificationKazakeviciute, AgneXue, Jinghao ; Olivo, Malini2017Functional data analysis is a fast-growing research area in statistics, dealing with statistical analysis of infinite-dimensional (functional) data. For many pattern recognition problems with finite-dimensional data there usually exists a solid theoretical foundation, for example, it is known under which assumptions various classifiers have desirable theoretical properties, such as consistency. Therefore, a natural interest is to extend the theory to the setting of infinite-dimensional data. The thesis is written in two directions: one is when we observe full curves, and the other is when we observe sparse and irregular curves. In the first direction, the main goal is to give a justification for a logistic classifier, where only the projection of the parameter function on some subspace is estimated via maximum quasi-likelihood and the rest of its coordinates are set to zero. This is preceded with studying the problem of detecting sample point separation in logistic regression–the case in which the maximum quasi-likelihood estimate of the model parameter does not exist or is not unique. In the other direction, a problem of extending sparsely and irregularly sampled functional data to full curves is considered so that potentially the theory from the first research direction could be applied in the future. There are several contributions of this thesis. First, it is proved that the separating hyperplane can be found from a finite set of candidates, and an upper bound of the probability of point separation is given. Second, the assumptions under which the logistic classifier is consistent are established, although simulation studies reveal that some assumptions are not necessary and may be relaxed. Thirdly, the thesis proposes a collaborative curve extension method, which is proven to be consistent under certain assumptions.University College London (University of London)https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.746749http://discovery.ucl.ac.uk/1570359/Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
description Functional data analysis is a fast-growing research area in statistics, dealing with statistical analysis of infinite-dimensional (functional) data. For many pattern recognition problems with finite-dimensional data there usually exists a solid theoretical foundation, for example, it is known under which assumptions various classifiers have desirable theoretical properties, such as consistency. Therefore, a natural interest is to extend the theory to the setting of infinite-dimensional data. The thesis is written in two directions: one is when we observe full curves, and the other is when we observe sparse and irregular curves. In the first direction, the main goal is to give a justification for a logistic classifier, where only the projection of the parameter function on some subspace is estimated via maximum quasi-likelihood and the rest of its coordinates are set to zero. This is preceded with studying the problem of detecting sample point separation in logistic regression–the case in which the maximum quasi-likelihood estimate of the model parameter does not exist or is not unique. In the other direction, a problem of extending sparsely and irregularly sampled functional data to full curves is considered so that potentially the theory from the first research direction could be applied in the future. There are several contributions of this thesis. First, it is proved that the separating hyperplane can be found from a finite set of candidates, and an upper bound of the probability of point separation is given. Second, the assumptions under which the logistic classifier is consistent are established, although simulation studies reveal that some assumptions are not necessary and may be relaxed. Thirdly, the thesis proposes a collaborative curve extension method, which is proven to be consistent under certain assumptions.
author2 Xue, Jinghao ; Olivo, Malini
author_facet Xue, Jinghao ; Olivo, Malini
Kazakeviciute, Agne
author Kazakeviciute, Agne
spellingShingle Kazakeviciute, Agne
Some theoretical essays on functional data classification
author_sort Kazakeviciute, Agne
title Some theoretical essays on functional data classification
title_short Some theoretical essays on functional data classification
title_full Some theoretical essays on functional data classification
title_fullStr Some theoretical essays on functional data classification
title_full_unstemmed Some theoretical essays on functional data classification
title_sort some theoretical essays on functional data classification
publisher University College London (University of London)
publishDate 2017
url https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.746749
work_keys_str_mv AT kazakeviciuteagne sometheoreticalessaysonfunctionaldataclassification
_version_ 1718807249944576000