Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?

Studies of learning algorithms typically concentrate on situations where potentially ever growing training sample is available. Yet, there can be situations (e.g., detection of differentially expressed genes on unreplicated data or estimation of time delay in non-stationary gravitationally lensed ph...

Full description

Bibliographic Details
Main Author: Peter Tiňo
Format: Article
Language:English
Published: MDPI AG 2013-04-01
Series:Entropy
Subjects:
Online Access:http://www.mdpi.com/1099-4300/15/4/1202
id doaj-8f6adec22d094096938a576768d36ecc
record_format Article
spelling doaj-8f6adec22d094096938a576768d36ecc2020-11-24T22:12:48ZengMDPI AGEntropy1099-43002013-04-011541202122010.3390/e15041202Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?Peter TiňoStudies of learning algorithms typically concentrate on situations where potentially ever growing training sample is available. Yet, there can be situations (e.g., detection of differentially expressed genes on unreplicated data or estimation of time delay in non-stationary gravitationally lensed photon streams) where only extremely small samples can be used in order to perform an inference. On unreplicated data, the inference has to be performed on the smallest sample possible—sample of size 1. We study whether anything useful can be learnt in such extreme situations by concentrating on a Bayesian approach that can account for possible prior information on expected counts. We perform a detailed information theoretic study of such Bayesian estimation and quantify the effect of Bayesian averaging on its first two moments. Finally, to analyze potential benefits of the Bayesian approach, we also consider Maximum Likelihood (ML) estimation as a baseline approach. We show both theoretically and empirically that the Bayesian model averaging can be potentially beneficial.http://www.mdpi.com/1099-4300/15/4/1202Poisson distributionunreplicated dataBayesian learningexpected Kullback–Leibler divergence
collection DOAJ
language English
format Article
sources DOAJ
author Peter Tiňo
spellingShingle Peter Tiňo
Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?
Entropy
Poisson distribution
unreplicated data
Bayesian learning
expected Kullback–Leibler divergence
author_facet Peter Tiňo
author_sort Peter Tiňo
title Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?
title_short Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?
title_full Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?
title_fullStr Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?
title_full_unstemmed Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?
title_sort pushing for the extreme: estimation of poisson distribution from low count unreplicated data—how close can we get?
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2013-04-01
description Studies of learning algorithms typically concentrate on situations where potentially ever growing training sample is available. Yet, there can be situations (e.g., detection of differentially expressed genes on unreplicated data or estimation of time delay in non-stationary gravitationally lensed photon streams) where only extremely small samples can be used in order to perform an inference. On unreplicated data, the inference has to be performed on the smallest sample possible—sample of size 1. We study whether anything useful can be learnt in such extreme situations by concentrating on a Bayesian approach that can account for possible prior information on expected counts. We perform a detailed information theoretic study of such Bayesian estimation and quantify the effect of Bayesian averaging on its first two moments. Finally, to analyze potential benefits of the Bayesian approach, we also consider Maximum Likelihood (ML) estimation as a baseline approach. We show both theoretically and empirically that the Bayesian model averaging can be potentially beneficial.
topic Poisson distribution
unreplicated data
Bayesian learning
expected Kullback–Leibler divergence
url http://www.mdpi.com/1099-4300/15/4/1202
work_keys_str_mv AT petertino pushingfortheextremeestimationofpoissondistributionfromlowcountunreplicateddatahowclosecanweget
_version_ 1725802308709646336