Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?
Studies of learning algorithms typically concentrate on situations where potentially ever growing training sample is available. Yet, there can be situations (e.g., detection of differentially expressed genes on unreplicated data or estimation of time delay in non-stationary gravitationally lensed ph...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2013-04-01
|
Series: | Entropy |
Subjects: | |
Online Access: | http://www.mdpi.com/1099-4300/15/4/1202 |
id |
doaj-8f6adec22d094096938a576768d36ecc |
---|---|
record_format |
Article |
spelling |
doaj-8f6adec22d094096938a576768d36ecc2020-11-24T22:12:48ZengMDPI AGEntropy1099-43002013-04-011541202122010.3390/e15041202Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get?Peter TiňoStudies of learning algorithms typically concentrate on situations where potentially ever growing training sample is available. Yet, there can be situations (e.g., detection of differentially expressed genes on unreplicated data or estimation of time delay in non-stationary gravitationally lensed photon streams) where only extremely small samples can be used in order to perform an inference. On unreplicated data, the inference has to be performed on the smallest sample possible—sample of size 1. We study whether anything useful can be learnt in such extreme situations by concentrating on a Bayesian approach that can account for possible prior information on expected counts. We perform a detailed information theoretic study of such Bayesian estimation and quantify the effect of Bayesian averaging on its first two moments. Finally, to analyze potential benefits of the Bayesian approach, we also consider Maximum Likelihood (ML) estimation as a baseline approach. We show both theoretically and empirically that the Bayesian model averaging can be potentially beneficial.http://www.mdpi.com/1099-4300/15/4/1202Poisson distributionunreplicated dataBayesian learningexpected Kullback–Leibler divergence |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Peter Tiňo |
spellingShingle |
Peter Tiňo Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get? Entropy Poisson distribution unreplicated data Bayesian learning expected Kullback–Leibler divergence |
author_facet |
Peter Tiňo |
author_sort |
Peter Tiňo |
title |
Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get? |
title_short |
Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get? |
title_full |
Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get? |
title_fullStr |
Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get? |
title_full_unstemmed |
Pushing for the Extreme: Estimation of Poisson Distribution from Low Count Unreplicated Data—How Close Can We Get? |
title_sort |
pushing for the extreme: estimation of poisson distribution from low count unreplicated data—how close can we get? |
publisher |
MDPI AG |
series |
Entropy |
issn |
1099-4300 |
publishDate |
2013-04-01 |
description |
Studies of learning algorithms typically concentrate on situations where potentially ever growing training sample is available. Yet, there can be situations (e.g., detection of differentially expressed genes on unreplicated data or estimation of time delay in non-stationary gravitationally lensed photon streams) where only extremely small samples can be used in order to perform an inference. On unreplicated data, the inference has to be performed on the smallest sample possible—sample of size 1. We study whether anything useful can be learnt in such extreme situations by concentrating on a Bayesian approach that can account for possible prior information on expected counts. We perform a detailed information theoretic study of such Bayesian estimation and quantify the effect of Bayesian averaging on its first two moments. Finally, to analyze potential benefits of the Bayesian approach, we also consider Maximum Likelihood (ML) estimation as a baseline approach. We show both theoretically and empirically that the Bayesian model averaging can be potentially beneficial. |
topic |
Poisson distribution unreplicated data Bayesian learning expected Kullback–Leibler divergence |
url |
http://www.mdpi.com/1099-4300/15/4/1202 |
work_keys_str_mv |
AT petertino pushingfortheextremeestimationofpoissondistributionfromlowcountunreplicateddatahowclosecanweget |
_version_ |
1725802308709646336 |