t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data

Microarray data analysis typically consists in identifying a list of differentially expressed genes (DEG), i.e., the genes that are differentially expressed between two experimental conditions. Variance shrinkage methods have been considered a better choice than the standard t-test for selecting the...

Full description

Bibliographic Details
Main Authors: Marcelo Boareto, Nestor Caticha
Format: Article
Language:English
Published: MDPI AG 2014-12-01
Series:Microarrays
Subjects:
Online Access:http://www.mdpi.com/2076-3905/3/4/340
id doaj-7b4f8ad8111b47908fd7f1b7d62da245
record_format Article
spelling doaj-7b4f8ad8111b47908fd7f1b7d62da2452020-11-25T00:19:35ZengMDPI AGMicroarrays2076-39052014-12-013434035110.3390/microarrays3040340microarrays3040340t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray DataMarcelo Boareto0Nestor Caticha1Institute of Physics, University of São Paulo, São Paulo, SP 05508-900, BrazilInstitute of Physics, University of São Paulo, São Paulo, SP 05508-900, BrazilMicroarray data analysis typically consists in identifying a list of differentially expressed genes (DEG), i.e., the genes that are differentially expressed between two experimental conditions. Variance shrinkage methods have been considered a better choice than the standard t-test for selecting the DEG because they correct the dependence of the error with the expression level. This dependence is mainly caused by errors in background correction, which more severely affects genes with low expression values. Here, we propose a new method for identifying the DEG that overcomes this issue and does not require background correction or variance shrinkage. Unlike current methods, our methodology is easy to understand and implement. It consists of applying the standard t-test directly on the normalized intensity data, which is possible because the probe intensity is proportional to the gene expression level and because the t-test is scale- and location-invariant. This methodology considerably improves the sensitivity and robustness of the list of DEG when compared with the t-test applied to preprocessed data and to the most widely used shrinkage methods, Significance Analysis of Microarrays (SAM) and Linear Models for Microarray Data (LIMMA). Our approach is useful especially when the genes of interest have small differences in expression and therefore get ignored by standard variance shrinkage methods.http://www.mdpi.com/2076-3905/3/4/340microarrayspreprocessingvariance shrinkaget-testbackground correction
collection DOAJ
language English
format Article
sources DOAJ
author Marcelo Boareto
Nestor Caticha
spellingShingle Marcelo Boareto
Nestor Caticha
t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data
Microarrays
microarrays
preprocessing
variance shrinkage
t-test
background correction
author_facet Marcelo Boareto
Nestor Caticha
author_sort Marcelo Boareto
title t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data
title_short t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data
title_full t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data
title_fullStr t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data
title_full_unstemmed t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data
title_sort t-test at the probe level: an alternative method to identify statistically significant genes for microarray data
publisher MDPI AG
series Microarrays
issn 2076-3905
publishDate 2014-12-01
description Microarray data analysis typically consists in identifying a list of differentially expressed genes (DEG), i.e., the genes that are differentially expressed between two experimental conditions. Variance shrinkage methods have been considered a better choice than the standard t-test for selecting the DEG because they correct the dependence of the error with the expression level. This dependence is mainly caused by errors in background correction, which more severely affects genes with low expression values. Here, we propose a new method for identifying the DEG that overcomes this issue and does not require background correction or variance shrinkage. Unlike current methods, our methodology is easy to understand and implement. It consists of applying the standard t-test directly on the normalized intensity data, which is possible because the probe intensity is proportional to the gene expression level and because the t-test is scale- and location-invariant. This methodology considerably improves the sensitivity and robustness of the list of DEG when compared with the t-test applied to preprocessed data and to the most widely used shrinkage methods, Significance Analysis of Microarrays (SAM) and Linear Models for Microarray Data (LIMMA). Our approach is useful especially when the genes of interest have small differences in expression and therefore get ignored by standard variance shrinkage methods.
topic microarrays
preprocessing
variance shrinkage
t-test
background correction
url http://www.mdpi.com/2076-3905/3/4/340
work_keys_str_mv AT marceloboareto ttestattheprobelevelanalternativemethodtoidentifystatisticallysignificantgenesformicroarraydata
AT nestorcaticha ttestattheprobelevelanalternativemethodtoidentifystatisticallysignificantgenesformicroarraydata
_version_ 1725371078376685568