CuBlock: A cross-platform normalization method for gene-expression microarrays

Motivation: Cross-(multi)platform normalization of gene-expression microarray data remains an unresolved issue. Despite the existence of several algorithms, they are either constrained by the need to normalize all samples of all platforms together, compromising scalability and reuse, by adherence to...

Full description

Bibliographic Details
Main Authors:	Daura, X. (Author), Farrés, J. (Author), Junet, V. (Author), Mas, J.M (Author)
Format:	Article
Language:	English
Published:	Oxford University Press 2021
Subjects:	algorithm article controlled study DNA microarray experimental study extraction human randomized controlled trial standardization
Online Access:	View Fulltext in Publisher


LEADER	02928nam a2200277Ia 4500
001	10.1093-bioinformatics-btab105
008	220427s2021 CNT 000 0 und d
020			\|a 13674803 (ISSN)
245	1	0	\|a CuBlock: A cross-platform normalization method for gene-expression microarrays
260		0	\|b Oxford University Press \|c 2021
856			\|z View Fulltext in Publisher \|u https://doi.org/10.1093/bioinformatics/btab105
520	3		\|a Motivation: Cross-(multi)platform normalization of gene-expression microarray data remains an unresolved issue. Despite the existence of several algorithms, they are either constrained by the need to normalize all samples of all platforms together, compromising scalability and reuse, by adherence to the platforms of a specific provider, or simply by poor performance. In addition, many of the methods presented in the literature have not been specifically tested against multi-platform data and/or other methods applicable in this context. Thus, we set out to develop a normalization algorithm appropriate for gene-expression studies based on multiple, potentially large microarray sets collected along multiple platforms and at different times, applicable in systematic studies aimed at extracting knowledge from the wealth of microarray data available in public repositories; for example, for the extraction of Real-World Data to complement data from Randomized Controlled Trials. Our main focus or criterion for performance was on the capacity of the algorithm to properly separate samples from different biological groups. Results: We present CuBlock, an algorithm addressing this objective, together with a strategy to validate cross-platform normalization methods. To validate the algorithm and benchmark it against existing methods, we used two distinct datasets, one specifically generated for testing and standardization purposes and one from an actual experimental study. Using these datasets, we benchmarked CuBlock against ComBat (Johnson et al., 2007), UPC (Piccolo et al., 2013), YuGene (Lê Cao et al., 2014), DBNorm (Meng et al., 2017), Shambhala (Borisov et al., 2019) and a simple log2 transform as reference. We note that many other popular normalization methods are not applicable in this context. CuBlock was the only algorithm in this group that could always and clearly differentiate the underlying biological groups after mixing the data, from up to six different platforms in this study. © 2021 The Author(s) 2021. Published by Oxford University Press.
650	0	4	\|a algorithm
650	0	4	\|a article
650	0	4	\|a controlled study
650	0	4	\|a DNA microarray
650	0	4	\|a experimental study
650	0	4	\|a extraction
650	0	4	\|a human
650	0	4	\|a randomized controlled trial
650	0	4	\|a standardization
700	1		\|a Daura, X. \|e author
700	1		\|a Farrés, J. \|e author
700	1		\|a Junet, V. \|e author
700	1		\|a Mas, J.M. \|e author
773			\|t Bioinformatics

CuBlock: A cross-platform normalization method for gene-expression microarrays

Similar Items