cdev: a ground-truth based measure to evaluate RNA-seq normalization performance

Normalization of RNA-seq data has been an active area of research since the problem was first recognized a decade ago. Despite the active development of new normalizers, their performance measures have been given little attention. To evaluate normalizers, researchers have been relying on ad hoc meas...

Full description

Bibliographic Details
Main Authors: Diem-Trang Tran, Matthew Might
Format: Article
Language:English
Published: PeerJ Inc. 2021-10-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/12233.pdf
Description
Summary:Normalization of RNA-seq data has been an active area of research since the problem was first recognized a decade ago. Despite the active development of new normalizers, their performance measures have been given little attention. To evaluate normalizers, researchers have been relying on ad hoc measures, most of which are either qualitative, potentially biased, or easily confounded by parametric choices of downstream analysis. We propose a metric called condition-number based deviation, or cdev, to quantify normalization success. cdev measures how much an expression matrix differs from another. If a ground truth normalization is given, cdev can then be used to evaluate the performance of normalizers. To establish experimental ground truth, we compiled an extensive set of public RNA-seq assays with external spike-ins. This data collection, together with cdev, provides a valuable toolset for benchmarking new and existing normalization methods.
ISSN:2167-8359