qtQDA: quantile transformed quadratic discriminant analysis for high-dimensional RNA-seq data

Classification on the basis of gene expression data derived from RNA-seq promises to become an important part of modern medicine. We propose a new classification method based on a model where the data is marginally negative binomial but dependent, thereby incorporating the dependence known to be pre...

Full description

Bibliographic Details
Main Authors: Necla Koçhan, G. Yazgi Tutuncu, Gordon K. Smyth, Luke C. Gandolfo, Göknur Giner
Format: Article
Language:English
Published: PeerJ Inc. 2019-12-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/8260.pdf
Description
Summary:Classification on the basis of gene expression data derived from RNA-seq promises to become an important part of modern medicine. We propose a new classification method based on a model where the data is marginally negative binomial but dependent, thereby incorporating the dependence known to be present between measurements from different genes. The method, called qtQDA, works by first performing a quantile transformation (qt) then applying Gaussian quadratic discriminant analysis (QDA) using regularized covariance matrix estimates. We show that qtQDA has excellent performance when applied to real data sets and has advantages over some existing approaches. An R package implementing the method is also available on https://github.com/goknurginer/qtQDA.
ISSN:2167-8359