Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis

Abstract Background Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and l...

Full description

Bibliographic Details
Main Authors: Shiquan Sun, Jiaqiang Zhu, Ying Ma, Xiang Zhou
Format: Article
Language:English
Published: BMC 2019-12-01
Series:Genome Biology
Online Access:https://doi.org/10.1186/s13059-019-1898-6
id doaj-5261ad8235054965b19b61201f67e4b3
record_format Article
spelling doaj-5261ad8235054965b19b61201f67e4b32020-12-13T12:39:41ZengBMCGenome Biology1474-760X2019-12-0120112110.1186/s13059-019-1898-6Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysisShiquan Sun0Jiaqiang Zhu1Ying Ma2Xiang Zhou3School of Computer Science, Northwestern Polytechnical UniversityDepartment of Biostatistics, University of MichiganDepartment of Biostatistics, University of MichiganDepartment of Biostatistics, University of MichiganAbstract Background Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage reconstruction. Unfortunately, despite the critical importance of dimensionality reduction in scRNA-seq analysis and the vast number of dimensionality reduction methods developed for scRNA-seq studies, few comprehensive comparison studies have been performed to evaluate the effectiveness of different dimensionality reduction methods in scRNA-seq. Results We aim to fill this critical knowledge gap by providing a comparative evaluation of a variety of commonly used dimensionality reduction methods for scRNA-seq studies. Specifically, we compare 18 different dimensionality reduction methods on 30 publicly available scRNA-seq datasets that cover a range of sequencing techniques and sample sizes. We evaluate the performance of different dimensionality reduction methods for neighborhood preserving in terms of their ability to recover features of the original expression matrix, and for cell clustering and lineage reconstruction in terms of their accuracy and robustness. We also evaluate the computational scalability of different dimensionality reduction methods by recording their computational cost. Conclusions Based on the comprehensive evaluation results, we provide important guidelines for choosing dimensionality reduction methods for scRNA-seq data analysis. We also provide all analysis scripts used in the present study at www.xzlab.org/reproduce.html.https://doi.org/10.1186/s13059-019-1898-6
collection DOAJ
language English
format Article
sources DOAJ
author Shiquan Sun
Jiaqiang Zhu
Ying Ma
Xiang Zhou
spellingShingle Shiquan Sun
Jiaqiang Zhu
Ying Ma
Xiang Zhou
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
Genome Biology
author_facet Shiquan Sun
Jiaqiang Zhu
Ying Ma
Xiang Zhou
author_sort Shiquan Sun
title Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_short Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_full Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_fullStr Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_full_unstemmed Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_sort accuracy, robustness and scalability of dimensionality reduction methods for single-cell rna-seq analysis
publisher BMC
series Genome Biology
issn 1474-760X
publishDate 2019-12-01
description Abstract Background Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage reconstruction. Unfortunately, despite the critical importance of dimensionality reduction in scRNA-seq analysis and the vast number of dimensionality reduction methods developed for scRNA-seq studies, few comprehensive comparison studies have been performed to evaluate the effectiveness of different dimensionality reduction methods in scRNA-seq. Results We aim to fill this critical knowledge gap by providing a comparative evaluation of a variety of commonly used dimensionality reduction methods for scRNA-seq studies. Specifically, we compare 18 different dimensionality reduction methods on 30 publicly available scRNA-seq datasets that cover a range of sequencing techniques and sample sizes. We evaluate the performance of different dimensionality reduction methods for neighborhood preserving in terms of their ability to recover features of the original expression matrix, and for cell clustering and lineage reconstruction in terms of their accuracy and robustness. We also evaluate the computational scalability of different dimensionality reduction methods by recording their computational cost. Conclusions Based on the comprehensive evaluation results, we provide important guidelines for choosing dimensionality reduction methods for scRNA-seq data analysis. We also provide all analysis scripts used in the present study at www.xzlab.org/reproduce.html.
url https://doi.org/10.1186/s13059-019-1898-6
work_keys_str_mv AT shiquansun accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis
AT jiaqiangzhu accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis
AT yingma accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis
AT xiangzhou accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis
_version_ 1724384407496687616