Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
Abstract Background Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and l...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2019-12-01
|
Series: | Genome Biology |
Online Access: | https://doi.org/10.1186/s13059-019-1898-6 |
id |
doaj-5261ad8235054965b19b61201f67e4b3 |
---|---|
record_format |
Article |
spelling |
doaj-5261ad8235054965b19b61201f67e4b32020-12-13T12:39:41ZengBMCGenome Biology1474-760X2019-12-0120112110.1186/s13059-019-1898-6Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysisShiquan Sun0Jiaqiang Zhu1Ying Ma2Xiang Zhou3School of Computer Science, Northwestern Polytechnical UniversityDepartment of Biostatistics, University of MichiganDepartment of Biostatistics, University of MichiganDepartment of Biostatistics, University of MichiganAbstract Background Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage reconstruction. Unfortunately, despite the critical importance of dimensionality reduction in scRNA-seq analysis and the vast number of dimensionality reduction methods developed for scRNA-seq studies, few comprehensive comparison studies have been performed to evaluate the effectiveness of different dimensionality reduction methods in scRNA-seq. Results We aim to fill this critical knowledge gap by providing a comparative evaluation of a variety of commonly used dimensionality reduction methods for scRNA-seq studies. Specifically, we compare 18 different dimensionality reduction methods on 30 publicly available scRNA-seq datasets that cover a range of sequencing techniques and sample sizes. We evaluate the performance of different dimensionality reduction methods for neighborhood preserving in terms of their ability to recover features of the original expression matrix, and for cell clustering and lineage reconstruction in terms of their accuracy and robustness. We also evaluate the computational scalability of different dimensionality reduction methods by recording their computational cost. Conclusions Based on the comprehensive evaluation results, we provide important guidelines for choosing dimensionality reduction methods for scRNA-seq data analysis. We also provide all analysis scripts used in the present study at www.xzlab.org/reproduce.html.https://doi.org/10.1186/s13059-019-1898-6 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Shiquan Sun Jiaqiang Zhu Ying Ma Xiang Zhou |
spellingShingle |
Shiquan Sun Jiaqiang Zhu Ying Ma Xiang Zhou Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis Genome Biology |
author_facet |
Shiquan Sun Jiaqiang Zhu Ying Ma Xiang Zhou |
author_sort |
Shiquan Sun |
title |
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis |
title_short |
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis |
title_full |
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis |
title_fullStr |
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis |
title_full_unstemmed |
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis |
title_sort |
accuracy, robustness and scalability of dimensionality reduction methods for single-cell rna-seq analysis |
publisher |
BMC |
series |
Genome Biology |
issn |
1474-760X |
publishDate |
2019-12-01 |
description |
Abstract Background Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage reconstruction. Unfortunately, despite the critical importance of dimensionality reduction in scRNA-seq analysis and the vast number of dimensionality reduction methods developed for scRNA-seq studies, few comprehensive comparison studies have been performed to evaluate the effectiveness of different dimensionality reduction methods in scRNA-seq. Results We aim to fill this critical knowledge gap by providing a comparative evaluation of a variety of commonly used dimensionality reduction methods for scRNA-seq studies. Specifically, we compare 18 different dimensionality reduction methods on 30 publicly available scRNA-seq datasets that cover a range of sequencing techniques and sample sizes. We evaluate the performance of different dimensionality reduction methods for neighborhood preserving in terms of their ability to recover features of the original expression matrix, and for cell clustering and lineage reconstruction in terms of their accuracy and robustness. We also evaluate the computational scalability of different dimensionality reduction methods by recording their computational cost. Conclusions Based on the comprehensive evaluation results, we provide important guidelines for choosing dimensionality reduction methods for scRNA-seq data analysis. We also provide all analysis scripts used in the present study at www.xzlab.org/reproduce.html. |
url |
https://doi.org/10.1186/s13059-019-1898-6 |
work_keys_str_mv |
AT shiquansun accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis AT jiaqiangzhu accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis AT yingma accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis AT xiangzhou accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis |
_version_ |
1724384407496687616 |