Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets
Visualisation tools that use dimensionality reduction, such as t-SNE, provide poor visualisation on large data sets of millions of observations. Here the authors present opt-SNE, that automatically finds data set-tailored parameters for t-SNE to optimise visualisation and improve analysis.
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2019-11-01
|
Series: | Nature Communications |
Online Access: | https://doi.org/10.1038/s41467-019-13055-y |
id |
doaj-e5126a248205487bb5a7c54c11c0bcc3 |
---|---|
record_format |
Article |
spelling |
doaj-e5126a248205487bb5a7c54c11c0bcc32021-05-11T12:12:47ZengNature Publishing GroupNature Communications2041-17232019-11-0110111210.1038/s41467-019-13055-yAutomated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasetsAnna C. Belkina0Christopher O. Ciccolella1Rina Anno2Richard Halpert3Josef Spidlen4Jennifer E. Snyder-Cappione5Department of Pathology and Laboratory Medicine, Boston University School of MedicineOmiq, IncDepartment of Mathematics, Kansas State UniversityBD Life Sciences–FlowJoBD Life Sciences–FlowJoFlow Cytometry Core Facility, Boston University School of MedicineVisualisation tools that use dimensionality reduction, such as t-SNE, provide poor visualisation on large data sets of millions of observations. Here the authors present opt-SNE, that automatically finds data set-tailored parameters for t-SNE to optimise visualisation and improve analysis.https://doi.org/10.1038/s41467-019-13055-y |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Anna C. Belkina Christopher O. Ciccolella Rina Anno Richard Halpert Josef Spidlen Jennifer E. Snyder-Cappione |
spellingShingle |
Anna C. Belkina Christopher O. Ciccolella Rina Anno Richard Halpert Josef Spidlen Jennifer E. Snyder-Cappione Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets Nature Communications |
author_facet |
Anna C. Belkina Christopher O. Ciccolella Rina Anno Richard Halpert Josef Spidlen Jennifer E. Snyder-Cappione |
author_sort |
Anna C. Belkina |
title |
Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets |
title_short |
Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets |
title_full |
Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets |
title_fullStr |
Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets |
title_full_unstemmed |
Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets |
title_sort |
automated optimized parameters for t-distributed stochastic neighbor embedding improve visualization and analysis of large datasets |
publisher |
Nature Publishing Group |
series |
Nature Communications |
issn |
2041-1723 |
publishDate |
2019-11-01 |
description |
Visualisation tools that use dimensionality reduction, such as t-SNE, provide poor visualisation on large data sets of millions of observations. Here the authors present opt-SNE, that automatically finds data set-tailored parameters for t-SNE to optimise visualisation and improve analysis. |
url |
https://doi.org/10.1038/s41467-019-13055-y |
work_keys_str_mv |
AT annacbelkina automatedoptimizedparametersfortdistributedstochasticneighborembeddingimprovevisualizationandanalysisoflargedatasets AT christopherociccolella automatedoptimizedparametersfortdistributedstochasticneighborembeddingimprovevisualizationandanalysisoflargedatasets AT rinaanno automatedoptimizedparametersfortdistributedstochasticneighborembeddingimprovevisualizationandanalysisoflargedatasets AT richardhalpert automatedoptimizedparametersfortdistributedstochasticneighborembeddingimprovevisualizationandanalysisoflargedatasets AT josefspidlen automatedoptimizedparametersfortdistributedstochasticneighborembeddingimprovevisualizationandanalysisoflargedatasets AT jenniferesnydercappione automatedoptimizedparametersfortdistributedstochasticneighborembeddingimprovevisualizationandanalysisoflargedatasets |
_version_ |
1721445314957672448 |