Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) t...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-09-01
|
Series: | Genes |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-4425/12/9/1442 |
id |
doaj-d92df2c40cb44474bb2e7baddadaad63 |
---|---|
record_format |
Article |
spelling |
doaj-d92df2c40cb44474bb2e7baddadaad632021-09-26T00:13:36ZengMDPI AGGenes2073-44252021-09-01121442144210.3390/genes12091442Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data AnalysisY-h. Taguchi0Turki Turki1Department of Physics, Chuo University, Tokyo 112-8551, JapanDepartment of Computer Science, King Abdulaziz University, Jeddah 21589, Saudi ArabiaAnalysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles.https://www.mdpi.com/2073-4425/12/9/1442tensor decompositionfeature extractionsingle-cellmultiomics data |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Y-h. Taguchi Turki Turki |
spellingShingle |
Y-h. Taguchi Turki Turki Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis Genes tensor decomposition feature extraction single-cell multiomics data |
author_facet |
Y-h. Taguchi Turki Turki |
author_sort |
Y-h. Taguchi |
title |
Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis |
title_short |
Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis |
title_full |
Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis |
title_fullStr |
Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis |
title_full_unstemmed |
Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis |
title_sort |
tensor-decomposition-based unsupervised feature extraction in single-cell multiomics data analysis |
publisher |
MDPI AG |
series |
Genes |
issn |
2073-4425 |
publishDate |
2021-09-01 |
description |
Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles. |
topic |
tensor decomposition feature extraction single-cell multiomics data |
url |
https://www.mdpi.com/2073-4425/12/9/1442 |
work_keys_str_mv |
AT yhtaguchi tensordecompositionbasedunsupervisedfeatureextractioninsinglecellmultiomicsdataanalysis AT turkiturki tensordecompositionbasedunsupervisedfeatureextractioninsinglecellmultiomicsdataanalysis |
_version_ |
1717366715017330688 |