Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis

Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) t...

Full description

Bibliographic Details
Main Authors: Y-h. Taguchi, Turki Turki
Format: Article
Language:English
Published: MDPI AG 2021-09-01
Series:Genes
Subjects:
Online Access:https://www.mdpi.com/2073-4425/12/9/1442
id doaj-d92df2c40cb44474bb2e7baddadaad63
record_format Article
spelling doaj-d92df2c40cb44474bb2e7baddadaad632021-09-26T00:13:36ZengMDPI AGGenes2073-44252021-09-01121442144210.3390/genes12091442Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data AnalysisY-h. Taguchi0Turki Turki1Department of Physics, Chuo University, Tokyo 112-8551, JapanDepartment of Computer Science, King Abdulaziz University, Jeddah 21589, Saudi ArabiaAnalysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles.https://www.mdpi.com/2073-4425/12/9/1442tensor decompositionfeature extractionsingle-cellmultiomics data
collection DOAJ
language English
format Article
sources DOAJ
author Y-h. Taguchi
Turki Turki
spellingShingle Y-h. Taguchi
Turki Turki
Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
Genes
tensor decomposition
feature extraction
single-cell
multiomics data
author_facet Y-h. Taguchi
Turki Turki
author_sort Y-h. Taguchi
title Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
title_short Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
title_full Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
title_fullStr Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
title_full_unstemmed Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
title_sort tensor-decomposition-based unsupervised feature extraction in single-cell multiomics data analysis
publisher MDPI AG
series Genes
issn 2073-4425
publishDate 2021-09-01
description Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles.
topic tensor decomposition
feature extraction
single-cell
multiomics data
url https://www.mdpi.com/2073-4425/12/9/1442
work_keys_str_mv AT yhtaguchi tensordecompositionbasedunsupervisedfeatureextractioninsinglecellmultiomicsdataanalysis
AT turkiturki tensordecompositionbasedunsupervisedfeatureextractioninsinglecellmultiomicsdataanalysis
_version_ 1717366715017330688