Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene Position

Notwithstanding the huge amount of detailed information available in protein databases, it is not possible to automatically download a list of proteins ordered by the position of their codifying gene. This order becomes crucial when analyzing common features of proteins produced by loci or other spe...

Full description

Bibliographic Details
Main Authors: Annamaria Vernone, Chiara Ricca, Gianpiero Pescarmona, Francesca Silvagno
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/8/3511
id doaj-080dd0f4816346a4babc19d25d967a6b
record_format Article
spelling doaj-080dd0f4816346a4babc19d25d967a6b2021-04-14T23:03:14ZengMDPI AGApplied Sciences2076-34172021-04-01113511351110.3390/app11083511Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene PositionAnnamaria Vernone0Chiara Ricca1Gianpiero Pescarmona2Francesca Silvagno3Department of Oncology, University of Torino, Via Santena 5 bis, 10126 Torino, ItalyDepartment of Oncology, University of Torino, Via Santena 5 bis, 10126 Torino, ItalyDepartment of Oncology, University of Torino, Via Santena 5 bis, 10126 Torino, ItalyDepartment of Oncology, University of Torino, Via Santena 5 bis, 10126 Torino, ItalyNotwithstanding the huge amount of detailed information available in protein databases, it is not possible to automatically download a list of proteins ordered by the position of their codifying gene. This order becomes crucial when analyzing common features of proteins produced by loci or other specific regions of human chromosomes. In this study, we developed a new procedure that interrogates two human databases (genomic and protein) and produces a novel dataset of ordered proteins following the mapping of the corresponding genes. We validated and implemented the procedure to create a user-friendly web application. This novel data mining was used to evaluate the distribution of critical amino acid content in proteins codified by a human chromosome. For this purpose, we designed a new methodological approach called chromosome walking, which scanned the whole chromosome and found the regions producing proteins enriched in a selected amino acid. As an example of biomedical application, we investigated the human chromosome 15, which contains the locus DYX1 linked to developmental dyslexia, and we found three additional putative gene clusters whose expression could be driven by the environmental availability of glutamate. The novel data mining procedure and analysis could be exploited in the study of several human pathologies.https://www.mdpi.com/2076-3417/11/8/3511protein databasegenomic databaseamino acid contentglutamatehuman chromosome 15developmental dyslexia
collection DOAJ
language English
format Article
sources DOAJ
author Annamaria Vernone
Chiara Ricca
Gianpiero Pescarmona
Francesca Silvagno
spellingShingle Annamaria Vernone
Chiara Ricca
Gianpiero Pescarmona
Francesca Silvagno
Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene Position
Applied Sciences
protein database
genomic database
amino acid content
glutamate
human chromosome 15
developmental dyslexia
author_facet Annamaria Vernone
Chiara Ricca
Gianpiero Pescarmona
Francesca Silvagno
author_sort Annamaria Vernone
title Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene Position
title_short Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene Position
title_full Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene Position
title_fullStr Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene Position
title_full_unstemmed Chromosome Walking: A Novel Approach to Analyse Amino Acid Content of Human Proteins Ordered by Gene Position
title_sort chromosome walking: a novel approach to analyse amino acid content of human proteins ordered by gene position
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2021-04-01
description Notwithstanding the huge amount of detailed information available in protein databases, it is not possible to automatically download a list of proteins ordered by the position of their codifying gene. This order becomes crucial when analyzing common features of proteins produced by loci or other specific regions of human chromosomes. In this study, we developed a new procedure that interrogates two human databases (genomic and protein) and produces a novel dataset of ordered proteins following the mapping of the corresponding genes. We validated and implemented the procedure to create a user-friendly web application. This novel data mining was used to evaluate the distribution of critical amino acid content in proteins codified by a human chromosome. For this purpose, we designed a new methodological approach called chromosome walking, which scanned the whole chromosome and found the regions producing proteins enriched in a selected amino acid. As an example of biomedical application, we investigated the human chromosome 15, which contains the locus DYX1 linked to developmental dyslexia, and we found three additional putative gene clusters whose expression could be driven by the environmental availability of glutamate. The novel data mining procedure and analysis could be exploited in the study of several human pathologies.
topic protein database
genomic database
amino acid content
glutamate
human chromosome 15
developmental dyslexia
url https://www.mdpi.com/2076-3417/11/8/3511
work_keys_str_mv AT annamariavernone chromosomewalkinganovelapproachtoanalyseaminoacidcontentofhumanproteinsorderedbygeneposition
AT chiararicca chromosomewalkinganovelapproachtoanalyseaminoacidcontentofhumanproteinsorderedbygeneposition
AT gianpieropescarmona chromosomewalkinganovelapproachtoanalyseaminoacidcontentofhumanproteinsorderedbygeneposition
AT francescasilvagno chromosomewalkinganovelapproachtoanalyseaminoacidcontentofhumanproteinsorderedbygeneposition
_version_ 1721526748119564288