Pseudogenes and Their Genome-Wide Prediction in Plants

Pseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence dem...

Full description

Bibliographic Details
Main Authors: Jin Xiao, Manoj Kumar Sekhwal, Pingchuan Li, Raja Ragupathy, Sylvie Cloutier, Xiue Wang, Frank M. You
Format: Article
Language:English
Published: MDPI AG 2016-11-01
Series:International Journal of Molecular Sciences
Subjects:
Online Access:http://www.mdpi.com/1422-0067/17/12/1991
id doaj-c4b51f70045f4e64939a3e6ddb85cc76
record_format Article
spelling doaj-c4b51f70045f4e64939a3e6ddb85cc762020-11-24T22:11:45ZengMDPI AGInternational Journal of Molecular Sciences1422-00672016-11-011712199110.3390/ijms17121991ijms17121991Pseudogenes and Their Genome-Wide Prediction in PlantsJin Xiao0Manoj Kumar Sekhwal1Pingchuan Li2Raja Ragupathy3Sylvie Cloutier4Xiue Wang5Frank M. You6Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, CanadaMorden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, CanadaMorden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, CanadaDepartment of Plant Science, University of Saskatchewan, Saskatoon, SK S7N 5A2, CanadaOttawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, CanadaDepartment of Agronomy, Nanjing Agricultural University, Nanjing 210095, ChinaMorden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, CanadaPseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence demonstrates that some of them have potential roles in regulation. The majority of pseudogenes are generated from functional progenitor genes either by gene duplication (duplicated pseudogenes) or retro-transposition (processed pseudogenes). Pseudogenes are primarily identified by comparison to their parent genes. Bioinformatics tools for pseudogene prediction have been developed, among which PseudoPipe, PSF and Shiu’s pipeline are publicly available. We compared these three tools using the well-annotated Arabidopsis thaliana genome and its known 924 pseudogenes as a test data set. PseudoPipe and Shiu’s pipeline identified ~80% of A. thaliana pseudogenes, of which 94% were shared, while PSF failed to generate adequate results. A need for improvement of the bioinformatics tools for pseudogene prediction accuracy in plant genomes was thus identified, with the ultimate goal of improving the quality of genome annotation in plants.http://www.mdpi.com/1422-0067/17/12/1991pseudogenesprocessedduplicatedbioinformatics toolsplantsgenome-wide
collection DOAJ
language English
format Article
sources DOAJ
author Jin Xiao
Manoj Kumar Sekhwal
Pingchuan Li
Raja Ragupathy
Sylvie Cloutier
Xiue Wang
Frank M. You
spellingShingle Jin Xiao
Manoj Kumar Sekhwal
Pingchuan Li
Raja Ragupathy
Sylvie Cloutier
Xiue Wang
Frank M. You
Pseudogenes and Their Genome-Wide Prediction in Plants
International Journal of Molecular Sciences
pseudogenes
processed
duplicated
bioinformatics tools
plants
genome-wide
author_facet Jin Xiao
Manoj Kumar Sekhwal
Pingchuan Li
Raja Ragupathy
Sylvie Cloutier
Xiue Wang
Frank M. You
author_sort Jin Xiao
title Pseudogenes and Their Genome-Wide Prediction in Plants
title_short Pseudogenes and Their Genome-Wide Prediction in Plants
title_full Pseudogenes and Their Genome-Wide Prediction in Plants
title_fullStr Pseudogenes and Their Genome-Wide Prediction in Plants
title_full_unstemmed Pseudogenes and Their Genome-Wide Prediction in Plants
title_sort pseudogenes and their genome-wide prediction in plants
publisher MDPI AG
series International Journal of Molecular Sciences
issn 1422-0067
publishDate 2016-11-01
description Pseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence demonstrates that some of them have potential roles in regulation. The majority of pseudogenes are generated from functional progenitor genes either by gene duplication (duplicated pseudogenes) or retro-transposition (processed pseudogenes). Pseudogenes are primarily identified by comparison to their parent genes. Bioinformatics tools for pseudogene prediction have been developed, among which PseudoPipe, PSF and Shiu’s pipeline are publicly available. We compared these three tools using the well-annotated Arabidopsis thaliana genome and its known 924 pseudogenes as a test data set. PseudoPipe and Shiu’s pipeline identified ~80% of A. thaliana pseudogenes, of which 94% were shared, while PSF failed to generate adequate results. A need for improvement of the bioinformatics tools for pseudogene prediction accuracy in plant genomes was thus identified, with the ultimate goal of improving the quality of genome annotation in plants.
topic pseudogenes
processed
duplicated
bioinformatics tools
plants
genome-wide
url http://www.mdpi.com/1422-0067/17/12/1991
work_keys_str_mv AT jinxiao pseudogenesandtheirgenomewidepredictioninplants
AT manojkumarsekhwal pseudogenesandtheirgenomewidepredictioninplants
AT pingchuanli pseudogenesandtheirgenomewidepredictioninplants
AT rajaragupathy pseudogenesandtheirgenomewidepredictioninplants
AT sylviecloutier pseudogenesandtheirgenomewidepredictioninplants
AT xiuewang pseudogenesandtheirgenomewidepredictioninplants
AT frankmyou pseudogenesandtheirgenomewidepredictioninplants
_version_ 1725804477980606464