A general pipeline for the development of anchor markers for comparative genomics in plants

<p>Abstract</p> <p>Background</p> <p>Complete or near-complete genomic sequence information is presently only available for a few plant species representing a large phylogenetic diversity among plants. In order to effectively transfer this information to species lacking...

Full description

Bibliographic Details
Main Authors: Stougaard Jens, Sandal Niels, Bertioli David, Nielsen Anna, Hougaard Birgit K, Madsen Lene H, Fredslund Jakob, Schauser Leif
Format: Article
Language:English
Published: BMC 2006-08-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/7/207
id doaj-0d6b6cddbf7248d1b265519205732661
record_format Article
spelling doaj-0d6b6cddbf7248d1b2655192057326612020-11-25T00:37:40ZengBMCBMC Genomics1471-21642006-08-017120710.1186/1471-2164-7-207A general pipeline for the development of anchor markers for comparative genomics in plantsStougaard JensSandal NielsBertioli DavidNielsen AnnaHougaard Birgit KMadsen Lene HFredslund JakobSchauser Leif<p>Abstract</p> <p>Background</p> <p>Complete or near-complete genomic sequence information is presently only available for a few plant species representing a large phylogenetic diversity among plants. In order to effectively transfer this information to species lacking sequence information, comparative genomic tools need to be developed. Molecular markers permitting cross-species mapping along co-linear genomic regions are central to comparative genomics. These "anchor" markers, defining unique loci in genetic linkage maps of multiple species, are gene-based and possess a number of features that make them relatively sparse. To identify potential anchor marker sequences more efficiently, we have established an automated bioinformatic pipeline that combines multi-species Expressed Sequence Tags (EST) and genome sequence data.</p> <p>Results</p> <p>Taking advantage of sequence data from related species, the pipeline identifies evolutionarily conserved sequences that are likely to define unique orthologous loci in most species of the same phylogenetic clade. The key features are the identification of evolutionarily conserved sequences followed by automated design of intron-flanking Polymerase Chain Reaction (PCR) primer pairs. Polymorphisms can subsequently be identified by size- or sequence variation of PCR products, amplified from mapping parents or populations. We illustrate our procedure in legumes and grasses and exemplify its application in legumes, where model plant studies and the genome- and EST-sequence data available have a potential impact on the breeding of crop species and on our understanding of the evolution of this large and diverse family.</p> <p>Conclusion</p> <p>We provide a database of 459 candidate anchor loci which have the potential to serve as map anchors in more than 18,000 legume species, a number of which are of agricultural importance. For grasses, the database contains 1335 candidate anchor loci. Based on this database, we have evaluated 76 candidate anchor loci with respect to marker development in legume species with no sequence information available, demonstrating the validity of this approach.</p> http://www.biomedcentral.com/1471-2164/7/207
collection DOAJ
language English
format Article
sources DOAJ
author Stougaard Jens
Sandal Niels
Bertioli David
Nielsen Anna
Hougaard Birgit K
Madsen Lene H
Fredslund Jakob
Schauser Leif
spellingShingle Stougaard Jens
Sandal Niels
Bertioli David
Nielsen Anna
Hougaard Birgit K
Madsen Lene H
Fredslund Jakob
Schauser Leif
A general pipeline for the development of anchor markers for comparative genomics in plants
BMC Genomics
author_facet Stougaard Jens
Sandal Niels
Bertioli David
Nielsen Anna
Hougaard Birgit K
Madsen Lene H
Fredslund Jakob
Schauser Leif
author_sort Stougaard Jens
title A general pipeline for the development of anchor markers for comparative genomics in plants
title_short A general pipeline for the development of anchor markers for comparative genomics in plants
title_full A general pipeline for the development of anchor markers for comparative genomics in plants
title_fullStr A general pipeline for the development of anchor markers for comparative genomics in plants
title_full_unstemmed A general pipeline for the development of anchor markers for comparative genomics in plants
title_sort general pipeline for the development of anchor markers for comparative genomics in plants
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2006-08-01
description <p>Abstract</p> <p>Background</p> <p>Complete or near-complete genomic sequence information is presently only available for a few plant species representing a large phylogenetic diversity among plants. In order to effectively transfer this information to species lacking sequence information, comparative genomic tools need to be developed. Molecular markers permitting cross-species mapping along co-linear genomic regions are central to comparative genomics. These "anchor" markers, defining unique loci in genetic linkage maps of multiple species, are gene-based and possess a number of features that make them relatively sparse. To identify potential anchor marker sequences more efficiently, we have established an automated bioinformatic pipeline that combines multi-species Expressed Sequence Tags (EST) and genome sequence data.</p> <p>Results</p> <p>Taking advantage of sequence data from related species, the pipeline identifies evolutionarily conserved sequences that are likely to define unique orthologous loci in most species of the same phylogenetic clade. The key features are the identification of evolutionarily conserved sequences followed by automated design of intron-flanking Polymerase Chain Reaction (PCR) primer pairs. Polymorphisms can subsequently be identified by size- or sequence variation of PCR products, amplified from mapping parents or populations. We illustrate our procedure in legumes and grasses and exemplify its application in legumes, where model plant studies and the genome- and EST-sequence data available have a potential impact on the breeding of crop species and on our understanding of the evolution of this large and diverse family.</p> <p>Conclusion</p> <p>We provide a database of 459 candidate anchor loci which have the potential to serve as map anchors in more than 18,000 legume species, a number of which are of agricultural importance. For grasses, the database contains 1335 candidate anchor loci. Based on this database, we have evaluated 76 candidate anchor loci with respect to marker development in legume species with no sequence information available, demonstrating the validity of this approach.</p>
url http://www.biomedcentral.com/1471-2164/7/207
work_keys_str_mv AT stougaardjens ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT sandalniels ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT bertiolidavid ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT nielsenanna ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT hougaardbirgitk ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT madsenleneh ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT fredslundjakob ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT schauserleif ageneralpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT stougaardjens generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT sandalniels generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT bertiolidavid generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT nielsenanna generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT hougaardbirgitk generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT madsenleneh generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT fredslundjakob generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
AT schauserleif generalpipelineforthedevelopmentofanchormarkersforcomparativegenomicsinplants
_version_ 1725300115821821952