Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related Species

Repetitive DNA including tandem repeats (TRs) is a significant part of most eukaryotic genomes. TRs include rapidly evolving satellite DNA (satDNA) that can be shared by closely related species, their abundance may be associated with evolutionary divergence, and they have been widely used for chromo...

Full description

Bibliographic Details
Main Authors: Pavel Kroupin, Victoria Kuznetsova, Dmitry Romanov, Alina Kocheshkova, Gennady Karlov, Thi Xuan Dang, Thi Mai L. Khuat, Ilya Kirov, Oleg Alexandrov, Alexander Polkhovskiy, Olga Razumova, Mikhail Divashuk
Format: Article
Language:English
Published: MDPI AG 2019-02-01
Series:Genes
Subjects:
rye
Online Access:https://www.mdpi.com/2073-4425/10/2/113
id doaj-2a40879ef54e4eedacbd5fde9b9504d1
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Pavel Kroupin
Victoria Kuznetsova
Dmitry Romanov
Alina Kocheshkova
Gennady Karlov
Thi Xuan Dang
Thi Mai L. Khuat
Ilya Kirov
Oleg Alexandrov
Alexander Polkhovskiy
Olga Razumova
Mikhail Divashuk
spellingShingle Pavel Kroupin
Victoria Kuznetsova
Dmitry Romanov
Alina Kocheshkova
Gennady Karlov
Thi Xuan Dang
Thi Mai L. Khuat
Ilya Kirov
Oleg Alexandrov
Alexander Polkhovskiy
Olga Razumova
Mikhail Divashuk
Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related Species
Genes
DNA repeats
satellite DNA
tandem repeats
bioinformatics search
whole-genome sequencing
cytogenetic markers
fluorescence in situ hybridization
wheat
rye
triticale
author_facet Pavel Kroupin
Victoria Kuznetsova
Dmitry Romanov
Alina Kocheshkova
Gennady Karlov
Thi Xuan Dang
Thi Mai L. Khuat
Ilya Kirov
Oleg Alexandrov
Alexander Polkhovskiy
Olga Razumova
Mikhail Divashuk
author_sort Pavel Kroupin
title Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related Species
title_short Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related Species
title_full Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related Species
title_fullStr Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related Species
title_full_unstemmed Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related Species
title_sort pipeline for the rapid development of cytogenetic markers using genomic data of related species
publisher MDPI AG
series Genes
issn 2073-4425
publishDate 2019-02-01
description Repetitive DNA including tandem repeats (TRs) is a significant part of most eukaryotic genomes. TRs include rapidly evolving satellite DNA (satDNA) that can be shared by closely related species, their abundance may be associated with evolutionary divergence, and they have been widely used for chromosome karyotyping using fluorescence in situ hybridization (FISH). The recent progress in the development of whole-genome sequencing and bioinformatics tools enables rapid and cost-effective searches for TRs including satDNA that can be converted into molecular cytogenetic markers. In the case of closely related taxa, the genome sequence of one species (<i>donor</i>) can be used as a base for the development of chromosome markers for related species or genomes (<i>target</i>). Here, we present a pipeline for rapid and high-throughput screening for new satDNA TRs in whole-genome sequencing of the <i>donor</i> genome and the development of chromosome markers based on them that can be applied in the <i>target</i> genome. One of the main peculiarities of the developed pipeline is that preliminary estimation of TR abundance using qPCR and ranking found TRs according to their copy number in the <i>target</i> genome; it facilitates the selection of the most prospective (most abundant) TRs that can be converted into cytogenetic markers. Another feature of our pipeline is the probe preparation for FISH using PCR with primers designed on the aligned TR unit sequences and the genomic DNA of a <i>target</i> species as a template that enables amplification of a whole pool of monomers inherent in the chromosomes of the <i>target</i> species. We demonstrate the efficiency of the developed pipeline by the example of FISH probes developed for A, B, and R subgenome chromosomes of hexaploid triticale (BBAARR) based on a bioinformatics analysis of the D genome of <i>Aegilops tauschii</i> (DD) whole-genome sequence. Our pipeline can be used to develop chromosome markers in closely related species for comparative cytogenetics in evolutionary and breeding studies.
topic DNA repeats
satellite DNA
tandem repeats
bioinformatics search
whole-genome sequencing
cytogenetic markers
fluorescence in situ hybridization
wheat
rye
triticale
url https://www.mdpi.com/2073-4425/10/2/113
work_keys_str_mv AT pavelkroupin pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT victoriakuznetsova pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT dmitryromanov pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT alinakocheshkova pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT gennadykarlov pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT thixuandang pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT thimailkhuat pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT ilyakirov pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT olegalexandrov pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT alexanderpolkhovskiy pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT olgarazumova pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
AT mikhaildivashuk pipelinefortherapiddevelopmentofcytogeneticmarkersusinggenomicdataofrelatedspecies
_version_ 1725332816933158912
spelling doaj-2a40879ef54e4eedacbd5fde9b9504d12020-11-25T00:29:10ZengMDPI AGGenes2073-44252019-02-0110211310.3390/genes10020113genes10020113Pipeline for the Rapid Development of Cytogenetic Markers Using Genomic Data of Related SpeciesPavel Kroupin0Victoria Kuznetsova1Dmitry Romanov2Alina Kocheshkova3Gennady Karlov4Thi Xuan Dang5Thi Mai L. Khuat6Ilya Kirov7Oleg Alexandrov8Alexander Polkhovskiy9Olga Razumova10Mikhail Divashuk11Laboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaLaboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaLaboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaCenter of Molecular Biotechnology, Russian State Agrarian University-Moscow Timiryazev Agricultural Academy, Timiryazevskaya str. 49, Moscow 127550, RussiaLaboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaCenter of Molecular Biotechnology, Russian State Agrarian University-Moscow Timiryazev Agricultural Academy, Timiryazevskaya str. 49, Moscow 127550, RussiaCenter of Molecular Biotechnology, Russian State Agrarian University-Moscow Timiryazev Agricultural Academy, Timiryazevskaya str. 49, Moscow 127550, RussiaLaboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaLaboratory of Plant Cell Engineering, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaCenter of Molecular Biotechnology, Russian State Agrarian University-Moscow Timiryazev Agricultural Academy, Timiryazevskaya str. 49, Moscow 127550, RussiaLaboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaLaboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, Moscow 127550, RussiaRepetitive DNA including tandem repeats (TRs) is a significant part of most eukaryotic genomes. TRs include rapidly evolving satellite DNA (satDNA) that can be shared by closely related species, their abundance may be associated with evolutionary divergence, and they have been widely used for chromosome karyotyping using fluorescence in situ hybridization (FISH). The recent progress in the development of whole-genome sequencing and bioinformatics tools enables rapid and cost-effective searches for TRs including satDNA that can be converted into molecular cytogenetic markers. In the case of closely related taxa, the genome sequence of one species (<i>donor</i>) can be used as a base for the development of chromosome markers for related species or genomes (<i>target</i>). Here, we present a pipeline for rapid and high-throughput screening for new satDNA TRs in whole-genome sequencing of the <i>donor</i> genome and the development of chromosome markers based on them that can be applied in the <i>target</i> genome. One of the main peculiarities of the developed pipeline is that preliminary estimation of TR abundance using qPCR and ranking found TRs according to their copy number in the <i>target</i> genome; it facilitates the selection of the most prospective (most abundant) TRs that can be converted into cytogenetic markers. Another feature of our pipeline is the probe preparation for FISH using PCR with primers designed on the aligned TR unit sequences and the genomic DNA of a <i>target</i> species as a template that enables amplification of a whole pool of monomers inherent in the chromosomes of the <i>target</i> species. We demonstrate the efficiency of the developed pipeline by the example of FISH probes developed for A, B, and R subgenome chromosomes of hexaploid triticale (BBAARR) based on a bioinformatics analysis of the D genome of <i>Aegilops tauschii</i> (DD) whole-genome sequence. Our pipeline can be used to develop chromosome markers in closely related species for comparative cytogenetics in evolutionary and breeding studies.https://www.mdpi.com/2073-4425/10/2/113DNA repeatssatellite DNAtandem repeatsbioinformatics searchwhole-genome sequencingcytogenetic markersfluorescence in situ hybridizationwheatryetriticale