High quality maize centromere 10 sequence reveals evidence of frequent recombination events

The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) have presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here we report...

Full description

Bibliographic Details
Main Authors: Thomas Kai Wolfgruber, Megan M Nakashima, Kevin L Schneider, Anupma eSharma, Zidian eXie, Patrice S Albert, Ronghui eXu, Paul eBilinski, R. Kelly Dawe, Jeffrey eRoss-Ibarra, James A. Birchler, Gernot ePresting
Format: Article
Language:English
Published: Frontiers Media S.A. 2016-03-01
Series:Frontiers in Plant Science
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fpls.2016.00308/full
Description
Summary:The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) have presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 x 10-6 and 5 x 10-5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb of the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length centromeric retrotransposons from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB) repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. This repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to facilitate the repair of frequent DSBs in centromeres.
ISSN:1664-462X