Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]

Short tandem repeats (STRs), also known as microsatellites, are commonly defined as consisting of tandemly repeated nucleotide motifs of 2–6 base pairs in length. STRs appear throughout the human genome, and about 239,000 are documented in the Simple Repeats Track available from the UCSC (University...

Full description

Bibliographic Details
Main Authors: Melanie Bahlo, Mark F Bennett, Peter Degorski, Rick M Tankard, Martin B Delatycki, Paul J Lockhart
Format: Article
Language:English
Published: F1000 Research Ltd 2018-06-01
Series:F1000Research
Online Access:https://f1000research.com/articles/7-736/v1
id doaj-c1be5a076927498ba4d8b3464972cfdc
record_format Article
spelling doaj-c1be5a076927498ba4d8b3464972cfdc2020-11-25T03:49:14ZengF1000 Research LtdF1000Research2046-14022018-06-01710.12688/f1000research.13980.115195Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]Melanie Bahlo0Mark F Bennett1Peter Degorski2Rick M Tankard3Martin B Delatycki4Paul J Lockhart5Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, AustraliaPopulation Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, AustraliaPopulation Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, AustraliaMathematics and Statistics, Murdoch University, Murdoch, AustraliaBruce Lefroy Centre for Genetic Health Research, Murdoch Children’s Research Institute, Royal Children’s Hospital, Parkville, Victoria, AustraliaBruce Lefroy Centre for Genetic Health Research, Murdoch Children’s Research Institute, Royal Children’s Hospital, Parkville, Victoria, AustraliaShort tandem repeats (STRs), also known as microsatellites, are commonly defined as consisting of tandemly repeated nucleotide motifs of 2–6 base pairs in length. STRs appear throughout the human genome, and about 239,000 are documented in the Simple Repeats Track available from the UCSC (University of California, Santa Cruz) genome browser. STRs vary in size, producing highly polymorphic markers commonly used as genetic markers. A small fraction of STRs (about 30 loci) have been associated with human disease whereby one or both alleles exceed an STR-specific threshold in size, leading to disease. Detection of repeat expansions is currently performed with polymerase chain reaction–based assays or with Southern blots for large expansions. The tests are expensive and time-consuming and are not always conclusive, leading to lengthy diagnostic journeys for patients, potentially including missed diagnoses. The advent of whole exome and whole genome sequencing has identified the genetic cause of many genetic disorders; however, analysis pipelines are focused primarily on the detection of short nucleotide variations and short insertions and deletions (indels). Until recently, repeat expansions, with the exception of the smallest expansion (SCA6), were not detectable in next-generation short-read sequencing datasets and would have been ignored in most analyses. In the last two years, four analysis methods with accompanying software (ExpansionHunter, exSTRa, STRetch, and TREDPARSE) have been released. Although a comprehensive comparative analysis of the performance of these methods across all known repeat expansions is still lacking, it is clear that these methods are a valuable addition to any existing analysis pipeline. Here, we detail how to assess short-read data for evidence of expansions, reviewing all four methods and outlining their strengths and weaknesses. Implementation of these methods should lead to increased diagnostic yield of repeat expansion disorders for known STR loci and has the potential to detect novel repeat expansions.https://f1000research.com/articles/7-736/v1
collection DOAJ
language English
format Article
sources DOAJ
author Melanie Bahlo
Mark F Bennett
Peter Degorski
Rick M Tankard
Martin B Delatycki
Paul J Lockhart
spellingShingle Melanie Bahlo
Mark F Bennett
Peter Degorski
Rick M Tankard
Martin B Delatycki
Paul J Lockhart
Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]
F1000Research
author_facet Melanie Bahlo
Mark F Bennett
Peter Degorski
Rick M Tankard
Martin B Delatycki
Paul J Lockhart
author_sort Melanie Bahlo
title Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]
title_short Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]
title_full Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]
title_fullStr Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]
title_full_unstemmed Recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]
title_sort recent advances in the detection of repeat expansions with short-read next-generation sequencing [version 1; referees: 3 approved]
publisher F1000 Research Ltd
series F1000Research
issn 2046-1402
publishDate 2018-06-01
description Short tandem repeats (STRs), also known as microsatellites, are commonly defined as consisting of tandemly repeated nucleotide motifs of 2–6 base pairs in length. STRs appear throughout the human genome, and about 239,000 are documented in the Simple Repeats Track available from the UCSC (University of California, Santa Cruz) genome browser. STRs vary in size, producing highly polymorphic markers commonly used as genetic markers. A small fraction of STRs (about 30 loci) have been associated with human disease whereby one or both alleles exceed an STR-specific threshold in size, leading to disease. Detection of repeat expansions is currently performed with polymerase chain reaction–based assays or with Southern blots for large expansions. The tests are expensive and time-consuming and are not always conclusive, leading to lengthy diagnostic journeys for patients, potentially including missed diagnoses. The advent of whole exome and whole genome sequencing has identified the genetic cause of many genetic disorders; however, analysis pipelines are focused primarily on the detection of short nucleotide variations and short insertions and deletions (indels). Until recently, repeat expansions, with the exception of the smallest expansion (SCA6), were not detectable in next-generation short-read sequencing datasets and would have been ignored in most analyses. In the last two years, four analysis methods with accompanying software (ExpansionHunter, exSTRa, STRetch, and TREDPARSE) have been released. Although a comprehensive comparative analysis of the performance of these methods across all known repeat expansions is still lacking, it is clear that these methods are a valuable addition to any existing analysis pipeline. Here, we detail how to assess short-read data for evidence of expansions, reviewing all four methods and outlining their strengths and weaknesses. Implementation of these methods should lead to increased diagnostic yield of repeat expansion disorders for known STR loci and has the potential to detect novel repeat expansions.
url https://f1000research.com/articles/7-736/v1
work_keys_str_mv AT melaniebahlo recentadvancesinthedetectionofrepeatexpansionswithshortreadnextgenerationsequencingversion1referees3approved
AT markfbennett recentadvancesinthedetectionofrepeatexpansionswithshortreadnextgenerationsequencingversion1referees3approved
AT peterdegorski recentadvancesinthedetectionofrepeatexpansionswithshortreadnextgenerationsequencingversion1referees3approved
AT rickmtankard recentadvancesinthedetectionofrepeatexpansionswithshortreadnextgenerationsequencingversion1referees3approved
AT martinbdelatycki recentadvancesinthedetectionofrepeatexpansionswithshortreadnextgenerationsequencingversion1referees3approved
AT pauljlockhart recentadvancesinthedetectionofrepeatexpansionswithshortreadnextgenerationsequencingversion1referees3approved
_version_ 1724496573772070912