SigProfilerMatrixGenerator: a tool for visualizing and exploring patterns of small mutational events

Abstract Background Cancer genomes are peppered with somatic mutations imprinted by different mutational processes. The mutational pattern of a cancer genome can be used to identify and understand the etiology of the underlying mutational processes. A plethora of prior research has focused on examin...


Bibliographic Details
Main Authors: Erik N. Bergstrom, Mi Ni Huang, Uma Mahto, Mark Barnes, Michael R. Stratton, Steven G. Rozen, Ludmil B. Alexandrov
Format: Article
Language: English
Published: BMC 2019-08-01
Series: BMC Genomics
Subjects: Somatic mutations; Mutational patterns; Mutational signatures
Online Access: http://link.springer.com/article/10.1186/s12864-019-6041-2
id doaj-15d692f5dc384868a0328f08ea0f4215
record_format Article
spelling doaj-15d692f5dc384868a0328f08ea0f4215 2020-11-25T03:48:49Z eng BMC BMC Genomics 1471-2164 2019-08-01 10.1186/s12864-019-6041-2
Erik N. Bergstrom: Department of Cellular and Molecular Medicine and Department of Bioengineering and Moores Cancer Center, University of California
Mi Ni Huang: Centre for Computational Biology and Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School
Uma Mahto: Department of Cellular and Molecular Medicine and Department of Bioengineering and Moores Cancer Center, University of California
Mark Barnes: Department of Cellular and Molecular Medicine and Department of Bioengineering and Moores Cancer Center, University of California
Michael R. Stratton: Cancer, Ageing and Somatic Mutation, Wellcome Trust Sanger Institute
Steven G. Rozen: Centre for Computational Biology and Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School
Ludmil B. Alexandrov: Department of Cellular and Molecular Medicine and Department of Bioengineering and Moores Cancer Center, University of California
collection DOAJ
language English
format Article
sources DOAJ
author Erik N. Bergstrom
Mi Ni Huang
Uma Mahto
Mark Barnes
Michael R. Stratton
Steven G. Rozen
Ludmil B. Alexandrov
title SigProfilerMatrixGenerator: a tool for visualizing and exploring patterns of small mutational events
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2019-08-01
description Abstract
Background: Cancer genomes are peppered with somatic mutations imprinted by different mutational processes. The mutational pattern of a cancer genome can be used to identify and understand the etiology of the underlying mutational processes. A plethora of prior research has focused on examining mutational signatures and mutational patterns from single base substitutions and their immediate sequencing context. We recently demonstrated that further classification of small mutational events (including substitutions, insertions, deletions, and doublet substitutions) can be used to provide a deeper understanding of the mutational processes that have molded a cancer genome. However, there has been no standard tool that allows fast, accurate, and comprehensive classification for all types of small mutational events.
Results: Here, we present SigProfilerMatrixGenerator, a computational tool designed for optimized exploration and visualization of mutational patterns for all types of small mutational events. SigProfilerMatrixGenerator is written in Python, with an R wrapper package provided for users who prefer working in an R environment. SigProfilerMatrixGenerator produces fourteen distinct matrices by considering transcriptional strand bias of individual events and by incorporating distinct classifications for single base substitutions, doublet base substitutions, and small insertions and deletions. While the tool provides a comprehensive classification of mutations, SigProfilerMatrixGenerator is also faster and more memory efficient than existing tools that generate only a single matrix.
Conclusions: SigProfilerMatrixGenerator provides a standardized method for classifying small mutational events that is both efficient and scalable to large datasets. In addition to extending the classification of single base substitutions, the tool is the first to provide support for classifying doublet base substitutions and small insertions and deletions. SigProfilerMatrixGenerator is freely available at https://github.com/AlexandrovLab/SigProfilerMatrixGenerator with extensive documentation at https://osf.io/s93d5/wiki/home/.
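The abstract refers to classifying single base substitutions by their immediate sequence context. The sketch below illustrates one widely used convention for doing so, in which each substitution is labeled by its 5' and 3' neighbors and reported relative to the pyrimidine base of the pair, giving 6 x 4 x 4 = 96 classes. This record does not confirm that the tool uses exactly this scheme, and the code is not taken from SigProfilerMatrixGenerator: the function names `sbs96_channel` and `count_channels` and the toy mutation list are hypothetical, chosen only for illustration.

```python
from collections import Counter

# Watson-Crick complements, used to flip an event onto the opposite strand.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string made of A, C, G, T."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def sbs96_channel(five_prime, ref, alt, three_prime):
    """Label a substitution as e.g. 'A[C>T]G' with a pyrimidine reference.

    Hypothetical helper for illustration; not the tool's own API.
    """
    if ref == alt:
        raise ValueError("reference and alternate bases must differ")
    if ref in ("A", "G"):
        # Purine reference: describe the same event on the other strand,
        # which reverses the order of the flanking bases as well.
        context = reverse_complement(five_prime + ref + three_prime)
        five_prime, ref, three_prime = context[0], context[1], context[2]
        alt = COMPLEMENT[alt]
    return f"{five_prime}[{ref}>{alt}]{three_prime}"


def count_channels(mutations):
    """Count how many mutations fall into each context class."""
    return Counter(sbs96_channel(*mutation) for mutation in mutations)


# Toy input (made up): tuples of (5' base, reference, alternate, 3' base).
toy_mutations = [
    ("A", "C", "T", "G"),  # already pyrimidine-centered
    ("C", "G", "A", "T"),  # same event seen from the opposite strand
    ("T", "T", "C", "A"),
]
counts = count_channels(toy_mutations)
print(counts)
```

One column of a sample-by-class matrix is then just the count vector for one sample. Extending the flanking context, or splitting each class by transcriptional strand, enlarges the set of labels without changing the counting step.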
topic Somatic mutations
Mutational patterns
Mutational signatures
url http://link.springer.com/article/10.1186/s12864-019-6041-2