S-plot2: Rapid Visual and Statistical Analysis of Genomic Sequences

With the daily release of data from whole genome sequencing projects, tools to facilitate comparative studies are hard-pressed to keep pace. Graphical software solutions can readily recognize synteny by measuring similarities between sequences. Nevertheless, regions of dissimilarity can prove to be...

Full description

Bibliographic Details
Main Authors: Laurynas Kalesinskas, Evan Cudone, Yuriy Fofanov, Catherine Putonti
Format: Article
Language:English
Published: SAGE Publishing 2018-09-01
Series:Evolutionary Bioinformatics
Online Access:https://doi.org/10.1177/1176934318797354
id doaj-98a95f54248e43f9b807e7c6b75b81db
record_format Article
spelling doaj-98a95f54248e43f9b807e7c6b75b81db2020-11-25T02:48:18ZengSAGE PublishingEvolutionary Bioinformatics1176-93432018-09-011410.1177/1176934318797354S-plot2: Rapid Visual and Statistical Analysis of Genomic SequencesLaurynas Kalesinskas0Evan Cudone1Yuriy Fofanov2Catherine Putonti3Department of Biology, Loyola University Chicago, Chicago, IL, USADepartment of Mathematics and Statistics, Loyola University Chicago, Chicago, IL, USADepartment of Pharmacology and Toxicology, The University of Texas Medical Branch at Galveston, Galveston, TX, USADepartment of Computer Science, Loyola University Chicago, Chicago, IL, USAWith the daily release of data from whole genome sequencing projects, tools to facilitate comparative studies are hard-pressed to keep pace. Graphical software solutions can readily recognize synteny by measuring similarities between sequences. Nevertheless, regions of dissimilarity can prove to be equally informative; these regions may harbor genes acquired via lateral gene transfer (LGT), signify gene loss or gain, or include coding regions under strong selection. Previously, we developed the software S-plot. This tool employed an alignment-free approach for comparing bacterial genomes and generated a heatmap representing the genomes’ similarities and dissimilarities in nucleotide usage. In prior studies, this tool proved valuable in identifying genome rearrangements as well as exogenous sequences acquired via LGT in several bacterial species. Herein, we present the next generation of this tool, S-plot2. Similar to its predecessor, S-plot2 creates an interactive, 2-dimensional heatmap capturing the similarities and dissimilarities in nucleotide usage between genomic sequences (partial or complete). This new version, however, includes additional metrics for analysis, new reporting options, and integrated BLAST query functionality for the user to interrogate regions of interest. Furthermore, S-plot2 can evaluate larger sequences, including whole eukaryotic chromosomes. To illustrate some of the applications of the tool, 2 case studies are presented. The first examines strain-specific variation across the Pseudomonas aeruginosa genome and strain-specific LGT events. In the second case study, corresponding human, chimpanzee, and rhesus macaque autosomes were studied and lineage specific contributions to divergence were estimated. S-plot2 provides a means to both visually and quantitatively compare nucleotide sequences, from microbial genomes to eukaryotic chromosomes. The case studies presented illustrate just 2 potential applications of the tool, highlighting its capability to identify and investigate the variation in molecular divergence rates across sequences. S-plot2 is freely available through https://bitbucket.org/lkalesinskas/splot and is supported on the Linux and MS Windows operating systems.https://doi.org/10.1177/1176934318797354
collection DOAJ
language English
format Article
sources DOAJ
author Laurynas Kalesinskas
Evan Cudone
Yuriy Fofanov
Catherine Putonti
spellingShingle Laurynas Kalesinskas
Evan Cudone
Yuriy Fofanov
Catherine Putonti
S-plot2: Rapid Visual and Statistical Analysis of Genomic Sequences
Evolutionary Bioinformatics
author_facet Laurynas Kalesinskas
Evan Cudone
Yuriy Fofanov
Catherine Putonti
author_sort Laurynas Kalesinskas
title S-plot2: Rapid Visual and Statistical Analysis of Genomic Sequences
title_short S-plot2: Rapid Visual and Statistical Analysis of Genomic Sequences
title_full S-plot2: Rapid Visual and Statistical Analysis of Genomic Sequences
title_fullStr S-plot2: Rapid Visual and Statistical Analysis of Genomic Sequences
title_full_unstemmed S-plot2: Rapid Visual and Statistical Analysis of Genomic Sequences
title_sort s-plot2: rapid visual and statistical analysis of genomic sequences
publisher SAGE Publishing
series Evolutionary Bioinformatics
issn 1176-9343
publishDate 2018-09-01
description With the daily release of data from whole genome sequencing projects, tools to facilitate comparative studies are hard-pressed to keep pace. Graphical software solutions can readily recognize synteny by measuring similarities between sequences. Nevertheless, regions of dissimilarity can prove to be equally informative; these regions may harbor genes acquired via lateral gene transfer (LGT), signify gene loss or gain, or include coding regions under strong selection. Previously, we developed the software S-plot. This tool employed an alignment-free approach for comparing bacterial genomes and generated a heatmap representing the genomes’ similarities and dissimilarities in nucleotide usage. In prior studies, this tool proved valuable in identifying genome rearrangements as well as exogenous sequences acquired via LGT in several bacterial species. Herein, we present the next generation of this tool, S-plot2. Similar to its predecessor, S-plot2 creates an interactive, 2-dimensional heatmap capturing the similarities and dissimilarities in nucleotide usage between genomic sequences (partial or complete). This new version, however, includes additional metrics for analysis, new reporting options, and integrated BLAST query functionality for the user to interrogate regions of interest. Furthermore, S-plot2 can evaluate larger sequences, including whole eukaryotic chromosomes. To illustrate some of the applications of the tool, 2 case studies are presented. The first examines strain-specific variation across the Pseudomonas aeruginosa genome and strain-specific LGT events. In the second case study, corresponding human, chimpanzee, and rhesus macaque autosomes were studied and lineage specific contributions to divergence were estimated. S-plot2 provides a means to both visually and quantitatively compare nucleotide sequences, from microbial genomes to eukaryotic chromosomes. The case studies presented illustrate just 2 potential applications of the tool, highlighting its capability to identify and investigate the variation in molecular divergence rates across sequences. S-plot2 is freely available through https://bitbucket.org/lkalesinskas/splot and is supported on the Linux and MS Windows operating systems.
url https://doi.org/10.1177/1176934318797354
work_keys_str_mv AT laurynaskalesinskas splot2rapidvisualandstatisticalanalysisofgenomicsequences
AT evancudone splot2rapidvisualandstatisticalanalysisofgenomicsequences
AT yuriyfofanov splot2rapidvisualandstatisticalanalysisofgenomicsequences
AT catherineputonti splot2rapidvisualandstatisticalanalysisofgenomicsequences
_version_ 1724748618954440704