Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.

BACKGROUND: Prophages are integrated viral forms in bacterial genomes that have been found to contribute to interstrain genetic variability. Many virulence-associated genes are reported to be prophage encoded. Present computational methods to detect prophages are either by identifying possible essen...

Full description

Bibliographic Details
Main Authors: K V Srividhya, V Alaguraj, G Poornima, Dinesh Kumar, G P Singh, L Raghavenderan, A V S K Mohan Katta, Preeti Mehta, S Krishnaswamy
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2007-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC2075365?pdf=render
id doaj-13a4b711576c45968eb82e234d63f926
record_format Article
spelling doaj-13a4b711576c45968eb82e234d63f9262020-11-25T01:57:54ZengPublic Library of Science (PLoS)PLoS ONE1932-62032007-01-01211e119310.1371/journal.pone.0001193Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.K V SrividhyaV AlagurajG PoornimaDinesh KumarG P SinghL RaghavenderanA V S K Mohan KattaPreeti MehtaS KrishnaswamyBACKGROUND: Prophages are integrated viral forms in bacterial genomes that have been found to contribute to interstrain genetic variability. Many virulence-associated genes are reported to be prophage encoded. Present computational methods to detect prophages are either by identifying possible essential proteins such as integrases or by an extension of this technique, which involves identifying a region containing proteins similar to those occurring in prophages. These methods suffer due to the problem of low sequence similarity at the protein level, which suggests that a nucleotide based approach could be useful. METHODOLOGY: Earlier dinucleotide relative abundance (DRA) have been used to identify regions, which deviate from the neighborhood areas, in genomes. We have used the difference in the dinucleotide relative abundance (DRAD) between the bacterial and prophage DNA to aid location of DNA stretches that could be of prophage origin in bacterial genomes. Prophage sequences which deviate from bacterial regions in their dinucleotide frequencies are detected by scanning bacterial genome sequences. The method was validated using a subset of genomes with prophage data from literature reports. A web interface for prophage scan based on this method is available at http://bicmku.in:8082/prophagedb/dra.html. Two hundred bacterial genomes which do not have annotated prophages have been scanned for prophage regions using this method. CONCLUSIONS: The relative dinucleotide distribution difference helps detect prophage regions in genome sequences. The usefulness of this method is seen in the identification of 461 highly probable loci pertaining to prophages which have not been annotated so earlier. This work emphasizes the need to extend the efforts to detect and annotate prophage elements in genome sequences.http://europepmc.org/articles/PMC2075365?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author K V Srividhya
V Alaguraj
G Poornima
Dinesh Kumar
G P Singh
L Raghavenderan
A V S K Mohan Katta
Preeti Mehta
S Krishnaswamy
spellingShingle K V Srividhya
V Alaguraj
G Poornima
Dinesh Kumar
G P Singh
L Raghavenderan
A V S K Mohan Katta
Preeti Mehta
S Krishnaswamy
Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.
PLoS ONE
author_facet K V Srividhya
V Alaguraj
G Poornima
Dinesh Kumar
G P Singh
L Raghavenderan
A V S K Mohan Katta
Preeti Mehta
S Krishnaswamy
author_sort K V Srividhya
title Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.
title_short Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.
title_full Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.
title_fullStr Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.
title_full_unstemmed Identification of prophages in bacterial genomes by dinucleotide relative abundance difference.
title_sort identification of prophages in bacterial genomes by dinucleotide relative abundance difference.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2007-01-01
description BACKGROUND: Prophages are integrated viral forms in bacterial genomes that have been found to contribute to interstrain genetic variability. Many virulence-associated genes are reported to be prophage encoded. Present computational methods to detect prophages are either by identifying possible essential proteins such as integrases or by an extension of this technique, which involves identifying a region containing proteins similar to those occurring in prophages. These methods suffer due to the problem of low sequence similarity at the protein level, which suggests that a nucleotide based approach could be useful. METHODOLOGY: Earlier dinucleotide relative abundance (DRA) have been used to identify regions, which deviate from the neighborhood areas, in genomes. We have used the difference in the dinucleotide relative abundance (DRAD) between the bacterial and prophage DNA to aid location of DNA stretches that could be of prophage origin in bacterial genomes. Prophage sequences which deviate from bacterial regions in their dinucleotide frequencies are detected by scanning bacterial genome sequences. The method was validated using a subset of genomes with prophage data from literature reports. A web interface for prophage scan based on this method is available at http://bicmku.in:8082/prophagedb/dra.html. Two hundred bacterial genomes which do not have annotated prophages have been scanned for prophage regions using this method. CONCLUSIONS: The relative dinucleotide distribution difference helps detect prophage regions in genome sequences. The usefulness of this method is seen in the identification of 461 highly probable loci pertaining to prophages which have not been annotated so earlier. This work emphasizes the need to extend the efforts to detect and annotate prophage elements in genome sequences.
url http://europepmc.org/articles/PMC2075365?pdf=render
work_keys_str_mv AT kvsrividhya identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT valaguraj identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT gpoornima identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT dineshkumar identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT gpsingh identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT lraghavenderan identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT avskmohankatta identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT preetimehta identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
AT skrishnaswamy identificationofprophagesinbacterialgenomesbydinucleotiderelativeabundancedifference
_version_ 1724971816186806272