Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.

Off-target oligoprobe's interaction with partially complementary nucleotide sequences represents a problem for many bio-techniques. The goal of the study was to identify oligoprobe sequence characteristics that control the ratio between on-target and off-target hybridization. To understand the...

Full description

Bibliographic Details
Main Authors: Olga V Matveeva, Aleksey Y Ogurtsov, Nafisa N Nazipova, Svetlana A Shabalina
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC6013149?pdf=render
id doaj-0070b70d07534ea0948c340435d7dbc8
record_format Article
spelling doaj-0070b70d07534ea0948c340435d7dbc82020-11-24T22:11:46ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01136e019916210.1371/journal.pone.0199162Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.Olga V MatveevaAleksey Y OgurtsovNafisa N NazipovaSvetlana A ShabalinaOff-target oligoprobe's interaction with partially complementary nucleotide sequences represents a problem for many bio-techniques. The goal of the study was to identify oligoprobe sequence characteristics that control the ratio between on-target and off-target hybridization. To understand the complex interplay between specific and genome-wide off-target (cross-hybridization) signals, we analyzed a database derived from genomic comparison hybridization experiments performed with an Affymetrix tiling array. The database included two types of probes with signals derived from (i) a combination of specific signal and cross-hybridization and (ii) genomic cross-hybridization only. All probes from the database were grouped into bins according to their sequence characteristics, where both hybridization signals were averaged separately. For selection of specific probes, we analyzed the following sequence characteristics: vulnerability to self-folding, nucleotide composition bias, numbers of G nucleotides and GGG-blocks, and occurrence of probe's k-mers in the human genome. Increases in bin ranges for these characteristics are simultaneously accompanied by a decrease in hybridization specificity-the ratio between specific and cross-hybridization signals. However, both averaged hybridization signals exhibit growing trends along with an increase of probes' binding energy, where the hybridization specific signal increases significantly faster in comparison to the cross-hybridization. The same trend is evident for the S function, which serves as a combined evaluation of probe binding energy and occurrence of probe's k-mers in the genome. Application of S allows extracting a larger number of specific probes, as compared to using only binding energy. Thus, we showed that high values of specific and cross-hybridization signals are not mutually exclusive for probes with high values of binding energy and S. In this study, the application of a new set of sequence characteristics allows detection of probes that are highly specific to their targets for array design and other bio-techniques that require selection of specific probes.http://europepmc.org/articles/PMC6013149?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Olga V Matveeva
Aleksey Y Ogurtsov
Nafisa N Nazipova
Svetlana A Shabalina
spellingShingle Olga V Matveeva
Aleksey Y Ogurtsov
Nafisa N Nazipova
Svetlana A Shabalina
Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.
PLoS ONE
author_facet Olga V Matveeva
Aleksey Y Ogurtsov
Nafisa N Nazipova
Svetlana A Shabalina
author_sort Olga V Matveeva
title Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.
title_short Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.
title_full Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.
title_fullStr Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.
title_full_unstemmed Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.
title_sort sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description Off-target oligoprobe's interaction with partially complementary nucleotide sequences represents a problem for many bio-techniques. The goal of the study was to identify oligoprobe sequence characteristics that control the ratio between on-target and off-target hybridization. To understand the complex interplay between specific and genome-wide off-target (cross-hybridization) signals, we analyzed a database derived from genomic comparison hybridization experiments performed with an Affymetrix tiling array. The database included two types of probes with signals derived from (i) a combination of specific signal and cross-hybridization and (ii) genomic cross-hybridization only. All probes from the database were grouped into bins according to their sequence characteristics, where both hybridization signals were averaged separately. For selection of specific probes, we analyzed the following sequence characteristics: vulnerability to self-folding, nucleotide composition bias, numbers of G nucleotides and GGG-blocks, and occurrence of probe's k-mers in the human genome. Increases in bin ranges for these characteristics are simultaneously accompanied by a decrease in hybridization specificity-the ratio between specific and cross-hybridization signals. However, both averaged hybridization signals exhibit growing trends along with an increase of probes' binding energy, where the hybridization specific signal increases significantly faster in comparison to the cross-hybridization. The same trend is evident for the S function, which serves as a combined evaluation of probe binding energy and occurrence of probe's k-mers in the genome. Application of S allows extracting a larger number of specific probes, as compared to using only binding energy. Thus, we showed that high values of specific and cross-hybridization signals are not mutually exclusive for probes with high values of binding energy and S. In this study, the application of a new set of sequence characteristics allows detection of probes that are highly specific to their targets for array design and other bio-techniques that require selection of specific probes.
url http://europepmc.org/articles/PMC6013149?pdf=render
work_keys_str_mv AT olgavmatveeva sequencecharacteristicsdefinetradeoffsbetweenontargetandgenomewideofftargethybridizationofoligoprobes
AT alekseyyogurtsov sequencecharacteristicsdefinetradeoffsbetweenontargetandgenomewideofftargethybridizationofoligoprobes
AT nafisannazipova sequencecharacteristicsdefinetradeoffsbetweenontargetandgenomewideofftargethybridizationofoligoprobes
AT svetlanaashabalina sequencecharacteristicsdefinetradeoffsbetweenontargetandgenomewideofftargethybridizationofoligoprobes
_version_ 1725804259392356352