Fast design of arbitrary length loops in proteins using InteractiveRosetta

Abstract Background With increasing interest in ab initio protein design, there is a desire to be able to fully explore the design space of insertions and deletions. Nature inserts and deletes residues to optimize energy and function, but allowing variable length indels in the context of an interact...

Full description

Bibliographic Details
Main Authors: William F. Hooper, Benjamin D. Walcott, Xing Wang, Christopher Bystroff
Format: Article
Language:English
Published: BMC 2018-09-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-018-2345-5
id doaj-7969d0d5b20945b6b70f4931d4d5bad0
record_format Article
spelling doaj-7969d0d5b20945b6b70f4931d4d5bad02020-11-25T01:29:10ZengBMCBMC Bioinformatics1471-21052018-09-011911910.1186/s12859-018-2345-5Fast design of arbitrary length loops in proteins using InteractiveRosettaWilliam F. Hooper0Benjamin D. Walcott1Xing Wang2Christopher Bystroff3Emmes CorporationDepartment of Biology, Rensselaer Polytechnic InstituteDepartment of Chemistry and Chemical Biology, Rensselaer Polytechnic InstituteDepartment of Biology, Rensselaer Polytechnic InstituteAbstract Background With increasing interest in ab initio protein design, there is a desire to be able to fully explore the design space of insertions and deletions. Nature inserts and deletes residues to optimize energy and function, but allowing variable length indels in the context of an interactive protein design session presents challenges with regard to speed and accuracy. Results Here we present a new module (INDEL) for InteractiveRosetta which allows the user to specify a range of lengths for a desired indel, and which returns a set of low energy backbones in a matter of seconds. To make the loop search fast, loop anchor points are geometrically hashed using C α-C α and C β-C β distances, and the hash is mapped to start and end points in a pre-compiled random access file of non-redundant, protein backbone coordinates. Loops with superposable anchors are filtered for collisions and returned to InteractiveRosetta as poly-alanine for display and selective incorporation into the design template. Sidechains can then be added using RosettaDesign tools. Conclusions INDEL was able to find viable loops in 100% of 500 attempts for all lengths from 3 to 20 residues. INDEL has been applied to the task of designing a domain-swapping loop for T7-endonuclease I, changing its specificity from Holliday junctions to paranemic crossover (PX) DNA.http://link.springer.com/article/10.1186/s12859-018-2345-5IndelBystroffInteractiveRosettaRosettaPyRosettaT7 endonuclease I
collection DOAJ
language English
format Article
sources DOAJ
author William F. Hooper
Benjamin D. Walcott
Xing Wang
Christopher Bystroff
spellingShingle William F. Hooper
Benjamin D. Walcott
Xing Wang
Christopher Bystroff
Fast design of arbitrary length loops in proteins using InteractiveRosetta
BMC Bioinformatics
Indel
Bystroff
InteractiveRosetta
Rosetta
PyRosetta
T7 endonuclease I
author_facet William F. Hooper
Benjamin D. Walcott
Xing Wang
Christopher Bystroff
author_sort William F. Hooper
title Fast design of arbitrary length loops in proteins using InteractiveRosetta
title_short Fast design of arbitrary length loops in proteins using InteractiveRosetta
title_full Fast design of arbitrary length loops in proteins using InteractiveRosetta
title_fullStr Fast design of arbitrary length loops in proteins using InteractiveRosetta
title_full_unstemmed Fast design of arbitrary length loops in proteins using InteractiveRosetta
title_sort fast design of arbitrary length loops in proteins using interactiverosetta
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2018-09-01
description Abstract Background With increasing interest in ab initio protein design, there is a desire to be able to fully explore the design space of insertions and deletions. Nature inserts and deletes residues to optimize energy and function, but allowing variable length indels in the context of an interactive protein design session presents challenges with regard to speed and accuracy. Results Here we present a new module (INDEL) for InteractiveRosetta which allows the user to specify a range of lengths for a desired indel, and which returns a set of low energy backbones in a matter of seconds. To make the loop search fast, loop anchor points are geometrically hashed using C α-C α and C β-C β distances, and the hash is mapped to start and end points in a pre-compiled random access file of non-redundant, protein backbone coordinates. Loops with superposable anchors are filtered for collisions and returned to InteractiveRosetta as poly-alanine for display and selective incorporation into the design template. Sidechains can then be added using RosettaDesign tools. Conclusions INDEL was able to find viable loops in 100% of 500 attempts for all lengths from 3 to 20 residues. INDEL has been applied to the task of designing a domain-swapping loop for T7-endonuclease I, changing its specificity from Holliday junctions to paranemic crossover (PX) DNA.
topic Indel
Bystroff
InteractiveRosetta
Rosetta
PyRosetta
T7 endonuclease I
url http://link.springer.com/article/10.1186/s12859-018-2345-5
work_keys_str_mv AT williamfhooper fastdesignofarbitrarylengthloopsinproteinsusinginteractiverosetta
AT benjamindwalcott fastdesignofarbitrarylengthloopsinproteinsusinginteractiverosetta
AT xingwang fastdesignofarbitrarylengthloopsinproteinsusinginteractiverosetta
AT christopherbystroff fastdesignofarbitrarylengthloopsinproteinsusinginteractiverosetta
_version_ 1725098097126670336