Protein structure prediction enhanced with evolutionary diversity: SPEED

Joe Debartolo, Glen Hocky, Michael Wilde, Jinbo Xu, Karl F. Freed, Tobin R. Sosnick

Research output: Contribution to journalArticle

Abstract

For naturally occurring proteins, similar sequence implies similar structure. Consequently, multiple sequence alignments (MSAs) often are used in template-based modeling of protein structure and have been incorporated into fragment-based assembly methods. Our previous homology-free structure prediction study introduced an algorithm that mimics the folding pathway by coupling the formation of secondary and tertiary structure. Moves in the Monte Carlo procedure involve only a change in a single pair of Φ,ψ backbone dihedral angles that are obtained from a Protein Data Bank-based distribution appropriate for each amino acid, conditional on the type and conformation of the flanking residues. We improve this method by using MSAs to enrich the sampling distribution, but in a manner that does not require structural knowledge of any protein sequence (i.e., not homologous fragment insertion). In combination with other tools, including clustering and refinement, the accuracies of the predicted secondary and tertiary structures are substantially improved and a global and position-resolved measure of confidence is introduced for the accuracy of the predictions. Performance of the method in the Critical Assessment of Structure Prediction (CASP8) is discussed. Published by Wiley-Blackwell.

Original languageEnglish (US)
Pages (from-to)520-534
Number of pages15
JournalProtein Science
Volume19
Issue number3
DOIs
StatePublished - Mar 1 2010

Fingerprint

Sequence Alignment
Proteins
Dihedral angle
Cluster Analysis
Conformations
Databases
Sampling
Amino Acids

Keywords

  • Folding pathway
  • ItFix
  • Monte Carlo simulated annealing
  • Multiple sequence alignment
  • Protein folding
  • Statistical potential

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology

Cite this

Debartolo, J., Hocky, G., Wilde, M., Xu, J., Freed, K. F., & Sosnick, T. R. (2010). Protein structure prediction enhanced with evolutionary diversity: SPEED. Protein Science, 19(3), 520-534. https://doi.org/10.1002/pro.330

Protein structure prediction enhanced with evolutionary diversity : SPEED. / Debartolo, Joe; Hocky, Glen; Wilde, Michael; Xu, Jinbo; Freed, Karl F.; Sosnick, Tobin R.

In: Protein Science, Vol. 19, No. 3, 01.03.2010, p. 520-534.

Research output: Contribution to journalArticle

Debartolo, J, Hocky, G, Wilde, M, Xu, J, Freed, KF & Sosnick, TR 2010, 'Protein structure prediction enhanced with evolutionary diversity: SPEED', Protein Science, vol. 19, no. 3, pp. 520-534. https://doi.org/10.1002/pro.330
Debartolo, Joe ; Hocky, Glen ; Wilde, Michael ; Xu, Jinbo ; Freed, Karl F. ; Sosnick, Tobin R. / Protein structure prediction enhanced with evolutionary diversity : SPEED. In: Protein Science. 2010 ; Vol. 19, No. 3. pp. 520-534.
@article{08a18c21e37345b096e7843a5253c9f3,
title = "Protein structure prediction enhanced with evolutionary diversity: SPEED",
abstract = "For naturally occurring proteins, similar sequence implies similar structure. Consequently, multiple sequence alignments (MSAs) often are used in template-based modeling of protein structure and have been incorporated into fragment-based assembly methods. Our previous homology-free structure prediction study introduced an algorithm that mimics the folding pathway by coupling the formation of secondary and tertiary structure. Moves in the Monte Carlo procedure involve only a change in a single pair of Φ,ψ backbone dihedral angles that are obtained from a Protein Data Bank-based distribution appropriate for each amino acid, conditional on the type and conformation of the flanking residues. We improve this method by using MSAs to enrich the sampling distribution, but in a manner that does not require structural knowledge of any protein sequence (i.e., not homologous fragment insertion). In combination with other tools, including clustering and refinement, the accuracies of the predicted secondary and tertiary structures are substantially improved and a global and position-resolved measure of confidence is introduced for the accuracy of the predictions. Performance of the method in the Critical Assessment of Structure Prediction (CASP8) is discussed. Published by Wiley-Blackwell.",
keywords = "Folding pathway, ItFix, Monte Carlo simulated annealing, Multiple sequence alignment, Protein folding, Statistical potential",
author = "Joe Debartolo and Glen Hocky and Michael Wilde and Jinbo Xu and Freed, {Karl F.} and Sosnick, {Tobin R.}",
year = "2010",
month = "3",
day = "1",
doi = "10.1002/pro.330",
language = "English (US)",
volume = "19",
pages = "520--534",
journal = "Protein Science",
issn = "0961-8368",
publisher = "Cold Spring Harbor Laboratory Press",
number = "3",

}

TY - JOUR

T1 - Protein structure prediction enhanced with evolutionary diversity

T2 - SPEED

AU - Debartolo, Joe

AU - Hocky, Glen

AU - Wilde, Michael

AU - Xu, Jinbo

AU - Freed, Karl F.

AU - Sosnick, Tobin R.

PY - 2010/3/1

Y1 - 2010/3/1

N2 - For naturally occurring proteins, similar sequence implies similar structure. Consequently, multiple sequence alignments (MSAs) often are used in template-based modeling of protein structure and have been incorporated into fragment-based assembly methods. Our previous homology-free structure prediction study introduced an algorithm that mimics the folding pathway by coupling the formation of secondary and tertiary structure. Moves in the Monte Carlo procedure involve only a change in a single pair of Φ,ψ backbone dihedral angles that are obtained from a Protein Data Bank-based distribution appropriate for each amino acid, conditional on the type and conformation of the flanking residues. We improve this method by using MSAs to enrich the sampling distribution, but in a manner that does not require structural knowledge of any protein sequence (i.e., not homologous fragment insertion). In combination with other tools, including clustering and refinement, the accuracies of the predicted secondary and tertiary structures are substantially improved and a global and position-resolved measure of confidence is introduced for the accuracy of the predictions. Performance of the method in the Critical Assessment of Structure Prediction (CASP8) is discussed. Published by Wiley-Blackwell.

AB - For naturally occurring proteins, similar sequence implies similar structure. Consequently, multiple sequence alignments (MSAs) often are used in template-based modeling of protein structure and have been incorporated into fragment-based assembly methods. Our previous homology-free structure prediction study introduced an algorithm that mimics the folding pathway by coupling the formation of secondary and tertiary structure. Moves in the Monte Carlo procedure involve only a change in a single pair of Φ,ψ backbone dihedral angles that are obtained from a Protein Data Bank-based distribution appropriate for each amino acid, conditional on the type and conformation of the flanking residues. We improve this method by using MSAs to enrich the sampling distribution, but in a manner that does not require structural knowledge of any protein sequence (i.e., not homologous fragment insertion). In combination with other tools, including clustering and refinement, the accuracies of the predicted secondary and tertiary structures are substantially improved and a global and position-resolved measure of confidence is introduced for the accuracy of the predictions. Performance of the method in the Critical Assessment of Structure Prediction (CASP8) is discussed. Published by Wiley-Blackwell.

KW - Folding pathway

KW - ItFix

KW - Monte Carlo simulated annealing

KW - Multiple sequence alignment

KW - Protein folding

KW - Statistical potential

UR - http://www.scopus.com/inward/record.url?scp=77249098840&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77249098840&partnerID=8YFLogxK

U2 - 10.1002/pro.330

DO - 10.1002/pro.330

M3 - Article

VL - 19

SP - 520

EP - 534

JO - Protein Science

JF - Protein Science

SN - 0961-8368

IS - 3

ER -