5UP1 image
Entry Detail
PDB ID:
5UP1
Keywords:
Title:
Solution structure of the de novo mini protein EEHEE_rd3_1049
Biological Source:
Source Organism:
Host Organism:
PDB Version:
Deposition Date:
2017-02-01
Release Date:
2017-07-26
Method Details:
Experimental Method:
Conformers Calculated:
100
Conformers Submitted:
20
Selection Criteria:
structures with the lowest energy
Macromolecular Entities
Polymer Type:polypeptide(L)
Description:EEHEE_rd3_1049
Chain IDs:A
Chain Length:64
Number of Molecules:1
Biological Source:Escherichia coli
Ligand Molecules
Primary Citation
Global analysis of protein folding using massively parallel design, synthesis, and testing.
Science 357 168 175 (2017)
PMID: 28706065 DOI: 10.1126/science.aan0693

Abstact

Proteins fold into unique native structures stabilized by thousands of weak interactions that collectively overcome the entropic cost of folding. Although these forces are "encoded" in the thousands of known protein structures, "decoding" them is challenging because of the complexity of natural proteins that have evolved for function, not stability. We combined computational protein design, next-generation gene synthesis, and a high-throughput protease susceptibility assay to measure folding and stability for more than 15,000 de novo designed miniproteins, 1000 natural proteins, 10,000 point mutants, and 30,000 negative control sequences. This analysis identified more than 2500 stable designed proteins in four basic folds-a number sufficient to enable us to systematically examine how sequence determines folding and stability in uncharted protein space. Iteration between design and experiment increased the design success rate from 6% to 47%, produced stable proteins unlike those found in nature for topologies where design was initially unsuccessful, and revealed subtle contributions to stability as designs became increasingly optimized. Our approach achieves the long-standing goal of a tight feedback cycle between computation and experiment and has the potential to transform computational protein design into a data-driven science.

Legend

Protein

Chemical

Disease

Primary Citation of related structures