8WWC image
Deposition Date 2023-10-25
Release Date 2024-10-09
Last Version Date 2025-05-07
Entry Detail
PDB ID:
8WWC
Keywords:
Title:
De novo design binder of HRAS -120-4
Biological Source:
Source Organism:
Host Organism:
Method Details:
Experimental Method:
Resolution:
2.80 Å
R-Value Free:
0.24
R-Value Work:
0.21
R-Value Observed:
0.22
Space Group:
P 63
Macromolecular Entities
Structures with similar UniProt ID
Protein Blast
Polymer Type:polypeptide(L)
Molecule:GTPase HRas
Gene (Uniprot):HRAS
Chain IDs:B (auth: A), C (auth: B)
Chain Length:166
Number of Molecules:2
Biological Source:Homo sapiens
Polymer Type:polypeptide(L)
Molecule:De novo design protein 120-4
Chain IDs:A (auth: C), D
Chain Length:120
Number of Molecules:2
Biological Source:synthetic construct
Primary Citation
De novo protein design with a denoising diffusion network independent of pretrained structure prediction models.
Nat.Methods 21 2107 2116 (2024)
PMID: 39384986 DOI: 10.1038/s41592-024-02437-w

Abstact

The recent success of RFdiffusion, a method for protein structure design with a denoising diffusion probabilistic model, has relied on fine-tuning the RoseTTAFold structure prediction network for protein backbone denoising. Here, we introduce SCUBA-diffusion (SCUBA-D), a protein backbone denoising diffusion probabilistic model freshly trained by considering co-diffusion of sequence representation to enhance model regularization and adversarial losses to minimize data-out-of-distribution errors. While matching the performance of the pretrained RoseTTAFold-based RFdiffusion in generating experimentally realizable protein structures, SCUBA-D readily generates protein structures with not-yet-observed overall folds that are different from those predictable with RoseTTAFold. The accuracy of SCUBA-D was confirmed by the X-ray structures of 16 designed proteins and a protein complex, and by experiments validating designed heme-binding proteins and Ras-binding proteins. Our work shows that deep generative models of images or texts can be fruitfully extended to complex physical objects like protein structures by addressing outstanding issues such as the data-out-of-distribution errors.

Legend

Protein

Chemical

Disease

Primary Citation of related structures
Feedback Form
Name
Email
Institute
Feedback