8XYW image
Entry Detail
PDB ID:
8XYW
Keywords:
Title:
De novo designed protein Trx-3
Biological Source:
Source Organism:
PDB Version:
Deposition Date:
2024-01-20
Release Date:
2024-10-02
Method Details:
Experimental Method:
Resolution:
1.41 Å
R-Value Free:
0.24
R-Value Work:
0.22
R-Value Observed:
0.22
Space Group:
C 1 2 1
Macromolecular Entities
Polymer Type:polypeptide(L)
Description:De novo designed Trx-3
Chain IDs:A
Chain Length:116
Number of Molecules:1
Biological Source:synthetic construct
Primary Citation
All-Atom Protein Sequence Design Based on Geometric Deep Learning.
Angew.Chem.Int.Ed.Engl. 63 e202411461 e202411461 (2024)
PMID: 39295564 DOI: 10.1002/anie.202411461

Abstact

Designing sequences for specific protein backbones is a key step in creating new functional proteins. Here, we introduce GeoSeqBuilder, a deep learning framework that integrates protein sequence generation with side chain conformation prediction to produce the complete all-atom structures for designed sequences. GeoSeqBuilder uses spatial geometric features from protein backbones and explicitly includes three-body interactions of neighboring residues. GeoSeqBuilder achieves native residue type recovery rate of 51.6 %, comparable to ProteinMPNN and other leading methods, while accurately predicting side chain conformations. We first used GeoSeqBuilder to design sequences for thioredoxin and a hallucinated three-helical bundle protein. All the 15 tested sequences expressed as soluble monomeric proteins with high thermal stability, and the 2 high-resolution crystal structures solved closely match the designed models. The generated protein sequences exhibit low similarity (minimum 23 %) to the original sequences, with significantly altered hydrophobic cores. We further redesigned the hydrophobic core of glutathione peroxidase 4, and 3 of the 5 designs showed improved enzyme activity. Although further testing is needed, the high experimental success rate in our testing demonstrates that GeoSeqBuilder is a powerful tool for designing novel sequences for predefined protein structures with atomic details. GeoSeqBuilder is available at https://github.com/PKUliujl/GeoSeqBuilder.

Legend

Protein

Chemical

Disease

Primary Citation of related structures