9IHA image
Deposition Date 2025-02-20
Release Date 2025-08-06
Last Version Date 2025-08-13
Entry Detail
PDB ID:
9IHA
Title:
A-SPECTRIN SH3 DOMAIN V9L, A11V, M25V, V46F, V58L MUTANT
Biological Source:
Source Organism:
Gallus gallus (Taxon ID: 9031)
Method Details:
Experimental Method:
Resolution:
1.01 Å
R-Value Free:
0.17
R-Value Work:
0.16
Space Group:
P 21 21 21
Macromolecular Entities
Structures with similar UniProt ID
Protein Blast
Polymer Type:polypeptide(L)
Molecule:Spectrin alpha chain, non-erythrocytic 1
Gene (Uniprot):SPTAN1
Mutagens:V9L, A11V, M25V, V46F, V58L
Chain IDs:A (auth: AAA)
Chain Length:64
Number of Molecules:1
Biological Source:Gallus gallus
Ligand Molecules
Primary Citation
Artificial intelligence and first-principle methods in protein redesign: A marriage of convenience?
Protein Sci. 34 e70210 e70210 (2025)
PMID: 40671352 DOI: 10.1002/pro.70210

Abstact

Since AlphaFold2's rise, many deep learning methods for protein design have emerged. Here, we validate widely used and recognized tools, compare them with first-principle methods, and explore their combinations, focusing on their effectiveness in protein redesign and potential for therapeutic repurposing. We address two challenges: evaluating tools and combinations ability to detect the effects of multiple concurrent mutations in protein variants, and leveraging large-scale datasets to compare modeling-free methods, namely force fields, which handle point mutations well with limited backbone rearrangement, and inverse folding tools, which excel at native sequence recovery but may struggle with non-natural proteins. Debuting TriCombine, a tool that identifies residue triangles in input structures, matches them to a structural database, and scores mutants based on substitution frequencies, we shortlisted candidates, modeled them with FoldX, and generated 16 SH3 mutants carrying up to 9 concurrent substitutions. The dataset was expanded to include 36 mutants and 11 crystal structures (7 newly solved), along with a parallel set of multiple non-concurrent mutants from three additional proteins. For broader validation, we analyzed 160,000 four-site GB1 mutants and 163,555 (single and double) variants across 179 natural and de novo domains. We show that combining AI-based modeling tools with force field scoring functions yields the most reliable results. Inverse folding tools perform very well but lose accuracy on less-represented proteins. First-principle force fields like FoldX remain the most accurate for point mutations. All methods perform worse when applied to unsolved de novo models, underscoring the need for hybrid strategies in robust protein design.

Legend

Protein

Chemical

Disease

Primary Citation of related structures