7DTA image
Deposition Date 2021-01-04
Release Date 2021-07-21
Last Version Date 2024-05-01
Entry Detail
PDB ID:
7DTA
Title:
Solution structure of the C-clamp domain from human HDBP1 in complex with DNA
Biological Source:
Source Organism:
Homo sapiens (Taxon ID: 9606)
Host Organism:
Method Details:
Experimental Method:
Conformers Calculated:
100
Conformers Submitted:
20
Selection Criteria:
structures with the lowest energy
Macromolecular Entities
Structures with similar UniProt ID
Protein Blast
Polymer Type:polypeptide(L)
Molecule:SLC2A4 regulator
Gene (Uniprot):SLC2A4RG
Chain IDs:A
Chain Length:33
Number of Molecules:1
Biological Source:Homo sapiens
Polymer Type:polydeoxyribonucleotide
Molecule:DNA (5'-D(*TP*AP*TP*GP*CP*CP*GP*GP*GP*AP*C)-3')
Chain IDs:B
Chain Length:11
Number of Molecules:1
Biological Source:Homo sapiens
Polymer Type:polydeoxyribonucleotide
Molecule:DNA (5'-D(*GP*TP*CP*CP*CP*GP*GP*CP*AP*TP*A)-3')
Chain IDs:C
Chain Length:11
Number of Molecules:1
Biological Source:Homo sapiens
Ligand Molecules
Primary Citation
Selective Nonmethylated CpG DNA Recognition Mechanism of Cysteine Clamp Domains.
J.Am.Chem.Soc. 143 7688 7697 (2021)
PMID: 33983734 DOI: 10.1021/jacs.1c00599

Abstact

Methylation of DNA at CpG sites is a major mark for epigenetic regulation, but how transcription factors are influenced by CpG methylation is not well understood. Here, we report the molecular mechanisms of how the TCF (T-cell factor) and GEF (glucose transporter 4 enhancer factor) families of proteins selectively target unmethylated DNA sequences with a C-clamp type zinc finger domain. The structure of the C-clamp domain from human GEF family protein HDBP1 (C-clampHDBP1) in complex with DNA was determined using NMR spectroscopy, which adopts a unique zinc finger fold and selectively binds RCCGG (R = A/G) DNA sequences with an "Arg···Trp-Lys-Lys" DNA recognition motif inserted in the major groove. The CpG base pairs are central to the binding due to multiple hydrogen bonds formed with the backbone carbonyl groups of Trp378 and Lys379, as well as the side chain ε-amino groups of Lys379 and Lys380 from C-clampHDBP1. Consequently, methylation of the CpG dinucleotide almost abolishes the binding. Homology modeling reveals that the C-clamp domain from human TCF1E (C-clampTCF1E) binds DNA through essentially the same mechanism, with a similar "Arg···Arg-Lys-Lys" DNA recognition motif. The substitution of tryptophan by arginine makes C-clampHDBP1 prefer RCCGC DNA sequences. The two signature DNA recognition motifs are invariant in the GEF and TCF families of proteins, respectively, from fly to human. The recognition of the CpG dinucleotide through two consecutive backbone carbonyl groups is the same as that of the CXXC type unmethylated CpG DNA binding domains, suggesting a common mechanism shared by unmethylated CpG binding proteins.

Legend

Protein

Chemical

Disease

Primary Citation of related structures