6CHK image
Deposition Date 2018-02-22
Release Date 2018-03-07
Last Version Date 2024-03-13
Entry Detail
PDB ID:
6CHK
Keywords:
Title:
Crystal structure of LacI family transcriptional regulator from Lactobacillus casei, Target EFI-512911, with bound TRIS
Biological Source:
Source Organism:
Method Details:
Experimental Method:
Resolution:
1.80 Å
R-Value Free:
0.19
R-Value Work:
0.14
R-Value Observed:
0.14
Space Group:
H 3 2
Macromolecular Entities
Polymer Type:polypeptide(L)
Molecule:Transcriptional regulator, LacI family
Gene (Uniprot):LSEI_2103
Chain IDs:A
Chain Length:282
Number of Molecules:1
Biological Source:Lactobacillus paracasei
Primary Citation
Automatic recognition of ligands in electron density by machine learning.
Bioinformatics 35 452 461 (2019)
PMID: 30016407 DOI: 10.1093/bioinformatics/bty626

Abstact

MOTIVATION The correct identification of ligands in crystal structures of protein complexes is the cornerstone of structure-guided drug design. However, cognitive bias can sometimes mislead investigators into modeling fictitious compounds without solid support from the electron density maps. Ligand identification can be aided by automatic methods, but existing approaches are based on time-consuming iterative fitting. RESULTS Here we report a new machine learning algorithm called CheckMyBlob that identifies ligands from experimental electron density maps. In benchmark tests on portfolios of up to 219 931 ligand binding sites containing the 200 most popular ligands found in the Protein Data Bank, CheckMyBlob markedly outperforms the existing automatic methods for ligand identification, in some cases doubling the recognition rates, while requiring significantly less time. Our work shows that machine learning can improve the automation of structure modeling and significantly accelerate the drug screening process of macromolecule-ligand complexes. AVAILABILITY AND IMPLEMENTATION Code and data are available on GitHub at https://github.com/dabrze/CheckMyBlob. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

Legend

Protein

Chemical

Disease

Primary Citation of related structures