Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Protein-ligand affinity prediction method based on interaction energy terms and machine learning

A technology of interaction energy and prediction method, applied in the direction of drug reference, etc., can solve the problems of insufficient diversity, poor correlation, ignoring coupling effect, etc., and achieve the effect of accurate prediction results.

Active Publication Date: 2018-12-18
EAST CHINA NORMAL UNIV
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current scoring function based on experience still has related problems such as poor correlation between predicted value and experimental value, large target dependence and poor sensitivity to congeners
There are many reasons for these problems. For example, too few interaction energy items cause the differentiation to be ignored to a certain extent. The protein-ligand complex data set is rarely enough to cause insufficient diversity. Linear regression ignores the coupling effect between each interaction energy item.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein-ligand affinity prediction method based on interaction energy terms and machine learning
  • Protein-ligand affinity prediction method based on interaction energy terms and machine learning
  • Protein-ligand affinity prediction method based on interaction energy terms and machine learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052]The present invention constructs 107 different interaction items for different amino acid residues by collecting 3746 protein and ligand complex crystal structures and their binding affinity experimental values ​​in the PDBbind library, and adopts the random forest method in the machine learning method, An empirical scoring function was thus established for predicting the affinity of a given complex.

[0053] Concrete steps of the present invention:

[0054] Step 1: Collect and prepare 3746 complex structures and their affinity data from the PDBbing database. The affinity type of the ligand is Kd or Ki, and the ligand affinity values ​​of all complexes have more than 100 distributions in picomolar, nanomolar, micromolar and millimolar levels.

[0055] Step 2: Preprocess all proteins by PDBFixer. Processing steps include filling in missing amino acid residues, filling in missing atoms, and adding hydrogen.

[0056] Step 3: Obtain protein atomic charges based on the amb...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a protein-ligand affinity prediction method based on interaction energy terms and machine learning. The method is characterized by dispersing the various interaction energy terms of a ligand and a protein pocket onto the main side chains of 20 amino acid residues; training the protein-ligand interaction energy information with known activity in a PDB library by a machine learning method; and scoring the ligand and protein affinity by using the obtained model. By dispersing the interaction energy terms, the method can fully consider the influence of the main side chains of different amino acid residues on the affinity, and uses machine learning to perform nonlinear fitting so as to contribute to processing the correlation or coupling between interaction energies, thereby reducing errors caused by different amino acid structures in the calculation of affinity. The method contributes to the prediction of the affinity of active molecules so as to achieve apurpose of improving prediction accuracy.

Description

technical field [0001] The invention relates to the technical field of scoring functions, in particular to a method for predicting protein-ligand affinity based on interaction energy between ligand and protein binding pocket residues and machine learning. Background technique [0002] In the process of drug design, scoring functions are usually used to predict the binding affinity between protein targets and their ligands, thereby improving the success rate of drug design and reducing the cost of drug screening. In recent years, it has been paid more and more attention by relevant scientific research institutions and pharmaceutical companies. Common scoring function methods can be divided into force field-based scoring functions, experience-based scoring functions and knowledge-based scoring functions. Commonly used scoring functions include PLP, ChemScore, X-Score, and GlideScore, etc. Among them, the scoring function based on experience is the most widely used. Empirica...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16H70/40
CPCG16H70/40
Inventor 季长鸽王卫军张增辉闫玉娜段观福单金文
Owner EAST CHINA NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products