Molecular virtual screening method based on protein sequence alignment

A protein sequence, virtual screening technology, applied in the fields of bioinformatics and computer applications, can solve the problem that the virtual screening method cannot work well

Pending Publication Date: 2021-05-18
ZHEJIANG UNIV OF TECH
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although existing methods can be used for virtual screening of drug molecules, it is generally necessary to know the three-dimensional structure of a given protein or to know at least one binding molecule, so when there is no three-dimensional structure of the protein or the binding molecule is unknown, the existing Virtual screening methods don't work well
[0004] To sum up, the existing molecular virtual screening methods are still far from the requirements of practical applications in terms of screening scenarios and screening effects, and urgently need to be improved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Molecular virtual screening method based on protein sequence alignment
  • Molecular virtual screening method based on protein sequence alignment
  • Molecular virtual screening method based on protein sequence alignment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be further described below in conjunction with the accompanying drawings.

[0036] refer to figure 1 and figure 2 , a molecular virtual screening method based on protein sequence alignment, comprising the following steps:

[0037] 1) Input a protein sequence P to be subjected to molecular screening with the number of residues L;

[0038] 2) For the protein sequence P, use the HHblits (https: / / toolkit.tuebingen.mpg.de / # / hhblits) program to search the database UniRef90 (ftp: / / ftp.uniprot.org / pub / databases / uniprot / uniref / uniref90 / ), generating a multiple sequence alignment information comprising M sequences, denoted as MSA;

[0039] 3) For the MSA file, calculate the position-specific frequency matrix whose size is L×20, denoted as PSFM:

[0040]

[0041] Among them, PSFM i,j Represents the element in row i and column j in PSFM, i=1,2,...,L,j=1,2,...,20, Res j Indicates the jth of 20 residues (A, R, N, D, C, Q, E, G, H, I, L, K, M, F, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a molecular virtual screening method based on protein sequence alignment, which comprises the following steps: according to an input protein sequence to be subjected to molecular screening, acquiring multi-sequence coupling information of protein by using an HHbits program; the frequency PSFM of the same residues appearing at the corresponding positions of the protein sequence to be predicted and the multi-sequence coupling information being calculated; by using the same method, generating the PSFM of each protein sequence in a protein-ligand interaction database BioLiP; calculating the residue alignment score and similarity matching quality of the to-be-predicted protein and each protein in BioLiP, and obtaining a potential seed molecule set according to the matching quality score; and calculating the sum of the two-dimensional fingerprint values of each molecule in the molecule database and all the molecules in the seed molecule set, sorting all the molecules in the DrugBank according to scores, and obtaining x.NDrugBank molecules with the top scores as a molecular sieve selection set of protein sequences to be subjected to molecular screening. The method can be used for any screening scene.

Description

technical field [0001] The invention relates to the fields of bioinformatics and computer applications, in particular to a molecular virtual screening method based on protein sequence alignment. Background technique [0002] Identifying lead molecules that interact with a given protein and appropriately modify its biological behavior is a fundamental challenge in pharmaceutical research. The virtual screening method uses molecular docking software on the computer to simulate the interaction between the target and the candidate drug, and calculates the affinity between the two, so as to reduce the number of actual screening compounds and improve the efficiency of lead compound discovery. Therefore, proposing a fast and accurate virtual screening method has important guiding significance for the design and development of drug molecules. [0003] Research literature found that many virtual screening methods have been proposed, such as: LncLocator (Cao Zhen, PanXiaoyong, Yang Y...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B35/20G16B30/10G06K9/62
CPCG16B35/20G16B30/10G06F18/22
Inventor 胡俊郑琳琳董世建白岩松樊学强张贵军
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products