Protein data feature extraction method

A data feature and extraction method technology, applied in the field of data processing, can solve the problems of easy loss of spatial information, low precision, large amount of calculation, etc., to achieve the goal of ensuring scientificity and accuracy, improving accuracy, and describing comprehensive appearance features Effect

Pending Publication Date: 2020-07-14
QINGDAO NAT LAB FOR MARINE SCI & TECH DEV CENT
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The present invention aims at the technical problems that the protein feature extraction method in the prior art only uses a three-dimensional model to extract a large amount of calculation, t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein data feature extraction method

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0032] Example one

[0033] Such as figure 1 As shown, the protein data feature extraction method of this embodiment includes the following steps:

[0034] S1. Preprocess the original 3D model of protein, including data type conversion and data size standardization, to obtain a preprocessed 3D model;

[0035] The original three-dimensional model of protein is not uniform in size during construction and the selected scale is inconsistent. The pdb format file is used to describe the three-dimensional position of each atom and the off format file is used to describe the protein molecular surface. These files are not conducive to feature extraction. Therefore, in this step, the original file is converted into a data type file that is convenient for processing, and the data size is standardized to facilitate unified processing.

[0036] S2. Acquire multiple two-dimensional views of the preprocessed three-dimensional model, extract the image feature matrix of each of the two-dimensional vi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a protein data feature extraction method. The method comprises the following steps: (1) preprocessing an original three-dimensional model of protein to obtain a preprocessed three-dimensional model; (2) acquiring a plurality of two-dimensional views of the preprocessed three-dimensional model, extracting an image feature matrix of each two-dimensional view, and fusing all the image feature matrixes to obtain a two-dimensional feature matrix of the protein; (3) acquiring a three-dimensional characteristic matrix of the protein; and (4) performing fusion calculation on the two-dimensional feature matrix and the three-dimensional feature matrix of the protein to obtain a protein data feature matrix. According to the method, the two-dimensional view feature informationand the three-dimensional model space structure information of the protein are extracted, so that the appearance feature description of the protein is more comprehensive. Incompleteness caused by onlyadopting two-dimensional feature information extraction is avoided, and scientificity and accuracy of protein model similarity calculation can be guaranteed.

Description

technical field [0001] The invention belongs to the technical field of data processing, and in particular relates to a protein data feature extraction method. Background technique [0002] The database of protein molecules is rapidly increasing, and since proteins display multiple possible conformations in solution, the detection of shape similarity and identity is of biological relevance in the drug discovery process and in the molecular characterization of disease. Therefore, learning how to represent and highlight the features of protein 3D models is of great significance to both the medical field and the biological field. [0003] Currently, protein feature extraction methods are mainly divided into two types, one is protein feature extraction method based on 3D stereo model, and the other is protein feature extraction method based on multi-view. [0004] In protein feature extraction methods based on 3D volumetric models, proteins are described as model-based features,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/46G06N3/04G06N3/08
CPCG06N3/08G06V20/64G06V10/422G06N3/045Y02A90/10
Inventor 魏志强聂婕刘安安聂为之苏育挺
Owner QINGDAO NAT LAB FOR MARINE SCI & TECH DEV CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products