Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Protein data feature extraction method

A data feature and extraction method technology, applied in the field of data processing, can solve the problems of easy loss of spatial information, low precision, large amount of calculation, etc., to achieve the goal of ensuring scientificity and accuracy, improving accuracy, and describing comprehensive appearance features Effect

Pending Publication Date: 2020-07-14
QINGDAO NAT LAB FOR MARINE SCI & TECH DEV CENT
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The present invention aims at the technical problems that the protein feature extraction method in the prior art only uses a three-dimensional model to extract a large amount of calculation, the accuracy is not high when two-dimensional feature extraction is used, and spatial information is easily lost, and a protein data feature extraction method is proposed. can solve the above problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein data feature extraction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] Such as figure 1 As shown, the protein data feature extraction method of the present embodiment includes the following steps:

[0034] S1. Preprocessing the original 3D model of the protein, including data type conversion and data size standardization, to obtain a preprocessed 3D model;

[0035] The original 3D model of the protein is not uniform in size and the selected scale is not consistent when it is constructed, and the pdb format file is often used to describe the 3D position of each atom and the off format file is used to describe the surface of protein molecules. These files are not conducive to feature extraction. Therefore, in this step, the original file is converted into a data type file that is convenient for processing, and the data size is standardized to facilitate unified processing.

[0036] S2. Obtain multiple two-dimensional views of the preprocessed three-dimensional model, extract image feature matrices of each of the two-dimensional views, and f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a protein data feature extraction method. The method comprises the following steps: (1) preprocessing an original three-dimensional model of protein to obtain a preprocessed three-dimensional model; (2) acquiring a plurality of two-dimensional views of the preprocessed three-dimensional model, extracting an image feature matrix of each two-dimensional view, and fusing all the image feature matrixes to obtain a two-dimensional feature matrix of the protein; (3) acquiring a three-dimensional characteristic matrix of the protein; and (4) performing fusion calculation on the two-dimensional feature matrix and the three-dimensional feature matrix of the protein to obtain a protein data feature matrix. According to the method, the two-dimensional view feature informationand the three-dimensional model space structure information of the protein are extracted, so that the appearance feature description of the protein is more comprehensive. Incompleteness caused by onlyadopting two-dimensional feature information extraction is avoided, and scientificity and accuracy of protein model similarity calculation can be guaranteed.

Description

technical field [0001] The invention belongs to the technical field of data processing, and in particular relates to a protein data feature extraction method. Background technique [0002] The database of protein molecules is rapidly increasing, and since proteins display multiple possible conformations in solution, the detection of shape similarity and identity is of biological relevance in the drug discovery process and in the molecular characterization of disease. Therefore, learning how to represent and highlight the features of protein 3D models is of great significance to both the medical field and the biological field. [0003] Currently, protein feature extraction methods are mainly divided into two types, one is protein feature extraction method based on 3D stereo model, and the other is protein feature extraction method based on multi-view. [0004] In protein feature extraction methods based on 3D volumetric models, proteins are described as model-based features,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/46G06N3/04G06N3/08
CPCG06N3/08G06V20/64G06V10/422G06N3/045Y02A90/10
Inventor 魏志强聂婕刘安安聂为之苏育挺
Owner QINGDAO NAT LAB FOR MARINE SCI & TECH DEV CENT
Features
  • Generate Ideas
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More