Characteristic extraction and coding method and system based on multi-modal protein sequence

A protein sequence and feature extraction technology, applied in the field of bioinformatics, can solve problems such as high-dimensional redundant information, affecting the accuracy and efficiency of protein analysis, and not integrating multiple physical and chemical properties of multiple protein amino acid sequences to achieve accurate interaction effect

Active Publication Date: 2018-11-16
SHENZHEN UNIV
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to provide a feature extraction and encoding method and system based on multimodal protein sequences in view of the above-mentioned defects of the prior art, aiming at solving the problem that the protein feature extraction method in the prior art does not have comprehensive The various physical and chemical properties of protein amino acid sequences can easily lead to problems such as high-dimensional redundant information, which affects the accuracy and efficiency of protein analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Characteristic extraction and coding method and system based on multi-modal protein sequence
  • Characteristic extraction and coding method and system based on multi-modal protein sequence
  • Characteristic extraction and coding method and system based on multi-modal protein sequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the object, technical solution and advantages of the present invention more clear and definite, the present invention will be further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0039] In order to solve the problem of protein feature extraction in the prior art, the present invention provides a feature extraction and encoding method based on multimodal protein sequences, specifically as figure 1 As shown, the method includes:

[0040] Step S100 , extracting features of the protein sequence based on the relative mutation rate, hydrophilic property, and hydrophobic property of the amino acid sequence of the protein, respectively, to obtain protein features of three modalities.

[0041] Step S200, performing deep polynomial network encoding on the protein features...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a characteristic extraction and coding method and system based on a multi-modal protein sequence. The method comprises the steps of: performing characteristic extraction on theprotein sequence respectively based on the relative mutation rate, the hydrophilic property and the hydrophobic property of a protein amino acid sequence, and obtaining three-modal protein characteristics; respectively performing depth polynomial network coding on the three-modal protein characteristics, so that three kinds of senior characteristic expression can be respectively obtained; and performing depth polynomial network coding again after cascading the three kinds of senior characteristic expression, so that fused protein characteristics are obtained. Compared with the traditional protein characteristic extraction method, multiple physicochemical properties of the protein amino acid sequence are combined; relatively reliable protein characteristics are extracted; and thus, proteinand protein interaction can be analyzed more accurately.

Description

technical field [0001] The invention relates to the technical field of bioinformatics, in particular to a feature extraction and encoding method and system based on multimodal protein sequences. Background technique [0002] In recent years, thanks to the improvement of computer storage capacity and computing power, many experts and scholars have devoted themselves to the study of protein and protein interactions (Protein and Protein Interactions, PPIs) based on computational methods, and proteins usually function in pairs. Therefore, the study of protein-protein interactions (PPIs) can play a key role in revealing and obtaining protein functions, and how to extract features for proteins is a hot and difficult point. [0003] Although there are currently many feature extraction models based on computational methods applied to the analysis of protein-protein interactions, most protein feature extraction methods only consider the characteristics of one protein amino acid seque...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/18
Inventor 雷海军李诗淇温玉婷雷柏英蔡晔杨张
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products