Gene recognition system of self-adaptation filter based on variable step size minimum mean square error

An adaptive filter and minimum mean square error technology, applied in genomics, instruments, special data processing applications, etc., can solve the problems of large random noise in short gene sequences, various adjustment parameters, and difficult storage, etc., to improve recognition performance Effect

Inactive Publication Date: 2017-07-04
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, the research on gene identification has achieved many excellent results, and many mature prediction systems have been formed. However, these systems mainly rely on some classic machine learning labeling methods, such as HMM and CRF. The general model structure is complex and there are many adjustment parameters. , the training is time-consuming and difficult to store, and the system is too specific, which is not conducive to the transfer between species
In addition, it is impossible to effectively suppress and solve problems such as large random noise in short gene sequences, sparse feature information and low recognition rate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Gene recognition system of self-adaptation filter based on variable step size minimum mean square error
  • Gene recognition system of self-adaptation filter based on variable step size minimum mean square error
  • Gene recognition system of self-adaptation filter based on variable step size minimum mean square error

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0053] Taking the input of a single DNA sequence as an example, the default system has pre-loaded the training parameter values ​​​​of the relevant model from the configuration file. The recognition process is as follows: image 3 shown. The system starts to read an unknown input sequence each time, first counts the composition of the four bases A, C, G, and T, calculates the ratio of GC content to AT content, and updates the weight of each base corresponding to the filter output And save, so that when calculating the total power spectrum or signal-to-noise ratio value at sequence N / 3, the filter power spectrum output of a single base is weighted. Then use a single base as the step length to slide processing, and each time you slide a base position forward, it is judged whether it has reached the end of the sequence. If the feedback result is yes, it means that the entire sequence has been processed completely, and the system will automatically perform some necessary operation...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a gene recognition system of a self-adaptation filter based on a variable step size minimum mean square error. The system comprises an improvement unit of a variable step size LMS self-adaptation filter algorithm and a processing unit of a gene structure multi-characteristic weight fusion algorithm, wherein the improvement unit of the variable step size LMS self-adaptation filter algorithm is used to conduct filtering processing of a gene sequence by the variable step size LMS self-adaptation improvement algorithm, so gene characteristics in that random noise is rare, and cycle 3 behaviors are strong can be obtained; and the processing algorithm of the gene structure multi-characteristic weight fusion algorithm is used to extract characteristics of the gene sequence through a multi-characteristic weight fusion strategy, so that characteristic vectors with a higher expression ability can be obtained. The invention proposes the variable step size LMS self-adaptation filter improvement algorithm and the multi-characteristic weight fusion algorithm; and the two algorithms are integrated into the same gene recognition system, so recognition performance of the system is further improved.

Description

technical field [0001] The invention relates to the field of identifying and locating codable gene regions of DNA sequences obtained by sequencing in bioinformatics, and in particular relates to a gene identification system based on a variable step size minimum mean square error (LMS) adaptive filter. Background technique [0002] At present, the research on gene identification has achieved many excellent results, and many mature prediction systems have been formed. However, these systems mainly rely on some classic machine learning labeling methods, such as HMM and CRF. The general model structure is complex and there are many adjustment parameters. , the training is time-consuming and difficult to store, and the system is too specific, which is not conducive to the transfer between species. In addition, it cannot effectively suppress and solve problems such as large random noise in short gene sequences, sparse feature information and low recognition rate. [0003] In orde...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/18
CPCG16B20/00
Inventor 郭睿徐勇张健
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products