Unlock instant, AI-driven research and patent intelligence for your innovation.

Motif mining method based on sequence and shape information deep fusion

A shape information and motif mining technology, which is applied in the field of computer recognition and deep learning, can solve the problems of machine learning model computing performance and optimization method limitations, and the inability to make full use of massive sequencing data, etc., to achieve the effect of efficient fusion

Active Publication Date: 2021-07-09
TONGJI UNIV
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, such traditional methods often ignore the sequence relationship between K-mers, and machine learning models are limited by computing performance and optimization methods, and cannot make full use of massive sequencing data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Motif mining method based on sequence and shape information deep fusion
  • Motif mining method based on sequence and shape information deep fusion
  • Motif mining method based on sequence and shape information deep fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0028] In order to make the above objects, features and advantages of the present invention more obvious and understandable, the present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0029] The present invention proposes a motif mining method based on the deep fusion of sequence and shape information, the process is as attached figure 1 shown, including:

[0030] S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a motif mining method based on sequence and shape information deep fusion. The method comprises the following steps of: S1, constructing a deep embedded convolutional neural network model, and carrying out space alignment mixing on a DNA sequence and shape information to serve as input of the neural network model; S2, training the deep embedded convolutional neural network model to obtain a motif binding strength predicted value; and S3, evaluating the performance of the constructed deep embedded convolutional neural network model based on a regression coefficient R2 between the predicted value and actual binding strength. According to the method, the advantages of the convolutional neural network in the aspect of feature extraction are utilized, and deep fusion of sequence and shape features is realized.

Description

technical field [0001] The present invention relates to the technical field of computer recognition and deep learning, in particular to a motif mining method based on deep fusion of sequence and shape information. Background technique [0002] The binding behavior of transcription factors plays an important role in the regulation of gene expression, and the identification of transcription factor binding sites is of great significance for understanding the binding mechanism and related cellular activities. However, transcription factor binding is a delicate biophysical process with many influencing factors and difficult modeling. To this end, researchers have developed a variety of binding site prediction models. The position weight matrix models the sequence-specific preference of transcription factor binding through a probabilistic statistical model, and each column element value in the matrix represents the probability distribution of the four nucleotides {A, C, G, and T}...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B30/00G06N3/08G06N3/04
CPCG16B30/00G06N3/08G06N3/045
Inventor 黄德双张寅东
Owner TONGJI UNIV