Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Meta search method for gene tissue-specific sequence pattern and search result assessment method

A tissue-specific, pattern-searching technology, applied in the field of bioinformatics, can solve problems such as low computational complexity and inability to guarantee the optimal solution of problems, and achieve the effect of improving robustness, reliability, and credibility

Inactive Publication Date: 2011-11-02
TIANJIN UNIV
View PDF3 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Its advantage is that it has low computational complexity, fast calculation speed, and is suitable for searching solutions in a large space; the disadvantage is that it cannot guarantee the optimal solution of the problem, and can only obtain a suboptimal solution similar to the optimal solution.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Meta search method for gene tissue-specific sequence pattern and search result assessment method
  • Meta search method for gene tissue-specific sequence pattern and search result assessment method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In the present invention, for each specific tissue type group (including the HK gene set), a uniform distribution set of generated patterns is simulated, and its distribution density is correlated. For the known tissue-specific patterns and the patterns containing tissue-specific regulatory factor binding, the distribution density is enhanced by calculating the correlation strength, so as to construct the data set of the empty model, and then estimate the distribution parameters, and obtain a better and more precise evaluation.

[0032] The present invention mainly includes three steps: firstly, extract the required information from the existing biological databases (eukaryotic promoter database EPD, nucleosome position region database NPRD, gene regulatory transcription factor database Transfac, DNA methylation information database MethDB). Tissue-specific genes, that is, the promoter sequence (promoter sequence), transcription factor binding site, nucleosome positioni...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a meta search method for a gene tissue-specific sequence pattern and a search result assessment method and belongs to the field of biological information science. The search method comprises the following steps: extracting promoter sequences of tissue-specific genes and housekeeping (HK) genes from a bioinformatics database as input initial data; carrying out local a search algorithm and an exhaustive search algorithm on the input initial data respectively; storing result tissue generated from the operation of the two search algorithms into a filter matrix; estimating the pattern probability by utilizing the data in the filter matrix; and merging the motifs. In the assessment method, Bayes factor analysis evaluation statistics are utilized to obtain the significance of the search result of the gene tissue-specific motifs. Compared with prior art, the methods provided by the invention have the advantages that the pattern search frame used in the invention integrates multiple algorithms, the principle of 'average result overcomes single choice' which prevails in bioinformatics is conformed, the robustness and creditability of results are improved, the creditability of search results is increased, and over estimation and low estimation of the pattern are avoided.

Description

technical field [0001] The invention relates to the field of biological information science, in particular to a search method for gene tissue-specific sequence pattern elements. Background technique [0002] Tissue-specific genes refer to genes that are specifically expressed in different cell types. Tissue-specific gene expression includes information such as binding sites between transcription factors and gene promoter sequences, sequence characteristics of gene promoter regions, alternative splicing (Alternative Splice), and epigenetic characteristics. in: [0003] Transcription factors are a group of protein molecules that can specifically bind to a specific sequence at the 5' end of a gene to ensure that the target gene is expressed at a specific time and space with a specific intensity. Transcription factors, also known as sequence-specific DNA-binding factors, bind to specific DNA sequences, thereby controlling the transfer of genetic information from DNA to mRNA. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/24
Inventor 许华琳宫秀军
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products