Computation method for gene annotation semantic similarity

A method for calculating semantic similarity, applied in the field of gene annotation. It addresses the following problems: the hierarchical decreasing ratio is difficult to determine, gene annotation semantic similarity cannot be calculated accurately, and computed node semantic similarities cannot be reused.

Inactive Publication Date: 2009-02-04
SHANGHAI UNIV

AI Technical Summary

Problems solved by technology

Although Jiang et al. improved Resnik's method by considering the influence of node depth on semantic similarity, any approach built on Resnik's method has two shortcomings. First, it is based on node counting and ignores the different effects of the "is-a" and "part-of" relations on node similarity. Second, the node semantic similarity it produces cannot be reused, because the semantic similarity of two nodes is affected by the other nodes in the set they belong to.
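The reuse problem can be illustrated with a minimal sketch of a Resnik-style measure, in which the similarity of two nodes is the information content, -log p(t), of their most informative common ancestor. The tiny ontology, the term names and the annotation counts below are hypothetical and serve only to show that the same pair of nodes receives a different score once the surrounding annotation set changes.

```python
import math

# Hypothetical toy ontology: term -> set of its ancestors, including itself.
ANCESTORS = {
    "R": {"R"},
    "B": {"B", "R"},
    "A": {"A", "B", "R"},
    "D": {"D", "B", "R"},
}

def information_content(annotation_counts):
    """IC(t) = -log p(t), where p(t) counts annotations to t or to its descendants."""
    total = sum(annotation_counts.values())
    cumulative = {t: 0 for t in ANCESTORS}
    for term, n in annotation_counts.items():
        for anc in ANCESTORS[term]:
            cumulative[anc] += n
    return {t: -math.log(c / total) for t, c in cumulative.items() if c > 0}

def resnik(a, b, ic):
    """Similarity = IC of the most informative common ancestor of a and b."""
    common = ANCESTORS[a] & ANCESTORS[b]
    return max(ic[t] for t in common)

# Two hypothetical annotation sets that differ only in how often the root is used.
ic_small = information_content({"A": 5, "D": 5, "R": 10})
ic_large = information_content({"A": 5, "D": 5, "R": 90})

sim_small = resnik("A", "D", ic_small)  # equals log(2)
sim_large = resnik("A", "D", ic_large)  # equals log(10)
```

The pair (A, D) and the ontology are identical in both calls, yet the score changes, so a value computed for one annotation set cannot simply be cached and reused for another.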
Wang's method has two disadvantages. First, the decreasing ratio for each relation type is difficult to determine: Wang suggested 0.8 and 0.6, which is in fact quite arbitrary, yet the choice of decreasing ratio directly affects the similarity of GO nodes. Second, the weight of an ancestor node is represented by its maximum weight, which ignores the influence of different paths on the similarity of GO nodes.
[0006] Therefore, there is currently no method for accurately calculating the semantic similarity of gene annotations.
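Both criticisms of Wang's method can be seen in a short sketch of its S-value scheme as commonly described: a term contributes 1.0 to itself, each ancestor receives the largest product of edge weights along any path from the term, and two terms are compared over their shared ancestors. The graph and term names below are hypothetical; the weights 0.8 for is-a and 0.6 for part-of are the values Wang suggested. This is an illustration of the prior method being criticised, not of the method of the invention.

```python
# Hypothetical toy DAG: child -> list of (parent, relation type).
PARENTS = {
    "A": [("B", "is_a"), ("C", "part_of")],
    "B": [("R", "is_a")],
    "C": [("R", "is_a")],
    "D": [("B", "is_a")],
    "R": [],
}
WANG_WEIGHTS = {"is_a": 0.8, "part_of": 0.6}

def s_values(term, weights):
    """S-value of `term` and each ancestor; only the best path per ancestor is kept."""
    s = {term: 1.0}
    stack = [term]
    while stack:
        node = stack.pop()
        for parent, relation in PARENTS[node]:
            candidate = s[node] * weights[relation]
            if candidate > s.get(parent, 0.0):  # the max() step: other paths are discarded
                s[parent] = candidate
                stack.append(parent)
    return s

def wang_similarity(a, b, weights=WANG_WEIGHTS):
    """Shared-ancestor S-values divided by the total S-values of both terms."""
    sa, sb = s_values(a, weights), s_values(b, weights)
    common = sa.keys() & sb.keys()
    shared = sum(sa[t] + sb[t] for t in common)
    return shared / (sum(sa.values()) + sum(sb.values()))

sim_default = wang_similarity("A", "D")
sim_other = wang_similarity("A", "D", {"is_a": 0.7, "part_of": 0.5})
```

In this toy graph the root is reachable from A through both B and C, but only the path through B (0.8 x 0.8 = 0.64) contributes, and moving the two ratios from 0.8/0.6 to 0.7/0.5 changes the similarity of the same pair of terms.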

Examples

Embodiment Construction

[0060] In this example, the method is applied to calculate the semantic similarity of gene annotations in the yeast isoleucine degradation pathway, in order to demonstrate the effectiveness of the present invention.

[0061] In biology, if certain gene products participate in the same biochemical reaction in the body, the corresponding genes can be regarded as having the same biological function. Suppose that substrate A, under the action of the products of genes g1 to g10, is finally converted into product D through three biochemical reaction steps, as shown in Figure 2. On this basis, it can be considered that in Figure 2 the functions of g1 to g4 are similar to one another, the functions of g5 to g7 are similar to one another, and the functions of g8 to g10 are similar to one another. If g1 to g10 are mapped to the MFO graph, calculate according to formula (11) the semantic distance between each o...

Abstract

The invention provides a method for computing the semantic similarity of gene annotations. The method first associates genes with Gene Ontology (GO) nodes using the annotation (association) files provided by the Gene Ontology Consortium. It then computes the semantic similarity of the GO nodes, and finally computes the semantic similarity of the gene annotations from the semantic similarity of those nodes. The method has the advantage that gene annotation semantic similarity can be computed automatically and in large batches.
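The three stages described in the abstract can be sketched as follows. This is a minimal sketch under stated assumptions, not the formulas of the invention, which are not reproduced in this text: the input is assumed to be tab-separated GAF-style lines with the gene symbol in column 3 and the GO ID in column 5, the node-level measure `jaccard_term_sim` is a stand-in placeholder, and the gene-level combination is a best-match average chosen only for illustration. All term IDs, gene names and sample lines are hypothetical.

```python
from collections import defaultdict

def parse_associations(lines):
    """Stage 1: map each gene symbol to its set of GO IDs (GAF-style columns assumed)."""
    gene_terms = defaultdict(set)
    for line in lines:
        if not line.strip() or line.startswith("!"):  # skip blanks and header comments
            continue
        cols = line.rstrip("\n").split("\t")
        gene_terms[cols[2]].add(cols[4])
    return dict(gene_terms)

# Hypothetical ancestor sets (each term includes itself) for the placeholder measure.
ANCESTORS = {
    "GO:T1": {"GO:T1", "GO:ROOT"},
    "GO:T2": {"GO:T2", "GO:ROOT"},
    "GO:T3": {"GO:T3", "GO:T1", "GO:ROOT"},
}

def jaccard_term_sim(a, b):
    """Stage 2 placeholder: overlap of ancestor sets, NOT the patent's node formula."""
    return len(ANCESTORS[a] & ANCESTORS[b]) / len(ANCESTORS[a] | ANCESTORS[b])

def gene_similarity(terms_a, terms_b, term_sim):
    """Stage 3 (assumed best-match average): pair each term with its closest counterpart."""
    best_a = [max(term_sim(a, b) for b in terms_b) for a in terms_a]
    best_b = [max(term_sim(a, b) for a in terms_a) for b in terms_b]
    return (sum(best_a) + sum(best_b)) / (len(best_a) + len(best_b))

SAMPLE = [
    "!hypothetical association file",
    "DB\tID1\tg1\t\tGO:T1\tREF\tIDA",
    "DB\tID1\tg1\t\tGO:T3\tREF\tIDA",
    "DB\tID2\tg2\t\tGO:T3\tREF\tIDA",
    "DB\tID3\tg3\t\tGO:T2\tREF\tIDA",
]
genes = parse_associations(SAMPLE)
sim_g1_g2 = gene_similarity(genes["g1"], genes["g2"], jaccard_term_sim)
sim_g1_g3 = gene_similarity(genes["g1"], genes["g3"], jaccard_term_sim)
```

Because the node-level measure is passed in as a function, node similarities can be computed once and reused for every gene pair, which is what makes batch comparison of many genes practical.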

Description

Technical field

[0001] The invention relates to a method for calculating the semantic similarity of gene annotations, and belongs to the technical field of bioinformatics.

Background technique

[0002] Gene Ontology (GO) is an important gene annotation database, and biologists often use online tools such as AmiGO and QuickGO to retrieve the GO annotations of genes. Once the annotations have been obtained, their semantic similarity must be compared, that is, one must examine whether the functions of certain genes are similar, or whether certain genes are jointly involved in the metabolism of a particular substance. At present, gene similarity is compared mainly by hand. Since biologists usually need to compare dozens or even hundreds of genes, manual comparison is very time-consuming and labor-intensive, and it is also affected by subjective factors. U...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F19/00; G06F19/24
Inventor: 吴飞珍, 马文丽, 王妹, 陈启龙, 郑文岭, 姚文娟, 施国明
Owner: SHANGHAI UNIV