Gene and phenotype association knowledge base and establishment method and application thereof
A gene-phenotype association knowledge base for use in bioinformatics. It reduces the money, manpower, and time consumed, and no such knowledge base has previously been reported.
Examples
Embodiment 1
[0053] This embodiment discloses a method for constructing a gene-phenotype association knowledge base, comprising the following steps:
[0054] S1: obtain the document entities;
[0055] S2: determine and identify the document type;
[0056] S3: extract the gene entries and phenotype entries from the document entities to obtain the document corpus;
[0057] S4: store the associations between genes and phenotypes to obtain the gene-phenotype association knowledge base.
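The four steps above can be sketched as a small pipeline. This is an illustrative sketch only: the function names, the toy gene and phenotype lexicons, the type filter, and the co-occurrence rule are all assumptions and are not the method disclosed in this embodiment.

```python
from collections import defaultdict

# Hypothetical toy lexicons; a real system would use curated vocabularies.
GENE_TERMS = {"BRCA1", "TP53"}
PHENOTYPE_TERMS = {"breast carcinoma", "anemia"}


def get_documents():
    """S1: return document entities (here, hard-coded illustrative records)."""
    return [
        {"pmid": "1", "type": "Journal Article",
         "text": "BRCA1 variants are associated with breast carcinoma."},
        {"pmid": "2", "type": "Editorial",
         "text": "TP53 and anemia: an opinion."},
    ]


def is_accepted_type(doc, accepted=("Journal Article",)):
    """S2: keep only documents whose type is in the accepted set (assumed rule)."""
    return doc["type"] in accepted


def extract_entries(doc):
    """S3: find gene and phenotype entries by simple substring matching."""
    text = doc["text"]
    genes = sorted(g for g in GENE_TERMS if g in text)
    phenotypes = sorted(p for p in PHENOTYPE_TERMS if p in text.lower())
    return genes, phenotypes


def build_knowledge_base(docs):
    """S4: store each gene-phenotype pair with the PMIDs that support it."""
    kb = defaultdict(set)
    for doc in docs:
        if not is_accepted_type(doc):
            continue
        genes, phenotypes = extract_entries(doc)
        for gene in genes:
            for phenotype in phenotypes:
                kb[(gene, phenotype)].add(doc["pmid"])
    return dict(kb)


knowledge_base = build_knowledge_base(get_documents())
```

Keying the store on the (gene, phenotype) pair and keeping the supporting PMIDs as a set makes it easy to count how many documents back each association.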
[0058] Specific steps are as follows:
[0059] S1: Obtain the document entities
[0060] The PubMed database (https://www.ncbi.nlm.nih.gov/pubmed/) was used to collect the title and abstract of each article. Compared with full text, titles and abstracts contain less information and can be analyzed more efficiently. As of July 2018, a total of 27,853,513 articles had been obtained.
[0061] S2: Determine the document type
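One way to collect titles and abstracts programmatically is through the NCBI E-utilities web service. The embodiment only cites the PubMed website, so the use of E-utilities, the helper names, and the example query below are assumptions. The sketch only builds request URLs and does not perform any network access.

```python
from urllib.parse import urlencode

# NCBI E-utilities base URL (assumed access route; the source cites the PubMed site).
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def build_esearch_url(term, retstart=0, retmax=100):
    """Build an ESearch URL that returns PubMed IDs matching a query."""
    params = {
        "db": "pubmed",
        "term": term,
        "retstart": retstart,  # offset of the first record to return
        "retmax": retmax,      # number of records per page
        "retmode": "json",
    }
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


def build_efetch_url(pmids):
    """Build an EFetch URL that returns title and abstract text for given PMIDs."""
    params = {
        "db": "pubmed",
        "id": ",".join(pmids),
        "rettype": "abstract",
        "retmode": "text",
    }
    return f"{EUTILS_BASE}/efetch.fcgi?{urlencode(params)}"


# Hypothetical query and placeholder IDs, for illustration only.
search_url = build_esearch_url("gene AND phenotype", retmax=20)
fetch_url = build_efetch_url(["123", "456"])
```

For a collection of tens of millions of records, paging with `retstart` and `retmax` would not scale well; bulk download of the PubMed baseline files is likely the more practical route.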
[0062] Before judging the document type, a filtering step is...
Embodiment 2
[0101] This embodiment discloses a gene-phenotype association knowledge base, which comprises a document acquisition unit, a document type judgment unit, an entry extraction unit, and a storage unit.
[0102] The document acquisition unit acquires the document entities. The document type judgment unit determines and identifies the document type. The entry extraction unit extracts the gene entries and phenotype entries from the document entities to obtain the document corpus. The storage unit stores the associations between genes and phenotypes, yielding the gene-phenotype association knowledge base.
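As an illustration of the storage unit, the following minimal sketch keeps associations in an SQLite table. The class name, table layout, and the sample rows are hypothetical; the embodiment does not specify a storage backend.

```python
import sqlite3


class StorageUnit:
    """Hypothetical storage unit holding gene-phenotype associations."""

    def __init__(self, path=":memory:"):
        self.conn = sqlite3.connect(path)
        # One row per (gene, phenotype, supporting document) triple.
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS association ("
            "gene TEXT NOT NULL, "
            "phenotype TEXT NOT NULL, "
            "pmid TEXT NOT NULL, "
            "PRIMARY KEY (gene, phenotype, pmid))"
        )

    def add(self, gene, phenotype, pmid):
        """Store one association; duplicate triples are silently ignored."""
        self.conn.execute(
            "INSERT OR IGNORE INTO association VALUES (?, ?, ?)",
            (gene, phenotype, pmid),
        )
        self.conn.commit()

    def genes_for(self, phenotype):
        """Return the distinct genes associated with a phenotype, sorted."""
        rows = self.conn.execute(
            "SELECT DISTINCT gene FROM association "
            "WHERE phenotype = ? ORDER BY gene",
            (phenotype,),
        )
        return [row[0] for row in rows]


# Illustrative rows only; names and IDs are placeholders.
store = StorageUnit()
store.add("GENE_A", "phenotype X", "1")
store.add("GENE_B", "phenotype X", "2")
store.add("GENE_A", "phenotype X", "1")  # duplicate, ignored
genes = store.genes_for("phenotype X")
```

The composite primary key keeps the same document from being counted twice for one gene-phenotype pair, which matters if association counts are later used for quantification.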
Embodiment 3
[0104] This embodiment discloses a method for quantifying the relationship between genes and phenotypes, using either the gene-phenotype association knowledge base constructed by the method of Embodiment 1 or the gene-phenotype association knowledge base of Embodiment 2. The method comprises the following steps:
[0105] (1) Extract the association information for the target phenotype and the target gene.
[0106] (2) Calculate the information content of each phenotype and of each gene separately.
[0107] The information content P_y of phenotype y is calculated by a formula in terms of the following quantities. G_y is the number of genes associated with phenotype y, and G_total is the total number of genes in all gene sets. The parent phenotype of phenotype y is phenotype z, and G_z is the number of genes associated with phenotype z. Here, the parent phenotype is the higher-level phenotype that contains phenotype y; parent phenotype data comes from the HPO database. For example, under the HP:0012647 (abnormal inflamm...