Unlock instant, AI-driven research and patent intelligence for your innovation.

Medical ontology alignment method based on Point Set Registration

An ontology and medical technology, applied in the field of medical knowledge graphs, can solve the problems of low fusion accuracy of medical knowledge graphs, heterogeneity of medical ontology data, poor data matching accuracy, etc., and achieve simple and easy algorithms, rigorous interpretability, The effect of improving accuracy

Pending Publication Date: 2021-07-20
HARBIN INST OF TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to solve the existing methods to solve the existence of medical ontology data heterogeneity, resulting in low accuracy of medical knowledge map fusion, poor data matching accuracy, and large data processing volume, and propose a PointSet Registration-based Medical Ontology Alignment Method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Medical ontology alignment method based on Point Set Registration
  • Medical ontology alignment method based on Point Set Registration
  • Medical ontology alignment method based on Point Set Registration

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach 1

[0058] Specific implementation mode 1: In this implementation mode, the specific process of a medical ontology alignment method based on Point Set Registration is as follows:

[0059] Point SetRegistration is point cloud registration;

[0060] Step 1. Embedding each concept in the two sets of medical ontology datasets to obtain a vector representation of the concept;

[0061] Step 2. For the ontology alignment problem, a mixed Gaussian model is established based on Step 1;

[0062] Step 3, use the EM algorithm to solve the mixed Gaussian model obtained in step 2, and obtain the transformation relationship T between two sets of medical ontology datasets (medical ontology dataset FMA and medical ontology dataset NCI) θ (y m );

[0063] Step 4, the vectors of the two groups of medical ontology obtained in step 1 (using the TransE method obtained in step 1 to embed each concept in the ontology data set FMA and NCI, the vector representation of the concept obtained;) represents ...

specific Embodiment approach 2

[0068] Embodiment 2: The difference between this embodiment and Embodiment 1 is that in the first step, each concept in the two sets of medical ontology data sets is embedded to obtain a vector representation of the concept; the specific process is:

[0069] Using the TransE method, using the triplet relationship contained in the medical ontology dataset FMA as input, embedding each concept (such as: Monoblast) in the medical ontology dataset FMA, and obtaining the vector representation of the concept X N×D ;

[0070] Using the TransE method, using the triplet relationship contained in the medical ontology dataset NCI as input, embedding each concept (such as: Chondroblast) in the medical ontology dataset NCI, and obtaining the vector representation of the concept Y M×D ;

[0071] x N×D and Y M×D The expression is:

[0072] x N×D =(x 1 ,...,x n ,...,x N ) T ;

[0073] Y M×D =(y 1 ,...,y m ,...,y M ) T ;

[0074] where x 1 is the concept vector obtained by embe...

specific Embodiment approach 3

[0076] Embodiment 3: The difference between this embodiment and Embodiment 1 or 2 is that in Step 2, a mixed Gaussian model is established based on Step 1 for ontology alignment; the specific process is:

[0077] Will Y M×D y in 1 ,...,y M The vector is regarded as the centroid of the mixture Gaussian model, X N×D x in 1 ,...,x N Vectors are viewed as points generated by a mixture of Gaussian models;

[0078] Thus, the original ontology alignment problem is transformed into a problem of solving the parameters of the mixed Gaussian model;

[0079] Establish the probability density function of the mixed Gaussian model, the expression is as follows:

[0080]

[0081] In the formula, p(m) is the prior probability of the mth model, p(x n |m) is x given the mth Gaussian model n The conditional probability distribution of x n is the concept vector obtained by embedding the nth concept in the medical ontology dataset FMA, and M is the size of the ontology dataset NCI (y 1...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a medical ontology alignment method based on Point Set Registration, and relates to a medical ontology alignment method. The objective of the invention is to solve the problems of low accuracy, poor data matching precision and large data processing amount of existing medical knowledge graph fusion. The method comprises the following steps: 1, obtaining vector representation of a concept; 2, establishing a Gaussian mixture model; 3, obtaining a transformation relation; 4, mapping the vector representation into the same vector space through a transformation relation; 5, in the vector space, for a certain concept in one group of medical ontologies, if an embedded vector of a concept in the other group of medical ontologies exists in a given threshold radius of an embedded vector corresponding to the concept, determining that the two groups of medical ontology objects have an alignment relationship; 6, determining whether new alignment occurs or not, if so, generating a new triple positive example by using a new alignment relationship, and executing the step 1, or if not, outputting a result. The method is applied to the field of medical knowledge maps.

Description

technical field [0001] The invention relates to a medical ontology alignment method based on Point Set Registration, which belongs to the field of medical knowledge graphs. Background technique [0002] Ontology uses concepts and the relationship between concepts to represent domain knowledge, and provides support for applications such as semantic annotation, knowledge discovery and sharing, data integration and decision-making. Due to the various ways and angles of medical ontology construction, this leads to heterogeneity among different medical ontologies, that is, the same concept usually has different contexts and not identical meanings in different medical ontologies. [0003] In order to integrate such a large medical ontology, medical ontology automatic matching tools become an inevitable solution. Medical ontology alignment is an important technology to solve the heterogeneity of medical ontology data, and it is of great significance in the fusion of medical knowle...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/36G06K9/62
CPCG06F16/367G06F18/2321G06F18/214
Inventor 刘扬段晨婕卓兴良刘晓燕郭茂祖
Owner HARBIN INST OF TECH