Method for disambiguating entities in medical disease diagnosis record

A disease-diagnosis entity technology, applied to the disambiguation of disease entities and surgical entities in medical disease diagnosis records, that addresses problems such as incomplete disease-name information in diagnoses, high cost, and difficult feature extraction.

Active Publication Date: 2017-07-14
PEKING UNIV

AI Technical Summary

Problems solved by technology

Supervised learning methods can use the label information and the features of the training data to mine regularities between an entity and its candidate entities. Their disadvantage is that the data must be labeled manually, which is costly. Methods that do not need labeled data can use the semantic information of the entity context, but feature extraction is more difficult.
[0004] At present, the researc

Examples

Embodiment Construction

[0089] The invention is further described below through embodiments, in conjunction with the accompanying drawings, without limiting the scope of the invention in any way.

[0090] The invention provides a method for disambiguating named entities in disease diagnosis records based on a heterogeneous concomitant disease network and a graph model. Establishing the heterogeneous concomitant disease network yields the concomitant relationships between disease entities and the associations between disease entities and surgical entities. A multi-layer filtering mechanism generates candidate disease entities and candidate surgical entities for the disease entities and surgical entities to be disambiguated, graph models are built for the candidate disease entities and candidate surgical entities, and the Heterogeneous Personalized PageRank (He-PPR) algorithm on the heterogeneous network sorts candidate disease entities and candida...
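The ranking step can be illustrated with ordinary personalized PageRank. The excerpt does not give the details of He-PPR, so the sketch below is not the patent's algorithm: it is a plain power-iteration personalized PageRank over a single weighted undirected graph, and the function name, the edge format, the damping value, and the node names are all assumptions made for illustration.

```python
from collections import defaultdict


def personalized_pagerank(edges, seeds, damping=0.85, iterations=50):
    """Score nodes of a weighted undirected graph by proximity to the seeds.

    edges: iterable of (node_a, node_b, weight) tuples with positive weights.
    seeds: nodes that receive the restart probability mass.
    This is standard personalized PageRank, not the patent's He-PPR.
    """
    # Build a symmetric weighted adjacency map.
    neighbors = defaultdict(dict)
    for a, b, weight in edges:
        neighbors[a][b] = neighbors[a].get(b, 0.0) + weight
        neighbors[b][a] = neighbors[b].get(a, 0.0) + weight

    nodes = list(neighbors)
    seed_set = set(seeds) & set(nodes)
    if not seed_set:
        raise ValueError("at least one seed must be a node of the graph")

    # Restart distribution: uniform over the seeds, zero elsewhere.
    restart = {n: (1.0 / len(seed_set) if n in seed_set else 0.0) for n in nodes}

    scores = dict(restart)
    for _ in range(iterations):
        updated = {n: (1.0 - damping) * restart[n] for n in nodes}
        for n in nodes:
            total = sum(neighbors[n].values())
            for m, weight in neighbors[n].items():
                # Spread n's score to its neighbors in proportion to edge weight.
                updated[m] += damping * scores[n] * weight / total
        scores = updated
    return scores


# Hypothetical toy graph: one known entity in the record and two candidates.
edges = [
    ("known_disease", "candidate_a", 3.0),
    ("known_disease", "candidate_b", 1.0),
    ("candidate_b", "unrelated_disease", 1.0),
]
scores = personalized_pagerank(edges, seeds=["known_disease"])
ranked = sorted(scores, key=scores.get, reverse=True)
```

Seeding the walk with entities already present in the same record is one way to let the rest of the record influence the ranking of each candidate; how the patent weights the different edge types is not stated in this excerpt.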

Abstract

The invention discloses a method for disambiguating entity names in a medical disease diagnosis record. Based on a heterogeneous concomitant disease network and a graph model, the entity names in the medical disease diagnosis record are disambiguated. The similarity between to-be-disambiguated entity names and candidate entity names is used as local information, and the contribution of other to-be-disambiguated entities in the same record to current to-be-disambiguated entities serves as global information, so that the accuracy of medical entity name disambiguation can be improved; the heterogeneous concomitant disease network is established according to the disease diagnosis record and annotation data, so that the relationships between the diseases and between the disease and the operation can be reflected more intuitively and credibly; and the entity names are subjected to standard name mapping accurately and efficiently, so that the problem of ambiguity of medical disease entity names in diagnosis information is solved, and the practical application demands are met.
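The local information mentioned above, the similarity between a name to be disambiguated and each candidate standard name, can be sketched as follows. The abstract does not specify the similarity measure, so this example assumes a character-level ratio from Python's standard `difflib`; the function name `rank_candidates`, the `top_k` parameter, and the sample names are hypothetical.

```python
from difflib import SequenceMatcher


def rank_candidates(mention, standard_names, top_k=3):
    """Return up to top_k (similarity, name) pairs, most similar first.

    The similarity is difflib's character-level ratio, an assumption
    standing in for whatever measure the patent actually uses.
    """
    scored = [
        (SequenceMatcher(None, mention.lower(), name.lower()).ratio(), name)
        for name in standard_names
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical standard names; a real system would read them from a knowledge base.
candidates = rank_candidates(
    "type 2 diabetes",
    ["type 2 diabetes mellitus", "hypertension", "acute appendicitis"],
)
```

A score of this kind covers only the local side; the global side, the contribution of the other entities in the same record, would have to be combined with it separately.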

Description

Technical field

[0001] The invention relates to the fields of natural language text information processing and medical big data mining, and in particular to a method for disambiguating disease entities and operation entities based on medical disease diagnosis records.

Background

[0002] A medical disease diagnosis record includes the name of the patient's main diagnosed disease, the names of the secondary diagnosed diseases (i.e., the concomitant diseases), and the operations performed for the diagnosed disease. Because diseases are numerous and doctors differ in experience, the same disease is often written in many different ways, which poses a great challenge to the standardization of electronic medical record data.

[0003] The task of named entity disambiguation is to establish, in the knowledge base, a mapping to the corresponding entity for a given entity reference in the text (reference ref...
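The record structure described in [0002] (a main disease, secondary diseases, and operations) suggests one simple way to derive a network with two edge types. The sketch below counts co-occurrences within each record. This is an assumption for illustration only: the excerpt does not describe how the patent constructs its network or how annotation data enters it, and the record keys `main`, `secondary`, and `operations` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_network(records):
    """Count co-occurrences to form disease-disease and disease-operation edges.

    records: iterable of dicts with a "main" disease name and optional
    "secondary" and "operations" lists (hypothetical field names).
    Returns two Counters keyed by (disease, disease) and (disease, operation).
    """
    disease_disease = Counter()
    disease_operation = Counter()
    for record in records:
        # Deduplicate and sort so each undirected pair has one canonical key.
        diseases = sorted({record["main"], *record.get("secondary", [])})
        for a, b in combinations(diseases, 2):
            disease_disease[(a, b)] += 1
        for disease in diseases:
            for operation in set(record.get("operations", [])):
                disease_operation[(disease, operation)] += 1
    return disease_disease, disease_operation


# Hypothetical records using placeholder names.
records = [
    {"main": "disease_a", "secondary": ["disease_b"], "operations": ["operation_1"]},
    {"main": "disease_b", "secondary": ["disease_a", "disease_c"], "operations": []},
]
disease_edges, operation_edges = build_network(records)
```

The resulting counts could serve as edge weights for a graph-ranking step; whether raw counts, normalized frequencies, or another weighting is appropriate is left open here.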

Application Information

IPC(8): G06F19/00, G06F17/30
CPC: G06F16/288, G06F19/324
Inventor: 宋国杰, 刘徽, 李鹏宇
Owner: PEKING UNIV