Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semi-supervised learning method for constructing medical knowledge graph from Chinese electronic medical records

A semi-supervised learning and electronic medical record technology, applied in the field of semi-supervised learning, can solve the problems of lack of high-quality, large-scale Chinese electronic medical record annotation corpus, inability to accurately identify medical knowledge of medical records, complex sentence features, etc., to avoid templates Design and feature engineering, error reduction, wide applicability effects

Inactive Publication Date: 2021-03-23
SOUTHWEAT UNIV OF SCI & TECH
View PDF6 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. Due to the inclusion of patient sensitive information, and the labeling of medical entities and entity relationships relies on domain knowledge, there is currently a lack of high-quality, large-scale Chinese electronic medical record labeling corpus in China
[0005] 2. Chinese electronic medical records contain a large number of professional medical vocabulary and special characters, and there is no clear boundary between Chinese words and words, resulting in very complex sentence features that are difficult to extract. Existing technologies cannot accurately identify medical knowledge in medical records

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-supervised learning method for constructing medical knowledge graph from Chinese electronic medical records

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be further described below in conjunction with the accompanying drawings.

[0038] Such as figure 1 As shown, the embodiment of the present invention provides a semi-supervised learning method for constructing a medical knowledge map from Chinese electronic medical records, which mainly includes three stages, which are data preprocessing stage, knowledge extraction stage and knowledge storage stage.

[0039] Among them, the main content of the data preprocessing stage is: obtain the original data set of Chinese electronic medical records, design a medical knowledge description system according to the Chinese electronic medical records in the original data set, and manually mark the relationship between medical entities, and initially construct a Chinese electronic medical record marking corpus.

[0040] The main content of the knowledge extraction stage is: according to the medical knowledge description system, combined with deep neural networ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of Chinese electronic medical record processing, and discloses a semi-supervised learning method for constructing a medical knowledge graph from Chinese electronic medical records. The method is an end-to-end method, and comprises the following steps: firstly, obtaining an original corpus of a Chinese electronic medical record, performing operations such as data preprocessing, knowledge description system design, manual entity relationship marking and the like, and sorting related medical terms to construct a Chinese medical word segmentation dictionary; completing the knowledge extraction work in combination with a machine learning algorithm and a deep neural network; expanding a Chinese electronic medical record mark data set step by step in combination with a semi-supervised learning method, storing all extracted knowledge triples in a Neo4j database, and constructing a medical knowledge graph. According to the method provided by the invention, the Chinese electronic medical record annotation corpus is provided, and the medical knowledge in the medical record can be accurately identified.

Description

technical field [0001] The invention relates to the technical field of Chinese electronic medical records processing, in particular to a semi-supervised learning method for constructing a medical knowledge graph from Chinese electronic medical records. Background technique [0002] Knowledge Graph is essentially a language network, its nodes represent entities, and connections represent various semantic relationships between entities, which can connect scattered knowledge to each other to form a A huge, networked knowledge system built with the "semantic network" as the skeleton. As more and more semantic WWW data is opened on the Internet, various Internet search engine companies at home and abroad have begun to build knowledge graphs based on this to improve service quality, such as Google Knowledge Graph (Google Knowledge Graph), Baidu "Zhixin" and so on. The construction of knowledge graphs in the medical field is a major research hotspot at present. Electronic medical...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16H10/60G16H50/70G06F16/33G06F16/35G06F16/36G06K9/62G06N3/04G06N3/08
CPCG16H10/60G16H50/70G06F16/3344G06F16/35G06F16/367G06N3/08G06N3/047G06N3/048G06N3/045G06F18/2415G06F18/241
Inventor 杨春明郭鑫张晖李波赵旭剑
Owner SOUTHWEAT UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products