Chinese electronic medical record named entity recognition method

A technology of named entity recognition and electronic medical records, which is applied in the fields of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as inability to handle named entity boundary extraction well, powerlessness, difficulty in learning feature information, etc.

Inactive Publication Date: 2019-06-11
SOUTH CHINA UNIV OF TECH
View PDF3 Cites 58 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In model construction, most methods use LSTM to process input vectors and selectively retain historical information to deal with long-term dependency problems. However, with the growth of sentences and the movement of time steps, it gradually becomes incapable, and it is difficult to learn feature information farther away. , cannot handle the boundary extraction problem of named entities well

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese electronic medical record named entity recognition method
  • Chinese electronic medical record named entity recognition method
  • Chinese electronic medical record named entity recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0058] Such as Figure 1 to Figure 4 As shown, the Chinese electronic medical record named entity recognition method provided in this embodiment mainly combines the part-of-speech feature and the self-attention mechanism. In the data preprocessing stage, by reducing the method of entity part-of-speech tagging, the word vector and part-of-speech tagging of common vocabulary, and the character vector and alternative part-of-speech tagging of named entity vocabulary are obtained. The text vector and the corresponding part-of-speech tagging vector are concatenated and input into the model that integrates the part-of-speech and self-attention mechanism, and the weight vector of the input vector relative to the entire sentence is calculated at each moment through the self-atte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese electronic medical record named entity identification method. The method comprises the following steps: 1) constructing a common vocabulary dictionary; 2) simple part-of-speech tagging; 3) constructing a text and part-of-speech vector mapping table; 4) training a prediction model of the named entity; and 5) predicting the label of the named entity. According to the method, the part-of-speech characteristics are added to improve the boundary distinguishability of the named entity and the common vocabularies, so that the boundary accuracy of the named entity isimproved. At the same time, a self-attention mechanism is introduced into the bidirectional LSTM-CRF model, and the relevancy between the input at each moment and other components in the sentence is calculated, so that the long dependency problem is relieved, and the named entity recognition accuracy is improved.

Description

technical field [0001] The invention relates to the technical field of named entity recognition of Chinese electronic medical records, in particular to a method for recognizing named entities of Chinese electronic medical records. Background technique [0002] Named Entity Recognition (NER) in electronic medical records is to find out some clinical entities related to patients from the descriptive text of electronic medical records, such as the patient's diseased part, symptoms, used drugs and operations, etc. . Named entity recognition of Chinese electronic medical records is the key to information extraction of Chinese electronic medical records, which can lay the foundation for Chinese health information processing such as medical record retrieval, disease prediction, and construction of medical knowledge graphs. However, there are many unregistered words in electronic medical records, and the number continues to increase. Moreover, compared with English, the recognition...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
Inventor 董守斌蔡晓玲胡金龙袁华董守玲
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products