Entity recognition method and device based on Chinese medical records, equipment and storage medium

A technology of entity recognition and medical records, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem of low recognition accuracy and achieve the effect of improving accuracy

Active Publication Date: 2019-06-11
PING AN TECH (SHENZHEN) CO LTD
View PDF4 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to overcome the problem of low accuracy of Chinese named entity recognition based on deep learning in the prior art, and propose a method, device, equipment and storage medium for entity recognition based on Chinese medical records. The corresponding features of the text content in the Chinese cases are extracted and converted into feature vectors, and then the feature vectors are used as the input of the model to improve the accuracy of entity recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity recognition method and device based on Chinese medical records, equipment and storage medium
  • Entity recognition method and device based on Chinese medical records, equipment and storage medium
  • Entity recognition method and device based on Chinese medical records, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The present invention is further illustrated below by means of examples, but the present invention is not limited to the scope of the examples.

[0048] First, the present invention proposes an entity recognition method based on Chinese medical records.

[0049] In the first embodiment, if figure 1 As shown, the described entity recognition method based on Chinese medical records comprises the following steps:

[0050] Step 01: Use a word segmentation tool to segment the Chinese medical records.

[0051] Since it is aimed at Chinese medical records, the word segmentation tools also use Chinese word segmentation tools. The word segmentation tools mentioned here are all existing, and the common ones are jieba, SnowNLP, THULAC, NLPIR, etc., and will not be described in detail.

[0052] Separate individual words and words in a sentence through word segmentation, and also separate punctuation for subsequent entity identification.

[0053] The word segmentation tool was us...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an entity recognition method based on a Chinese medical record, and belongs to the field of natural language processing. The method comprises the following steps: carrying outword segmentation on Chinese medical records; outputting a first feature vector used for representing the position of each word in the word group; identifying the radicals of each character in the Chinese medical record, and comparing the identified radicals of each character with preset entity radicals one by one; outputting a second feature vector for representing a comparison result corresponding to each word; splicing the output first feature vector and second feature vector corresponding to each word behind the initial vector of each word to obtain a vector set for representing the Chinese medical record; and inputting the vector set into the trained model to extract entities therein. According to the method, corresponding features are extracted from the text content in the Chinese case and converted into feature vectors to serve as input of the model, so that the entity recognition accuracy of the model is improved.

Description

technical field [0001] The invention relates to the field of natural language processing, and relates to an entity recognition method, device, equipment and storage medium based on Chinese medical records. Background technique [0002] At present, there is a great demand for the application of named entity recognition in medical cases, such as querying, searching, sorting, etc., in order to achieve the purpose of building medical knowledge base, medical knowledge graph and promoting medical automatic question answering. [0003] The effect of existing Chinese named entity recognition based on deep learning is difficult to improve, and it was previously applied to other languages, such as English. Because of the limitations of deep learning models and the differences in language characteristics between languages, this limits the application of named entity tasks in Chinese. And because of the differences between the general field, other fields and the medical field, its appl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
Inventor 丁佳佳曹灵宇倪渊谢国彤
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products