Entity identification method based on Chinese electronic medical records

A technique for entity recognition in electronic medical records, applicable to electronic digital data processing, medical data mining, and special-purpose data processing. It addresses problems such as small corpus size and entity type definitions that fail to cover medical entities.

Status: Inactive. Publication date: 2018-10-09
上海熙业信息科技有限公司
3 Cites, 41 Cited by

AI Technical Summary

Problems solved by technology

This study is the first attempt to study named entity recognition in Chinese electronic medical records, but the d

Examples


Detailed Description of the Embodiments

[0024] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, specific implementations of the present invention are described clearly and completely below with reference to the accompanying drawings. It should be understood that the specific implementations described here serve only to illustrate and explain the present invention and are not intended to limit it.

[0025] Features and exemplary embodiments of various aspects of the invention are described in detail below. In the following detailed description, numerous specific details are set forth to provide a thorough understanding of the present invention. It will be apparent to one skilled in the art, however, that the present invention may be practiced without some of these specific details. The following description of the embodiments is only to provide a better understanding of the present invention by s...

Abstract

The invention provides an entity identification method based on Chinese electronic medical records and relates to the technical field of medical entity identification. First, to address the current lack of a public annotated corpus of Chinese electronic medical records in China, a semi-automatic corpus annotation method is proposed that builds and maintains a medical dictionary, reducing the complexity of manual annotation. Second, the method addresses the problem that existing feature-based entity recognition methods for electronic medical records mostly target ordinary text or general electronic medical record text and do not consider the characteristics unique to Chinese electronic medical records. In addition to the basic features of general text, the method extracts the section (chapter) information features unique to Chinese electronic medical records. The collected dictionary is split into single characters and segmented into words, and core-word features obtained by counting character frequencies and word frequencies are added to the extended features. Relationships between words, obtained by clustering word vectors, are also added to the extended features. Together, these effectively improve the accuracy of entity identification in Chinese electronic medical records.
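The dictionary-driven steps in the abstract can be illustrated with a small Python sketch. This is not the patented implementation: the abstract does not say how dictionary terms are matched against the text, so forward maximum matching is assumed here, and the dictionary entries, entity labels, and function names (`pre_annotate`, `core_characters`) are all hypothetical. The first function produces character-level BIO tags that a human annotator could then correct; the second counts character frequencies across dictionary terms, one plausible reading of the "core word" features.

```python
from collections import Counter

# Hypothetical toy dictionary (term -> entity type); not taken from the patent.
MEDICAL_DICT = {
    "高血压": "DISEASE",
    "头痛": "SYMPTOM",
    "阿司匹林": "DRUG",
}


def pre_annotate(text, dictionary):
    """Assign character-level BIO tags by forward maximum matching (assumed)."""
    max_len = max((len(term) for term in dictionary), default=0)
    tags = ["O"] * len(text)
    i = 0
    while i < len(text):
        matched = False
        # Try the longest candidate first so that longer terms win.
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if candidate in dictionary:
                label = dictionary[candidate]
                tags[i] = "B-" + label
                for j in range(i + 1, i + length):
                    tags[j] = "I-" + label
                i += length
                matched = True
                break
        if not matched:
            i += 1
    return list(zip(text, tags))


def core_characters(dictionary, top_k=3):
    """Return the top_k most frequent characters across all dictionary terms."""
    counts = Counter(ch for term in dictionary for ch in term)
    return [ch for ch, _ in counts.most_common(top_k)]


if __name__ == "__main__":
    for char, tag in pre_annotate("患者头痛，服用阿司匹林", MEDICAL_DICT):
        print(char, tag)
```

Characters not covered by any dictionary term keep the tag `O`, so the output is only a draft annotation: the manual pass the abstract mentions is still needed to fix missed or wrongly matched entities.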

Description

Technical field

[0001] The invention relates to the field of natural language processing, in particular to named entity recognition in electronic medical records.

Background

[0002] The earliest work on information extraction from electronic medical records usually combined dictionaries and rules. As annotated corpora of electronic medical records were built, research on machine-learning-based information extraction from electronic medical records gradually followed. i2b2 (Informatics for Integrating Biology and the Bedside), a national research center in the United States, introduced an information extraction task for English electronic medical records in 2010. The evaluation comprised three subtasks: recognition of entities such as medical problems, examinations, and treatments; recognition of entity modifiers; and extraction of entity relations. It provided 349 manually annotated electronic medical records and 827 unlabeled electron...

Application Information

IPC(8): G06F17/27, G16H10/60, G16H50/70
CPC: G16H10/60, G16H50/70, G06F40/284, G06F40/295
Inventors: 闫凤麒, 张贝贝, 陆明名
Owner: 上海熙业信息科技有限公司