Entity recognition system for breast electronic medical records based on multi-criteria active learning

Entity recognition and active learning technology, applied in the field of medical natural language processing, addresses the time-consuming and labor-intensive annotation of clinical medical data and the difficulty of obtaining such data. Its effects are improved representativeness and generality, improved execution efficiency, and reduced misdiagnosis and missed-diagnosis rates.

Active Publication Date: 2021-12-07
DONGHUA UNIV +1

AI Technical Summary

Problems solved by technology

Because electronic medical records are text data from a specialized professional field, annotating the corpus is time-consuming and requires annotators with strong medical expertise, so large amounts of annotated clinical medical data are difficult to obtain.


Examples


Detailed Description of the Embodiments

[0037] The present invention is further illustrated below with reference to specific embodiments. These embodiments only illustrate the invention and are not intended to limit its scope. Moreover, after reading the teachings of the present invention, those skilled in the art may make various changes or modifications to it, and such equivalent forms also fall within the scope defined by the claims appended to the present application.

[0038] The embodiment of the present invention relates to a system that uses an active learning algorithm to sample training data and then uses a deep learning algorithm to extract clinical medical entities from breast electronic medical records. The system includes: 1) a data preprocessing module for breast clinical electronic medical records: the medical record data is analyzed in terms of content, structural features, language features ...
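The sample-then-train cycle described in [0038] can be sketched as a generic pool-based active learning loop. This is a minimal illustration, not the patented procedure: the function name `active_learning_loop` and the `oracle`, `train`, and `score` callables are hypothetical placeholders for the human annotator, the deep learning entity recognizer, and the selection strategy, none of which are specified in the text available here.

```python
import random


def active_learning_loop(pool, oracle, train, score,
                         batch_size, rounds, seed_size=2, rng=None):
    """Generic pool-based active learning (illustrative sketch).

    pool   -- iterable of unlabeled sentences
    oracle -- callable returning the gold annotation for a sentence
    train  -- callable mapping a list of (sentence, annotation) to a model
    score  -- callable (model, sentence) -> float, higher = label sooner
    """
    rng = rng or random.Random(0)
    unlabeled = list(pool)
    rng.shuffle(unlabeled)

    # Start from a small randomly chosen seed set.
    labeled = [(x, oracle(x)) for x in unlabeled[:seed_size]]
    unlabeled = unlabeled[seed_size:]
    model = train(labeled)

    for _ in range(rounds):
        if not unlabeled:
            break
        # Rank the remaining pool with the current model and take the top batch.
        unlabeled.sort(key=lambda x: score(model, x), reverse=True)
        batch, unlabeled = unlabeled[:batch_size], unlabeled[batch_size:]
        # Ask the annotator for labels, then retrain on the enlarged set.
        labeled.extend((x, oracle(x)) for x in batch)
        model = train(labeled)

    return model, labeled
```

Passing the selection strategy in as `score` keeps the loop independent of whichever criteria are used to rank sentences.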



Abstract

The invention relates to a breast electronic medical record entity recognition system based on multi-criteria active learning, comprising a preprocessing module, an entity recognition module, and an active learning module. The invention considers three aspects (the volume of labeled data, the labeling cost of each sentence, and the balance of data sampling) and designs an active learning selection strategy for text sequence labeling that reduces the total labeling workload. On the one hand, the invention can be used to build systems for flagging patients at risk of breast disease, recommending medication, and assisting decision-making, helping doctors carry out standardized diagnosis and treatment of breast diseases more efficiently and providing a scientific basis and suggestions. On the other hand, it can help doctors discover potential abnormalities in the diagnosis and treatment process, reduce misdiagnosis and missed-diagnosis rates, and improve the cure rate of patients with breast diseases, which is of great value to the intelligent development of breast disease research.
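One way to picture a selection strategy that weighs several criteria at once is the sketch below. It is an assumption-laden illustration, not the formula claimed by the invention, which is not given in the text available here: informativeness is taken as mean token entropy, labeling cost as a function of sentence length, and sampling balance as the rarity of the predicted labels in the already labeled set. The names `sentence_score` and `select_batch` and the weights `w_cost` and `w_balance` are hypothetical.

```python
import math
from collections import Counter


def token_entropy(probs):
    """Shannon entropy (in nats) of one token's label distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def sentence_score(token_probs, predicted_labels, labeled_counts,
                   w_cost=0.5, w_balance=0.5):
    """Combine three illustrative criteria into one score (higher = pick first).

    token_probs      -- one label distribution per token
    predicted_labels -- one predicted label per token (same length)
    labeled_counts   -- Counter of label frequencies in the labeled set
    """
    n = len(token_probs)
    if n == 0:
        return 0.0
    # Criterion 1: informativeness, so fewer sentences need labeling overall.
    informativeness = sum(token_entropy(p) for p in token_probs) / n
    # Criterion 2: labeling cost, assumed here to grow with sentence length.
    cost = math.log(1 + n)
    # Criterion 3: balance, favoring labels that are rare in the labeled set.
    total = sum(labeled_counts.values()) or 1
    rarity = sum(1 - labeled_counts.get(lab, 0) / total
                 for lab in predicted_labels) / n
    return informativeness / (1 + w_cost * cost) + w_balance * rarity


def select_batch(pool, labeled_counts, k):
    """Return the k highest-scoring sentences from the unlabeled pool."""
    ranked = sorted(
        pool,
        key=lambda s: sentence_score(s["probs"], s["labels"], labeled_counts),
        reverse=True,
    )
    return ranked[:k]
```

Each pool item is assumed to be a dict with `probs` and `labels` produced by the current entity recognition model; `labeled_counts` could be a `Counter` such as `Counter({"O": 90, "B-DIS": 10})`, where the tag names are made up for illustration.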

Description

Technical field

[0001] The invention relates to the field of medical natural language processing, in particular to a breast electronic medical record entity recognition system based on multi-criteria active learning.

Background

[0002] With the spread and development of hospital information technology, comprehensive information systems have gradually taken shape, with the electronic medical record system at the core and multiple clinical information systems effectively integrated around it. Over decades of use, electronic medical record systems have accumulated a large amount of medical text data, and many institutions and teams have emerged to study the structuring of medical text.

[0003] Electronic medical records are important clinical information resources, closely related to medicine and health, that are generated during medical activities. They not only contain rich medical professional knowledge, but also reflect detailed health informat...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/295, G16H50/70, G06N3/04, G06N3/08
CPC: G16H50/70, G06N3/049, G06N3/08, G06N3/045
Inventor: 潘乔, 张敬谊, 陈德华, 王梅, 金妍红, 王晔
Owner: DONGHUA UNIV