A multi-label classification method for electronic medical records based on symptom extraction and feature representation

A technology of electronic medical records and classification methods, applied in the field of medical big data analysis, can solve the problems of electronic medical records lack of relevant information, unusable, full-text data affecting the classification effect, etc.

Active Publication Date: 2019-03-12
湖南科创信息技术股份有限公司
View PDF12 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The multi-label classification of electronic medical records relies on the features extracted from the medical record text. Currently, there are methods based on the entire text information, but there is a large amount of redundant information in the full text data that affects...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A multi-label classification method for electronic medical records based on symptom extraction and feature representation
  • A multi-label classification method for electronic medical records based on symptom extraction and feature representation
  • A multi-label classification method for electronic medical records based on symptom extraction and feature representation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0069] The invention discloses a multi-label classification scheme of electronic medical records based on symptom extraction and its characterization model and using bidirectional circulation. Not only the relationship between symptoms and diseases is very important for the multi-label classification of electronic medical records, but also the relationship between symptoms also affects the multi-label classification of electronic medical records. Based on this, the present invention takes into account the relationship between symptoms and diseases The TF-IDF symptom representation scheme of the correlation between symptoms and the Word2Vec symptom representation scheme considering the correlation between symptoms. MetaMap was used to extract the symptom entities in the electronic medical records. A Bidirectional Long Short-Term Memor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a multi-label classification method of electronic medical record based on symptom extraction and feature representation. Considering the influence of disease and symptom and thecorrelation between symptoms on the multi-classification problem of disease label of electronic medical record, two different symptom representation methods are adopted: using TF-IDF builds symptom vectors and learns symptom vectors using word2vec. The two symptom vector sequences extracted from the electronic medical record are respectively used as input sequences of the two two-way LSTM models,and the two-way LSTM models are trained. For the electronic medical records of unknown disease tags, two symptom vectors corresponding to the symptoms are extracted to form two symptom vector sequences, and two trained bi-directional LSTM models are input respectively to obtain two probability vectors. The weighted combination of the two probability vectors is used to obtain the final classification vector. This method has good classification effect and applicability.

Description

technical field [0001] The invention belongs to the field of medical big data analysis, in particular to a multi-label classification method for electronic medical records based on symptom extraction and feature representation. Background technique [0002] The multi-label classification of electronic medical records (EMR) is an important task in the field of medical applications. Its purpose is to automatically generate disease labels for electronic medical records based on information such as symptoms, test indicators, drugs, and text in electronic medical records. , not only can save the cost of large-scale electronic medical record management and maintenance, but also provide convenience for medical knowledge mining and application. The multi-label classification based on electronic medical records can also be used in auxiliary diagnosis systems and hospital guidance systems, which can greatly improve doctors' work efficiency and shorten patients' visit time. The multi-...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G16H10/60G06N3/04
CPCG06N3/049G16H10/60
Inventor 李敏郭东霖卢长利
Owner 湖南科创信息技术股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products