Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-label classification method for electronic medical records based on symptom extraction and feature representation

A technology of electronic medical records and classification methods, applied in the field of medical big data analysis, can solve problems such as unusability, lack of relevant information in electronic medical records, and full text data affecting the classification effect, so as to achieve reliable classification and avoid the influence of redundant information on classification. , the effect of high accuracy

Active Publication Date: 2021-11-02
湖南科创信息技术股份有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The multi-label classification of electronic medical records relies on the features extracted from the medical record text. Currently, there are methods based on the entire text information, but there is a large amount of redundant information in the full text data that affects the classification effect; there are also inspection and detection indicators based on records in the text, Index information such as clinical data, medical codes, and drugs, but due to the lack of relevant information in some electronic medical records, these methods cannot be used

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-label classification method for electronic medical records based on symptom extraction and feature representation
  • Multi-label classification method for electronic medical records based on symptom extraction and feature representation
  • Multi-label classification method for electronic medical records based on symptom extraction and feature representation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0069] The invention discloses a multi-label classification scheme of electronic medical records based on symptom extraction and its characterization model and using bidirectional circulation. Not only the relationship between symptoms and diseases is very important for the multi-label classification of electronic medical records, but also the relationship between symptoms also affects the multi-label classification of electronic medical records. Based on this, the present invention takes into account the relationship between symptoms and diseases The TF-IDF symptom representation scheme of the correlation between symptoms and the Word2Vec symptom representation scheme considering the correlation between symptoms. MetaMap was used to extract the symptom entities in the electronic medical records. A Bidirectional Long Short-Term Memor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a multi-label classification method for electronic medical records based on symptom extraction and feature representation. Considering the impact of diseases and symptoms and the relationship between symptoms on the multi-classification of disease labels in electronic medical records, two different symptom representations are used. Method: Use TF‑IDF to construct symptom vectors and word2vec to learn symptom vectors. Two kinds of symptom vector sequences extracted from electronic medical records are used as input sequences of the two bidirectional LSTM models respectively, and two bidirectional LSTM models are trained; for electronic medical records with unknown disease labels, the symptoms extracted from them correspond to two The symptom vector constitutes two kinds of symptom vector sequences, which are respectively input into two trained bidirectional LSTM models to obtain two probability vectors; the two probability vectors are weighted and combined to obtain the final classification vector. This method has good classification effect and applicability.

Description

technical field [0001] The invention belongs to the field of medical big data analysis, in particular to a multi-label classification method for electronic medical records based on symptom extraction and feature representation. Background technique [0002] The multi-label classification of electronic medical records (EMR) is an important task in the field of medical applications. Its purpose is to automatically generate disease labels for electronic medical records based on information such as symptoms, test indicators, drugs, and text in electronic medical records. , not only can save the cost of large-scale electronic medical record management and maintenance, but also provide convenience for medical knowledge mining and application. The multi-label classification based on electronic medical records can also be used in auxiliary diagnosis systems and hospital guidance systems, which can greatly improve doctors' work efficiency and shorten patients' visit time. The multi-...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G16H10/60G06N3/04
CPCG06N3/049G16H10/60
Inventor 李敏郭东霖卢长利
Owner 湖南科创信息技术股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products