Check patentability & draft patents in minutes with Patsnap Eureka AI!

Electronic medical record text classification method

A technology for electronic medical records and text classification, applied in text database clustering/classification, neural learning methods, unstructured text data retrieval, etc., can solve the problems of slow model convergence, dense text terms, missing sentence components, and poor classification effect To achieve the effect of improving the effect, alleviating the gradient problem, good stability and robustness

Pending Publication Date: 2022-02-15
XUZHOU MEDICAL UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, the text classification of electronic medical records using the neural network combination model is limited by the high-dimensional and sparse text features of electronic medical records, dense text terms, missing sentence components, etc., which will cause slow model convergence and poor classification results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Electronic medical record text classification method
  • Electronic medical record text classification method
  • Electronic medical record text classification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0041] First, collect and construct the original electronic medical record text data set. The experimental data set comes from the real electronic medical record text of the Affiliated Hospital of Xuzhou Medical University. 1000 medical record description sentences in disease and diagnosis, symptoms and signs and treatment, including 500 diabetes data and 500 Parkinson's disease data.

[0042] For the original electronic medical record data set, the Jieba word segmentation module is used to segment the text sequence in a precise mode. After the word segmentation task is completed, the word segmentation results are traversed in combination with the stop word list, and the stop words are removed to form the original corpus.

[0043] Convert the original corpus into a vocabulary T1, including word numbers and words, use the word2vec word vector tool to train vocabulary T1, the default skip-gram model, and express the word training as a low-dimensional dense word vector, forming vo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an electronic medical record text classification method. The method comprises the following steps: performing preprocessing operation on an original electronic medical record text data set to form an original corpus, converting the original corpus into a word list T1, training the word list T1 by utilizing a word vector tool, expressing word training as low-dimensional dense word vectors, and forming a word list T2; and then, correspondingly converting each piece of data of the text data set into a word vector sequence as input in a word number form, using a two-channel structure of a CNN-Attention neural network and a two-channel structure of a BiLSTM-Attention neural network for training a text feature vector, then splicing output of the two-channel structure to serve as overall output of the neural network to obtain the overall output of the neural network, and finally, calculating the probability of the tag category to which the text belongs by using a softmax classifier. According to the method, local and global text features of the electronic medical record text data set can be planned as a whole, good stability and robustness are achieved, and the effect of an electronic medical record text classification model is effectively improved.

Description

technical field [0001] The invention relates to a classification method, in particular to an electronic medical record text classification method, which belongs to the technical field of application of natural language processing to medical electronic cases. Background technique [0002] Text classification refers to establishing a relationship model between text and categories. As one of the basic tasks of natural language processing, it is of great significance in sentiment analysis, social platform public opinion monitoring, and spam identification. The main algorithm models of text classification can be basically divided into three categories: the first category is based on rules, the second category is based on statistics and machine learning, and the third category is based on deep learning methods. [0003] The first type of rule-based method relies on the help of professionals to formulate a large number of judgment rules for predefined categories, and the degree of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16H10/60G06F16/35G06F40/30G06K9/62G06N3/04G06N3/08
CPCG16H10/60G06F16/35G06F40/30G06N3/049G06N3/084G06N3/048G06N3/044G06N3/045G06F18/2411G06F18/2415G06F18/214
Inventor 李超凡马凯
Owner XUZHOU MEDICAL UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More