Traceable medical record classification method

A classification method and medical record technology, applied in the field of medical information processing, can solve the problems of inability to go deep, deep learning to trace the Attention mechanism, inexplicable traceability, and low classification accuracy.

Active Publication Date: 2021-04-23
SHAN DONG MSUN HEALTH TECH GRP CO LTD
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These methods have many obvious shortcomings: 1. When the text features of medical records are extracted by statistical language processing methods, the sequence structure of the original text is destroyed. Although the process of classification is more explanatory, the characteristics of medical records and the text information of medical records are only There is a statistical correlation, and there is no one-to-one correlation, so it cannot be traced back
2. Since the text features of medical record texts extracted based on statistical language processing methods are all literal statistics, it is impossible to go deep into the semantic information of each word, so the classification accuracy i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be further described below.

[0027] A traceable method of classifying medical records, including:

[0028] a) Obtain medical record data and express it as a collection {(D 1 , L 1 ),(D 2 , L 2 ),...,(D n , L n )}, there are n data in the collection, D 1 is the medical record text in the first data, L 1 is the category label corresponding to the medical record text in the first data, and the number of categories in all labels is m.

[0029] b) Randomly initialize the trainable label embedding matrix, denoted as K, the size of which is m rows and h columns. Each row of data corresponds to a fixed medical record category.

[0030] c) Using a pre-trained language model, input a medical record text, and represent the output data as a matrix U, whose size is l rows and h columns, where l represents the length of the text of the input medical record, and each row of data is the same as the medical record Each text in is one-to-one correspo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a traceable medical record classification method, and the method comprises the steps: carrying out the semantic integration on the context information of all the characters in the medical record by using the language model, and finally suppressing the influence of meaningless characters on the classification result through a gating mechanism, so the signal-to-noise ratio of intermediate data of the classification model is improved, and then the classification accuracy of medical record texts is improved. Meanwhile, meaningful characters for the classification result can be traced back through a gating value. Finally, while high-accuracy medical record classification is achieved, the gating values of the characters are output, and tracing of the character basis of medical record classification is achieved through the gating values of the characters.

Description

technical field [0001] The invention relates to the technical field of medical information processing, in particular to a traceable medical record classification method. Background technique [0002] The classification of medical records has a wide range of applications in the fields of current medical and health statistics, disease coding quality control, DRGs and medical insurance audits. But nowadays, the classification of medical records cannot be 100% accurate, and the medical industry is due to its rigor and other characteristics. The human-machine collaborative experience in product development is particularly important. There are two main technical solutions for traditional medical record classification methods: [0003] Methods based on statistical language processing: such as TF-IDF, BM25, N-GRAM, hidden semantic analysis, topic models, etc. The common feature of these methods is to extract text feature information from medical record texts from a statistical poi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G06F40/216G06K9/62G16H10/60
CPCG06F16/355G06F40/216G16H10/60G06F18/2415G06F18/214
Inventor 张伯政吴军樊昭磊何彬彬桑波
Owner SHAN DONG MSUN HEALTH TECH GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products