Medical term standardization method and device, computer equipment and storage medium

A medical term standardization technique, applied in the field of data processing, that addresses the low accuracy of medical term standardization.

Active Publication Date: 2020-08-25
深圳平安医疗健康科技服务有限公司

AI Technical Summary

Problems solved by technology

[0005] The purpose of the embodiments of the present application is to propose a medical term standardization method that solves the problem of low accuracy of medical term standardization in the prior art.



Examples


Specific embodiments

[0100] Referring to Figure 4, which shows another specific implementation of step S3, the implementation process is described in detail as follows:

[0101] S34: Using the N-gram model, perform part-of-speech tagging on the word segmentation units and assign a label to each word segmentation unit to obtain part-of-speech units.

[0102] Specifically, each word segmentation unit has its own part of speech and label. The N-gram model is used to tag the part of speech of each word segmentation unit, and the word segmentation units are labeled one by one according to the label classification of the preset medical terminology database, yielding the part-of-speech units.

[0103] Here, the N-gram model is a language model commonly used in large-vocabulary continuous speech recognition. For Chinese, it is called the Chinese Language Model (CLM), and it can be used to perform part-of-speech tagging of vocabulary.
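As an illustration of step S34, the following is a minimal sketch of N-gram tagging with N = 2 (a bigram model decoded with the Viterbi algorithm). It is not the implementation described in the application: the training sentences, the label names (`N`, `V`, `DRUG`, `SYMPTOM`), the add-one smoothing, and the function names `train_bigram_tagger` and `tag_units` are all assumptions made for this example.

```python
from collections import defaultdict
import math


def train_bigram_tagger(tagged_sentences):
    """Count tag-to-tag transitions and tag-to-word emissions."""
    transitions = defaultdict(lambda: defaultdict(int))
    emissions = defaultdict(lambda: defaultdict(int))
    for sentence in tagged_sentences:
        prev = "<s>"  # sentence-start marker
        for word, tag in sentence:
            transitions[prev][tag] += 1
            emissions[tag][word] += 1
            prev = tag
    return transitions, emissions


def _log_prob(counts, key, num_outcomes):
    """Add-one smoothed log probability of `key` under `counts`."""
    total = sum(counts.values())
    return math.log((counts.get(key, 0) + 1) / (total + num_outcomes))


def tag_units(units, transitions, emissions):
    """Viterbi decoding: most probable label sequence for the units."""
    tags = sorted(emissions)
    vocab = {word for tag in emissions for word in emissions[tag]}
    vocab_size = len(vocab) + 1  # one extra slot for unseen words
    best = {"<s>": (0.0, [])}    # tag -> (log score, path so far)
    for unit in units:
        step = {}
        for tag in tags:
            emit = _log_prob(emissions[tag], unit, vocab_size)
            score, path = max(
                (
                    (s + _log_prob(transitions[prev], tag, len(tags)) + emit, p)
                    for prev, (s, p) in best.items()
                ),
                key=lambda candidate: candidate[0],
            )
            step[tag] = (score, path + [tag])
        best = step
    best_path = max(best.values(), key=lambda candidate: candidate[0])[1]
    return list(zip(units, best_path))


# Hypothetical labelled word segmentation units (illustrative only).
training = [
    [("patient", "N"), ("takes", "V"), ("aspirin", "DRUG")],
    [("patient", "N"), ("reports", "V"), ("headache", "SYMPTOM")],
    [("aspirin", "DRUG"), ("relieves", "V"), ("headache", "SYMPTOM")],
]
transitions, emissions = train_bigram_tagger(training)
tagged = tag_units(["patient", "takes", "aspirin"], transitions, emissions)
```

In this sketch each label depends only on the previous label and the current word; a real system would presumably train on a much larger corpus and draw its label set from the preset medical terminology database.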

[0104] In a specific embodiment, after the word segmentati...


Abstract

The invention discloses a medical term standardization method and device, computer equipment and a storage medium. The method comprises the following steps: obtaining medical text data and performing data cleaning on the medical text data to obtain an initial text; performing word segmentation on the initial text with a word segmentation engine to obtain the word segmentation units corresponding to the initial text; identifying medical feature words in the word segmentation units through deep-learning entity recognition based on medical knowledge to obtain target segmented words; performing inverted indexing on the target segmented words to determine the medical term texts containing the target segmented words and the frequency with which the target segmented words occur in those medical term texts, and taking those medical term texts as candidate texts; and selecting the candidate text with the maximum similarity value as the standard medical term text. The method and the device effectively improve the accuracy of medical term standardization, and thereby improve the availability of medical term text data.
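The inverted-index and candidate-selection steps in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed method: term texts are tokenized on whitespace, similarity is taken to be cosine similarity over word-frequency vectors (the abstract does not name a similarity measure), and the sample term list and the function names `build_inverted_index` and `standardize` are hypothetical.

```python
from collections import Counter, defaultdict
import math


def build_inverted_index(term_texts):
    """Map each word to the term texts containing it, with frequencies."""
    index = defaultdict(dict)  # word -> {term text: frequency in that text}
    norms = {}                 # term text -> Euclidean norm of its counts
    for term in term_texts:
        counts = Counter(term.split())
        for word, freq in counts.items():
            index[word][term] = freq
        norms[term] = math.sqrt(sum(f * f for f in counts.values()))
    return index, norms


def standardize(target_words, index, norms):
    """Return the candidate term text most similar to the target words."""
    query = Counter(target_words)
    query_norm = math.sqrt(sum(f * f for f in query.values()))
    dots = defaultdict(float)  # candidate term text -> dot product
    for word, q_freq in query.items():
        # Postings lookup: only texts containing the word become candidates.
        for term, freq in index.get(word, {}).items():
            dots[term] += q_freq * freq
    if not dots:
        return None  # no term text contains any target word
    return max(dots, key=lambda t: dots[t] / (query_norm * norms[t]))


# Hypothetical standard term texts (illustrative only).
standard_terms = [
    "type 2 diabetes mellitus",
    "type 1 diabetes mellitus",
    "gestational diabetes",
]
index, norms = build_inverted_index(standard_terms)
best = standardize(["type", "2", "diabetes"], index, norms)
```

Only term texts that share at least one word with the target segmented words are ever scored, which is the point of the inverted index: the candidate set stays small even when the terminology database is large.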

Description

Technical field

[0001] The present application relates to the technical field of data processing, and in particular to a medical term standardization method and device, computer equipment and a storage medium.

Background art

[0002] Medical terms are professional terms in the medical field, used to refer to various things, phenomena, characteristics, relationships and processes in the medical field (such as diseases, drugs, surgical operations and examinations). These terms are necessary components for clinical information systems to express medical information.

[0003] When medical text data has not undergone data standardization, it contains a great deal of non-standard data, such as non-standard medical aliases and synonyms, and cannot be brought under a unified standard. As a result, the medical term data is difficult to use in subsequent medical applications, which wastes data. The standardization of data refers to the unified correspondence of non-...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/30, G06F40/295, G06F16/31
CPC: G06F40/295, G06F40/30, G06F16/319
Inventor: 施维, 郭建福, 张旭
Owner: 深圳平安医疗健康科技服务有限公司