Multi-language medical term specification standardization system and method based on deep adversarial learning

A multilingual, medical terminology technology

Active Publication Date: 2021-09-10
杭州莱迈医疗信息科技有限公司
View PDF7 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] 1. Due to the short history of Chinese medical informatization construction, the knowledge base and knowledge map resources containing Chinese medical standard terms, synonyms and other important semantic relationships, such as the relationship between diseases and drugs, are limited
[0008] 2. The term standardization method using synonyms as the main matching method is not enough to cover a large number of diverse term irregular express

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-language medical term specification standardization system and method based on deep adversarial learning
  • Multi-language medical term specification standardization system and method based on deep adversarial learning
  • Multi-language medical term specification standardization system and method based on deep adversarial learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0051] Such as figure 1 As shown, this embodiment provides a standardization system for multilingual medical terminology based on deep adversarial learning. It includes a standard medical term base, a file preprocessing module, a candidate term set generation module, a candidate term set rearrangement module, and an output module.

[0052] The file preprocessing module is used to unify the format of medical terminology strings in the real world, and divide long strings into individual Chinese and English word tokens. The file preprocessing module specifically includes medical terminology labeling corpus, encoding, character unification module and first word segmentation module; medical terminology labeling corpus: used to train the automatic term matching model based on deep learning, and combine the artificially constructed real-world medical...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-language medical term specification standardization system based on deep adversarial learning. The system comprises a standard medical term library, a file preprocessing module, a candidate term set generation module and a candidate term set rearrangement module. The file preprocessing module is used for unifying formats of medical term character strings in the real world and dividing long character strings into single Chinese and English tokens; the candidate term set generation module is used for screening first n candidate standard terms most similar to real-world medical terms from a standard medical term set; and the candidate term set rearrangement module is used for forming term pairs by using real-world medical terms and screened candidate terms, marking the term pairs into positive examples and negative examples according to the matching degree, taking the positive examples and the negative examples as input of a deep learning model, and generating an automatic term matching model through iterative optimization of the model.

Description

technical field [0001] The invention relates to a standardization system and method for multilingual medical terminology based on deep adversarial learning, and belongs to the field of medical technology. Background technique [0002] In the field of domestic medicine, especially in the writing of electronic medical records, there are many kinds of terms and the situation of non-standardization is serious. The previous scheme is to extract similar standard terms according to Chinese character strings after information extraction from electronic medical records, and then experts in the medical field Full manual or semi-automatic proofreading of these terms is performed. The standardization of medical terminology is time-consuming and laborious, and the efficiency is not high. [0003] After retrieval, Publication No. CN109446340A, a medical standard terminology ontology management system and method, equipment and storage medium, the patent discloses the term concept system, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/31G06F16/36G06F40/247G06F40/289G06F40/30G06K9/62G06N3/04G16H10/60
CPCG06F40/289G06F40/30G06F16/367G06F16/316G06F40/247G16H10/60G06N3/045G06F18/214
Inventor 任元凯江振荣
Owner 杭州莱迈医疗信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products