Medical text relation extraction method based on pre-training model and fine tuning technology

A technology of relation extraction and pre-training, applied in patient care, neural learning methods, healthcare informatics, etc., to improve performance

Active Publication Date: 2019-08-16
WUYI UNIV
View PDF5 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, such methods generally require a larg

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Medical text relation extraction method based on pre-training model and fine tuning technology
  • Medical text relation extraction method based on pre-training model and fine tuning technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The specific embodiment of the present invention will be further described below in conjunction with accompanying drawing:

[0035] Such as figure 1As shown, this embodiment provides a method for extracting medical text relations based on pre-training models and fine-tuning technology. The present invention uses deep neural networks to extract medical documents such as Chinese and English medical documents and medical records marked with the relationship between drug entities and disease entities. Train the model in the text corpus, and use the trained model to extract the relationship between the Chinese and English medical texts that are not marked with the above-mentioned entity relationship, and extract the relationship between the relevant drug and the disease, including the following steps:

[0036] S1), preprocessing the medical relationship extraction corpus, which is mainly to perform word segmentation and stem extraction on the input medical text, and manually...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a medical text relation extraction method based on a pre-training model and a fine tuning technology. The method comprises the steps of preprocessing of medical relation extraction corpora, model pre-training and fine tuning. According to the method, the pre-training model is used as the input of the one-dimensional convolutional neural network model, but the word embedding is used as the input of the one-dimensional convolutional neural network model in the prior art, and the pre-training model is more favorable for improving the extraction performance of the medicaltext relationship compared with the word embedding; according to the method, the one-dimensional convolutional neural network model and the pre-training model are combined for use, and the one-dimensional convolutional neural network is used for finely adjusting the pre-training model, so that the performance of the model is improved; the training error of the one-dimensional convolutional neuralnetwork is propagated back to a pre-training model to realize a model fine tuning process which is a dynamic model training process; in a traditional method, word embedding is combined with input of different layers, a main task model is still trained from the beginning, pre-trained embedding is regarded as a fixed parameter, and the usability of the method is limited.

Description

technical field [0001] The present invention relates to the technical field of natural language data analysis and processing, in particular to a method for extracting text relationships in the medical field, and more specifically to a method for extracting medical text relationships based on pre-training models and fine-tuning techniques. Background technique [0002] There are two Chinese explanations about medical treatment: 1. Healing, 2. Treatment of diseases. The history of Chinese medicine has been around for thousands of years, but this word has only appeared in recent decades. It is a new word to be in line with international standards. Before that, treatment was mostly used, and medical care also included health care content. [0003] Medical data is of great value, especially in relation to medical texts. Medical data mining has risen to a national strategy, and it is also a research hotspot in the competition between academia and industry around the world. How ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/332G06F16/36G06F17/27G06N3/04G06N3/08G16H10/00
CPCG06F16/3329G06F16/36G06N3/084G16H10/00G06F40/284G06N3/044G06N3/045Y02A90/10
Inventor 陈涛吴明芬杨开漠
Owner WUYI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products