Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Graph convolutional network biomedical information extraction method based on multi-head attention mechanism

A biomedical and convolutional network technology, applied in the field of biomedical information extraction, can solve problems such as performance degradation, difficulty in meeting the requirements of extraction tasks, and difficulty in detecting cross-sentence entities, so as to improve performance and reduce the impact of noisy data Effect

Pending Publication Date: 2021-12-31
XI AN JIAOTONG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing methods rely on traditional neural networks to extract relationships between long and difficult sentences, mainly using convolutional neural networks or recurrent neural networks. However, traditional neural networks have always had bottlenecks in processing overly long sentences. As the length of sentences continues to increase, the performance continues to decline. , it is difficult to meet the requirements of relation extraction tasks in pathological scenarios
[0003] Existing technologies are mainly based on recurrent neural networks and convolutional neural networks for medical text information extraction. However, pathology reports include long texts and complex descriptions, so entities across sentences are difficult to detect, which will lead to key information being easily extracted. was left out
Compared with traditional machine learning and neural network methods, graph neural networks can capture semantic and grammatical information in non-adjacent sentences by relying on the dependency structure between sentences. However, it is still difficult to distinguish the correlation of text features using existing methods.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Graph convolutional network biomedical information extraction method based on multi-head attention mechanism
  • Graph convolutional network biomedical information extraction method based on multi-head attention mechanism
  • Graph convolutional network biomedical information extraction method based on multi-head attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0088] A biomedical information extraction method using a graph convolutional network based on a multi-head attention mechanism, which is applied to the following scenarios:

[0089] Dataset: Two public biomedical datasets are used: CDR corpus and Chemprot corpus. At the same time, a cross-institutional dataset of cancer pathology reports is constructed. The characteristics are as follows, CDR corpus: whether there is a relationship between chemical substances and diseases.

[0090] Chemprot corpus: whether there is a relationship between chemicals and proteins;

[0091] Cancer pathology report: including cancer type, tumor resection location, largest tumor diameter, histological subtype, histological grade, TNM stage and lymph node metastasis.

[0092] Specific steps are as follows:

[0093] 1. Establish a graph convolutional network model based on the multi-head attention mechanism for the text data information of the dataset

[0094] 1.1 Use python to implement the initia...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a graph convolutional network biomedical information extraction method based on a multi-head attention mechanism. The method comprises the following steps of: firstly, establishing a graph convolutional network model based on a multi-head attention mechanism; then training a hybrid model; optimizing the model and adjusting parameters to obtain a final model; and finally, performing relation extraction in the biomedical text and feature extraction of a pathological report by using the final model. According to the method, the influence of noise data can be effectively reduced while the performance of relation extraction is improved, and valuable content in long-distance information in the biomedical text is effectively kept. A transfer learning method is used for processing pathological reports of different formats and writing styles, and good universality and reusability are achieved. The method is applied to information and relation extraction with cancer pathology reports as data sources, the recognition effect is good, universality is high, and pathology detection efficiency is effectively improved.

Description

technical field [0001] The invention belongs to the technical field of biomedical information extraction, and relates to a text information extraction method using a graph convolutional network, in particular to a graph convolutional network biomedical information extraction method based on a multi-head attention mechanism. Background technique [0002] Cancer is the number one killer of human health. As an important and easy-to-observe morphological feature of cancer, phenotype provides a valuable window for understanding this complex thing. It is necessary to automatically extract the relationship between phenotype and diagnosis, and perform information extraction and semantic understanding on massive patient pathology reports. However, the complexity of pathological phenotypes and the complexity of diagnostic logic make the corresponding descriptions exist in pathology reports in the form of long and difficult sentences, which poses a great challenge to the task of relati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/33G06F16/35G06N3/04G06N3/08
CPCG06F16/3347G06F16/3344G06F16/353G06N3/08G06N3/044G06N3/045Y02D10/00
Inventor 李辰吴佳伦林思源张若楠龚铁梁汤凯雯
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products