Biomedical event extraction method integrated with structured element recognition

A biomedical and event extraction technology, applied in the field of information extraction, can solve the problems of low precision and cascading errors in complex event extraction

Active Publication Date: 2018-03-20
DALIAN UNIV OF TECH
View PDF7 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The present invention provides a biomedical event extraction system integrated with structured element recognition, realizes the function of extracting biomedical events from a large number of biomedical documents, and solves the cascading errors existing in the staged event extraction system and the low extraction accuracy of complex events to improve the level of existing biomedical event extraction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Biomedical event extraction method integrated with structured element recognition
  • Biomedical event extraction method integrated with structured element recognition
  • Biomedical event extraction method integrated with structured element recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0084] The invention can perform trigger word recognition, structural element recognition and double decomposition processing on a large number of biomedical documents to extract biomedical events, thereby helping biomedical researchers to obtain effective information efficiently and quickly. Such as figure 1 A joint system framework diagram for bi-decomposition-based biomedical event extraction.

[0085] 1. User input biomedical literature corpus

[0086]The user inputs the training corpus in the form of files. The BioNLP'09 corpus is taken as an example below, which includes the txt file containing the original text, the a1 file containing the protein, and the a2 file with marked biomedical events. For example, in the original sentence "prevented induction 1L-10 production by gp41 in monocytes", indicate the proteins "1L-10" and "gp41".

[0087] 2. The system extracts events from the corpus

[0088] (1) Raw corpus preprocessing

[0089] First of all, we need to process t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a biomedical event extraction method integrated with structured element recognition, and belongs to the technical field of biomedical information extraction. The biomedical event extraction method comprises the following steps of trigger word recognition based on PA online learning algorithm classification, structural element recognition based on a Viterbi algorithm and a PAonline learning algorithm, and joint event extraction based on the double-composition principle. By means of the extraction method, trigger word recognition, element recognition and double-composition operation can be conducted on a lot of biomedical literature to extract biomedical events. Word vectors and rich and effective features are added, so that the trigger word and element recognition performance is improved; by using structural element recognition, the precision of complex event extraction is improved; by using a joint event extraction system, cascading errors of a phased system areeliminated.

Description

technical field [0001] The invention belongs to the technical field of information extraction, and relates to a method for extracting biomedical events from biomedical literature, specifically referring to trigger word identification based on PA online learning algorithm, structural element identification based on Viterbi algorithm and PA online learning algorithm, and Joint event extraction based on double decomposition principle. Background technique [0002] Biomedical events are composed of event trigger words and elements. Event trigger words are often verbs or gerunds, and elements are usually biomedical entities or nested events. The mainstream biomedical event extraction methods include three sub-processes: trigger word recognition, feature recognition, and post-processing. The identification process of trigger words and elements is usually regarded as a classification problem, which can be done using traditional rule-based methods or machine learning methods. The ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/355G06F16/36
Inventor 李丽双钱爽刘阳
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products