Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Biomedical event trigger word identification method based on syntactic word vector

A biomedical and event-triggered technology, applied in the field of recognition, can solve the problems of high feature vector dimension, inability to guarantee performance, sparseness, etc., to reduce the input feature dimension, improve the generalization ability, trigger word recognition performance, and the effect of accurate classification

Active Publication Date: 2015-10-07
DALIAN UNIV OF TECH
View PDF2 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method is obviously better than the method based on rule matching, and has been widely used, but there are still some shortcomings, it is more dependent on the labeled data, when the labeled data is relatively small, its performance cannot be guaranteed; moreover, the construction of features is mainly for The current data set is manually constructed, the manual intervention is relatively large, the generalization performance is not good, the manually constructed feature vector has a high dimension and is sparse, and the training is time-consuming

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Biomedical event trigger word identification method based on syntactic word vector
  • Biomedical event trigger word identification method based on syntactic word vector
  • Biomedical event trigger word identification method based on syntactic word vector

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention will be further described below in conjunction with accompanying drawing.

[0040] like figure 1 As shown, a biomedical event trigger word recognition method based on syntactic word vectors, including the following steps:

[0041] Step 1. Preprocessing the unlabeled data: Select all Pubmed abstracts from 1997 to 2009 and preprocess them, including the following sub-steps:

[0042] (a), using the Genia Sentence Spliter sentence tool dedicated to the biological field to process the Pubmed abstract into sentences;

[0043] (b), using the GDep syntax analysis tool dedicated to the biological field to perform syntax analysis on the Pubmed abstract that has been segmented in substep (a); with the sentence "Leukotriene B4 stimulates c-fos and c-jun genetranscription and AP-1 binding activity in human monocytes." as an example, the final block diagram of the syntactic analysis tree is as follows Image 6 shown.

[0044] Step 2. Word vector training bas...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an identification method, in particular to a biomedical event trigger word identification method based on a syntactic word vector. The biomedical event trigger word identification method comprises the following steps of: 1, pre-processing un-marked data; 2, carrying out word vector training based on syntactic context information; 3, constructing a candidate trigger word dictionary; 4, constructing a trigger word semantic feature vector; 5, training a deep learning model; and 6, identifying a biomedical event trigger word. According to the biomedical event trigger word identification method, syntactic information of the trigger word is precisely acquired by utilizing a larger number of trained word vectors capable of obtaining unmarked data, and input characteristic dimension is effectively reduced; concealed features among the input features are leaned by utilizing the deep learning model, so that the input features are sorted more precisely; and finally, fine adjustment is carried out on word vector information in a training process, so that the word vector information is more suitable for a data set, and thus, the generalization ability and the trigger word identification word of the model are effectively improved.

Description

technical field [0001] The present invention relates to a recognition method, more specifically, relates to a biomedical event trigger word recognition method based on a syntactic word vector. Background technique [0002] With the rapid development of systems biology, the need to reveal the complex relationships among biomolecules, cells, and tissues is becoming more and more urgent. At the same time, the published biomedical literature is also showing explosive growth. How to mine complex relationships among biomolecules, cells, tissues, etc. from a large number of biomedical literatures instead of traditional simple binary relationships (such as protein -protein relationship, drug-drug relationship, etc.) has become a research hotspot in the field of modern biomedical text mining. [0003] At present, most biomedical event extraction methods divide this task into two subtasks: trigger word recognition and element detection, and trigger word recognition plays a decisive r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
Inventor 王健张建海林鸿飞张益嘉
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products