Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A trigger word labeling system and method for biomedical events

A biomedical and trigger word technology, applied in the field of trigger word tagging of biomedical events, can solve problems such as inability to make full use of context information, and achieve the effect of improving recall rate and accuracy rate

Active Publication Date: 2019-07-19
NANJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are also shortcomings such as not being able to make full use of context information, and needing to assume the independence of feature output.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A trigger word labeling system and method for biomedical events
  • A trigger word labeling system and method for biomedical events
  • A trigger word labeling system and method for biomedical events

Examples

Experimental program
Comparison scheme
Effect test

example 2

[0071] That is, use the abner named entity recognition tool to find out the protein molecule in the sentence. After the sentence in Example 1 is recognized, it finds "interferon regulatory factor 4" as the protein molecule. The sentence after replacement is as in Example 2: "Down-regulation of Protein1gene expression in leukemic cells due to hypermethylation of CpG motifs in the promoter region."

[0072] (1-3) Feature extraction includes

[0073] Extract syntactic and semantic features of words.

[0074] Syntactic features include morphological features, part-of-speech features, and ngram context features.

[0075] Morphological features include some part-of-speech features of the word itself, such as whether it is a number, whether it is a combination of numbers and characters, whether it contains symbols such as "+, -, / ", whether the first letter is capitalized, whether it is all uppercase, whether it is all lowercase, etc. , these features can be obtained by means of st...

example 3

[0088]

[0089]The table is part of the feature vector of the word sequence obtained after preprocessing, feature 0 is the word itself, feature 1 is the part of speech, feature 2 is the 3-gram context of the word, feature 3 is the path length of the nearest protein, and is marked as a trigger Word tagging, where T is a trigger word, P is a protein, M is a symbol, and O is a general word. Taking the current word "expression" as an example to construct a feature function:

[0090]

[0091]

[0092]

[0093]

[0094]

[0095]

[0096]

[0097]

[0098]

[0099]

[0100]

[0101]

[0102] The word itself and 3 features generate 4 transition feature functions and 4 state feature functions, these feature functions are substituted into the CRFs model, and the weights corresponding to each feature function are obtained through training, and the trigger word labeling model for biomedical events is obtained .

[0103] (3) label

[0104] In the pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a trigger word tagging system and method for biomedical events. The trigger word tagging system comprises a pretreatment module, a tagging model building module and a tagging module, wherein the pretreatment module is used for acquiring a training sample and a testing sample and comprises a word segmentation unit, a protein molecule identification unit, a feature extraction unit and a pre-tagging unit; the word segmentation unit is used for acquiring the word sequence of an original text; the protein molecule identification unit is used for identifying protein molecules and replacing with a standard mode to bring more convenience for feature extraction and trigger word tagging; the feature extraction unit is used for extracting the word forms, the word characteristics and other syntactic properties and semantic properties, and finally pre-tags the word sequence as a training and testing sample set; the tagging model building module is used for building a feature template, generating characteristic functions, and estimating weights corresponding to the characteristic functions to obtain a CRFs trigger word tagging model; the tagging module is used for trigger word tagging of an unknown test sequence and displays the result on a GUI interface.

Description

technical field [0001] The invention relates to a trigger word labeling method and system for biomedical events, belonging to the field of computer and information technology. Background technique [0002] The development of biology and information technology has made the experimental research of life science more and more in-depth, and the experimental data from all levels of life science are revealing the mysteries of life more and more comprehensively from all angles. The number of biomedical documents recording these mysteries of life has also shown exponential growth, coupled with the diversity of media forms and structural complexity, these documents have become veritable big data today. As of the end of 2014, the US National Library of Medicine (NCBI) database had more than 24 million citation records of biomedical literature. Mysterious and unpredictable life information is scattered like pearls in these massive biotechnology documents. For medical users, accurate r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27
Inventor 龚乐君
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products