Trigger word tagging system and method for biomedical events
What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A biomedical and trigger word technology, applied in the field of trigger word tagging of biomedical events, can solve problems such as insufficient use of context information
Active Publication Date: 2016-01-20
NANJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 29 Cited by
Summary
Abstract
Description
Claims
Application Information
AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology
Problems solved by technology
However, there are also shortcomings such as not being able to make full use of
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more
Image
Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
Click on the blue label to locate the original text in one second.
Reading with bidirectional positioning of images and text.
Smart Image
Examples
Experimental program
Comparison scheme
Effect test
example 2
[0071] That is, use the abner named entity recognition tool to find out the protein molecule in the sentence. After the sentence in Example 1 is recognized, "interferonregulatoryfactor4" is found to be the protein molecule.
[0072] (1-3) Feature extraction includes
[0073] Extract syntactic and semantic features of words.
[0074] Syntactic features include morphological features, part-of-speech features, and ngram context features.
[0075] Morphological features include some part-of-speech features of the word itself, such as whether it is a number, whether it is a combination of numbers and characters, whether it contains symbols such as "+, -, / ", whether the first letter is capitalized, whether it is all uppercase, whether it is all lowercase, etc. , these features can be obtained by means of string processing functions and regular expressions, such as judging whether "-" is included, you can use the String.contains() function in the java language to judge. The featur...
example 3
[0088]
[0089] The table is part of the feature vector of the word sequence obtained after preprocessing, feature 0 is the word itself, feature 1 is the part of speech, feature 2 is the 3-gram context of the word, feature 3 is the path length of the nearest protein, and is marked as a trigger Word tagging, where T is a trigger word, P is a protein, M is a symbol, and O is a general word. Taking the current word "expression" as an example to construct a feature function:
[0090] b 1 ( X , i ) = 1 i f X i = exp r e s ...
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more
PUM
Login to view more
Abstract
The invention discloses a trigger word tagging system and method for biomedical events. The trigger word tagging system comprises a pretreatment module, a tagging model building module and a tagging module, wherein the pretreatment module is used for acquiring a training sample and a testing sample and comprises a word segmentation unit, a protein molecule identification unit, a feature extraction unit and a pre-tagging unit; the word segmentation unit is used for acquiring the word sequence of an original text; the protein molecule identification unit is used for identifying protein molecules and replacing with a standard mode to bring more convenience for feature extraction and trigger word tagging; the feature extraction unit is used for extracting the word forms, the word characteristics and other syntactic properties and semantic properties, and finally pre-tags the word sequence as a training and testing sample set; the tagging model building module is used for building a feature template, generating characteristic functions, and estimating weights corresponding to the characteristic functions to obtain a CRFs trigger word tagging model; the tagging module is used for trigger word tagging of an unknown test sequence and displays the result on a GUI interface.
Description
technical field [0001] The invention relates to a trigger word labeling method and system for biomedical events, belonging to the field of computer and information technology. Background technique [0002] The development of biology and information technology has made the experimental research of life science more and more in-depth, and the experimental data from all levels of life science are revealing the mysteries of life more and more comprehensively from all angles. The number of biomedical documents recording these mysteries of life has also shown exponential growth, coupled with the diversity of media forms and structural complexity, these documents have become veritable big data today. As of the end of 2014, the US National Library of Medicine (NCBI) database had more than 24 million citation records of biomedical literature. Mysterious and unpredictable life information is scattered like pearls in these massive biotechnology documents. For medical users, accurate r...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more
Application Information
Patent Timeline
Application Date:The date an application was filed.
Publication Date:The date a patent or application was officially published.
First Publication Date:The earliest publication date of a patent with the same application number.
Issue Date:Publication date of the patent grant document.
PCT Entry Date:The Entry date of PCT National Phase.
Estimated Expiry Date:The statutory expiry date of a patent right according to the Patent Law, and it is the longest term of protection that the patent right can achieve without the termination of the patent right due to other reasons(Term extension factor has been taken into account ).
Invalid Date:Actual expiry date is based on effective date or publication date of legal transaction data of invalid patent.