Method and system for news event extraction based on neural network

An event extraction and neural network technology, which is applied in the field of news event extraction methods and systems, can solve the problems of lack of event extraction methods, ignoring context relationships, and wrong judgment of event categories, so as to solve the problem of news event identification and avoid ambiguity. , the effect of improving the accuracy

Inactive Publication Date: 2017-10-10
CHINA UNIV OF MINING & TECH
View PDF4 Cites 64 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Based on the above research status, there are mainly the following problems in the extraction of news events: First, the discrimination of news events mainly depends on the trigger words themselves, ignoring the contextual relationship. When encountering ambiguous candidate trigger words, it is easy to cause the event category Misjudgment
Second, network texts, especially Weibo texts, are mostly irregular sentences, and current event extraction methods lack research on extracting events from irregular sentences.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for news event extraction based on neural network
  • Method and system for news event extraction based on neural network
  • Method and system for news event extraction based on neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described below through specific embodiments.

[0044] Such as figure 1 Shown is a neural network-based news event extraction system, including a text and processing module, a neural network training module, and a news event prediction module, in which:

[0045] The text and processing module is used for data preprocessing of the original text of the training corpus, including: segmenting the original text of the training corpus to obtain event sentences, and then performing word segmentation and naming body recognition on the event sentences; according to the news event information manually marked, Sequence labeling of event sentences, labeling of trigger words according to their types, labeling of non-trigger words as no category, and obtaining event sentence sequences; and expressing event sentence sequences in the form of word vectors;

[0046] The neural network training module includes a two-way long-short-term memory network...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and system for news event extraction based on neural network. The method comprises the steps of: conducting data pre-processing on original text of training corpus; introducing an event sentence sequence represented by a word vector into a bidirectional long and short memory network, using the bidirectional long and short memory network to train and obtain semantic features of each candidate trigger word; introducing the event sentence sequence represented by a word vector into a convolutional neural network, using the convolutional neural network to train and obtain global features of the event sentences where the candidate trigger words are in; according to the semantic features of candidate trigger words and the global features of the event sentences where the candidate trigger words are in, using softmax as a classifier to classify each candidate trigger word, and therefore finding out the trigger words for news events, and according to the trigger word type, judging the type of the event. The method and system can quickly and accurately extract news events and deal with news events contained in non-standard statements, and has the advantages of high efficiency and universal applicability.

Description

technical field [0001] The present invention relates to natural language processing, in particular to a news event extraction method and system based on the combination of bidirectional long-short-term memory network (BiLSTM) and convolutional neural network (CNN). Background technique [0002] With the development of computers and the increasing popularity of the Internet, a large amount of information appears in front of people in the form of electronic text. In a large number of network texts, how to discover valuable news events has become an urgent problem to be solved, and event extraction is produced under this background. As a subtask of information extraction, event extraction is a research hotspot in information extraction, and its research content is to automatically discover specific types of events and their event elements from natural texts. [0003] Extracting corresponding events from text is usually realized by identifying trigger words of events, so trigge...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/35G06F40/295G06F40/30
Inventor 周勇刘兵陈斌王重秋
Owner CHINA UNIV OF MINING & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products