News text-oriented event line extraction method based on deep clustering model

An event line and text technology, applied in the field of information processing, can solve the problem of not being able to extract text event representation, and achieve the effects of fast speed, clear event representation, and simple model structure

Active Publication Date: 2020-05-08
SOUTHEAST UNIV
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Purpose of the invention: In order to overcome the deficiencies in the prior art, the present invention provides an unsupervised event line extraction method based on a deep clustering model for news texts, which can solve the problem that the event line cannot be extracted from the text during the event line extraction process. Containing the defects of event representation, realize the extraction of event lines in news texts without labeling data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • News text-oriented event line extraction method based on deep clustering model
  • News text-oriented event line extraction method based on deep clustering model
  • News text-oriented event line extraction method based on deep clustering model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Below in conjunction with accompanying drawing and specific embodiment, further illustrate the present invention, should be understood that these examples are only for illustrating the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various aspects of the present invention All modifications of the valence form fall within the scope defined by the appended claims of the present application.

[0040] The embodiment of the present invention discloses a news text-oriented event line extraction method based on a deep clustering model, assuming that in the model, each news text m is assigned an event instance e, e is a location entity l, an organization entity o, The joint distribution of person entity p and keyword w. An event line s is the progression of events over time. Each event line can be viewed as a highly correlated event sequence s=[e 1 ,e 2 ,......

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a news text-oriented event line extraction method based on a deep clustering model. The method comprises the steps of preprocessing news texts; wherein an initial hidden eventof the text represents pre-training; grouping news texts in the corpus according to the release time; determining an event line to which each piece of news in each group belongs based on a deep clustering model; performing post-processing on the event elements with the same event line number in each group to obtain structured display of the event; and carrying out post-processing on each group ofextracted events with the same event line number to obtain an event line. The neural network model is adopted to automatically extract the event features implicit in the text, manual feature selectionand construction are avoided, and the extracted implicit event features of the text can provide support for downstream applications; event feature extraction and event line extraction can be carriedout at the same time, and the possibility of error propagation is reduced. Compared with a conventional event line extraction method, the method is higher in extraction accuracy and recall rate.

Description

technical field [0001] The invention relates to a method for unsupervised event line extraction of news text by using a computer, and belongs to the technical field of information processing. Background technique [0002] With the rapid development of online news media websites and mobile news applications, the massive news reports generated by social media every day have become the main way for people to obtain and pay attention to domestic and foreign events, which has had a huge impact on society. However, the value of massive news reports varies from high to low, and different people pay different attention to them. Moreover, for some events that last for a long time, people tend to ignore the correlation and development trend between events. Therefore, people urgently need a tool that can automatically extract hot events from massive news texts and show how events change dynamically over time. [0003] Event line extraction mainly focuses on extracting popular time fro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/9535G06F40/295
CPCG06F16/9535Y02D10/00
Inventor 周德宇司加胜郭林森
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products