Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

News topic timeline abstract generating method based on breakthrough point

A breakthrough point and timeline technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as inability to guarantee close correlation, small redundancy, and failure to consider related major events

Inactive Publication Date: 2012-08-22
TSINGHUA UNIV
View PDF0 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0024] The above three methods did not consider the relevant major events that occurred on the day of the breakthrough point when generating the summary of the breakthrough point, but only considered the selection of sentences with a large amount of information and low redundancy, so the generated summary cannot be guaranteed to be consistent with the breakthrough point itself. closely related

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • News topic timeline abstract generating method based on breakthrough point
  • News topic timeline abstract generating method based on breakthrough point
  • News topic timeline abstract generating method based on breakthrough point

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The technical solution of the present invention will be further described in detail below.

[0040] Such as figure 2 As shown, this method of generating news topic timeline summaries based on breakthrough points includes the following steps:

[0041] (1) Use the topic keyword entered by the user as the search term, use the crawler to download all the news articles searched with the search term from the relevant news websites, and then perform preprocessing on these news articles. The preprocessing includes: lowercase letters, remove stop words, numbers and punctuation marks, from which a news corpus of the target topic is constructed;

[0042] (2) Establish a topic activity hidden Markov model for the activity trend of the target topic in each time segment, and delete the time segment where the target topic is not active;

[0043] (3) In each time segment where the target topic is active, first use the topic activity hidden Markov model in step (2) to dig out each to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a news topic timeline abstract generating method based on a breakthrough point, and the method is used for automatically and efficiently discovering significant instants and important events happening in the development process of a target news topic, thereby greatly assisting readers in understanding the evolution process of a news topic. The method comprises the following steps: (1) according to topic keywords input by users, downloading all news articles (searched by using search terms) from related news websites, and then carrying out pretreatment on the news articles; (2) establishing a topic-activity hidden Markov model for the activity variation trend of a target topic in each time slice, and deleting the time slices in which the target topic is not active; (3) carrying out modeling on a topic shift sequence in each time slice by using a topic-shift hidden Markov model; (4) extracting sentences (relevant to an important event happening on that day) as an abstract of the breakthrough point; and (5) outputting the timeline abstract of the target topic.

Description

technical field [0001] The invention relates to the technical field of computer application technology, in particular to a method for generating news topic timeline summaries based on breakthrough points. Background technique [0002] In today's era of information explosion, people can read and download various news reports on a news topic from the Internet for free. Due to the large number of related news articles on a news topic (especially hot news topics) on the Internet, it is difficult for readers to efficiently and time-savingly understand the development trend and evolution process of a target news topic from numerous related news reports. [0003] The difficulty of generating news topic timeline summaries includes how to determine important time points (ie, breakthrough points) in the development of a news topic from news reports related to a news topic, and how to generate timeline summaries based on news related to a breakthrough point. The methods in the prior a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 黄民烈朱小燕
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products