News text event and time extraction and standardization system for event tracking

A time extraction and event technology, applied in text database query, text database clustering/classification, unstructured text data retrieval, etc., can solve problems such as long duration, difficulty in time extraction and normalization, and difficulty in grasping primitive time , to achieve the effect of accurate primitive time

Pending Publication Date: 2020-12-11
HANGZHOU XUJIAN SCI & TECH CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The problem to be solved by the present invention is that it is difficult to grasp the event primitive time with longer duration, and the problem of time extraction and standardization difficulty; the purpose of the invention is to provide a kind of news text event, time extraction and standardization system for event tracking. Carry out preprocessing such as clustering and part-of-speech tagging on the text, then perform sub-event extraction and normalization on the processed text, perform similarity detection on the normalized sub-events, perform time extraction and normalization on non-repeated events, and finally extract new sub-events Insert the timeline of the event to complete the continuous tracking of the event

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • News text event and time extraction and standardization system for event tracking
  • News text event and time extraction and standardization system for event tracking
  • News text event and time extraction and standardization system for event tracking

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028]The technical solutions in the embodiments of the present invention are clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0029] Such as Figure 1~2 Shown, the present invention provides a kind of news text event for event tracking, time extraction and normalization system, comprise data acquisition and processing module (01), news text preprocessing module (02), event and time entity extraction module (03 ), time normalization module (04), time axis establishment module (05);

[0030] Data collection and processing module (01): Obtain daily news texts and comments and o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention aims to provide a news text event and time extraction and standardization system for event tracking. The news text event and time extraction and standardization system comprises a data acquisition and processing module (01), a news text preprocessing module (02), an event and time entity extraction module (03), a time standardization module (04) and a time axis establishment module (05); the invention comprises the following steps: firstly, carrying out clustering, part-of-speech tagging and other preprocessing on texts, then carrying out sub-event extraction and standardizationon the processed texts, carrying out similarity detection on standardized sub-events, carrying out time extraction and standardization on non-repetitive events, and finally, inserting new sub-events into an event time axis to which the new sub-events belong so as to complete continuous tracking of the events. When the time is normalized, the selection of the primitive time is not only limited to the current text, but is continuously associated with the preorder event of the event, so that the primitive time acquired by the method is more accurate.

Description

technical field [0001] The invention belongs to the technical field of event tracking, and in particular relates to a news text event, time extraction and normalization system for event tracking. Background technique [0002] With the rapid development of natural language processing, the recognition, extraction and reasoning of event information play an important role in text understanding. Especially in news event texts, the requirements for timeliness are high. From the beginning, duration, disposal to the end of an event, what happened at each point in time can establish an event tracking timeline for the event, which is very important for understanding the development of the situation, summarizing the event, and analyzing and summarizing the event afterwards. [0003] Defects and insufficiencies of the existing technology: At present, due to the complexity of Chinese expressions, the difficulty of semantic understanding, and the difficulty of text processing, most studi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/953G06F16/33G06F16/35G06F40/253G06F40/295G06F40/30
CPCG06F16/3344G06F16/35G06F16/953G06F40/253G06F40/295G06F40/30
Inventor 朱安安邱彦林陈尚武
Owner HANGZHOU XUJIAN SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products