Method for generating incident statement sentence material base

A material library and event description technology, applied in the field of computational linguistics, can solve problems such as low work efficiency and inability to meet the processing requirements of massive Chinese text data, and achieve the effect of accurate recognition

Active Publication Date: 2011-10-05
HYLANDA INFORMATION TECH
View PDF4 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the excavation and sorting of sentences in this patent application is largely dependent on manual work, and the work efficiency is not high, and it cannot meet the processing requirements of massive Chinese text data at all.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for generating incident statement sentence material base
  • Method for generating incident statement sentence material base
  • Method for generating incident statement sentence material base

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] figure 1 This is a schematic diagram of the basic process of converting from an article database to a sentence-level material database in the method for generating a statement sentence material database for this event. From figure 1 It can be seen that for each Chinese article in the article database, various types of sentence materials, such as "event statement" sentences, "direct quotation" sentences, etc., can be obtained through sentence-level material extraction operations. These "event statement" sentences, "direct quotations" and the like can be stored in corresponding event statement sentence material databases or direct quotation material databases respectively. It should be noted that for many sentences in the text, not every sentence can form valuable and meaningful material. Only those sentence types that have been determined and structured can form corresponding sentence-level materials. According to the actual needs of network editing work, a subset of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for generating an incident statement sentence material base. The method comprises the following steps of: converting an article into a set consisting of a plurality of long sentences; aiming at the converted long sentence set, identifying and extracting time points and extracting incident description verbs; identifying and extracting named entities of a personal name, a place name, a mechanism name and a product name from the long sentences obtained in the first step, extracting and marking element information comprising the occurrence time of an incident, theoccurrence place and the type of the incident to obtain a structural result; and extracting the original section of an incident statement sentence and the structural result, and storing into a database so as to generate the incident statement sentence material base. The incident statement sentence material base generated by the method can provide service such as updating, searching, inquiring andthe like in the Internet, and provides application such as writing, editing, subject making and the like for the media information field.

Description

technical field [0001] The invention relates to a method for generating a language material library, in particular to a method for generating a sentence-level material library for event statement sentences, and belongs to the technical field of computational linguistics. Background technique [0002] Material library, also called corpus, is the totality of language materials that are stored in computers and can be retrieved, inquired, and analyzed by computers. The material library has the characteristics of "large scale" and "authenticity", so it is the most ideal language knowledge resource. [0003] Text is the most basic and commonly used information carrier. In computer language processing work, text processing and processing technology is particularly important. Text information usually exists in the form of chapters. In many current Internet information processing applications, texts are also used as processing units, such as: network information, search engines, e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 宋传宝
Owner HYLANDA INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products