Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Generative Q&A Dataset Generation Method for News Events

A technology of event generation and data collection, which is applied in the fields of electronic digital data processing, digital data information retrieval, and special data processing applications, etc. Lack of problems such as question and answer data sets to achieve the effect of ensuring accuracy and effectiveness

Active Publication Date: 2021-08-03
PEKING UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, event generation question answering is not the same as database question answering. The expected answer of event generation question answering is a natural language sentence. Some database question answering data set technologies cannot meet the needs of event generation intelligent automatic question answering

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Generative Q&A Dataset Generation Method for News Events
  • A Generative Q&A Dataset Generation Method for News Events
  • A Generative Q&A Dataset Generation Method for News Events

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Below in conjunction with accompanying drawing, further describe the present invention through embodiment, but do not limit the scope of the present invention in any way.

[0023] The invention provides an event-oriented method for automatically generating a news scene generative question-answer data set, which can automatically construct and obtain a news scene generative question-answer data set, eliminating the workload of manually labeling data.

[0024] figure 1 It is a block flow diagram of the method for automatically constructing an event-oriented news scene generation type question answer data set provided by the present invention; it specifically includes the following steps:

[0025] 1) Extract all events with corresponding pages (link pages) from the event lists of all years in the current event page;

[0026] 2) For each event with a corresponding page, take the title of the event page as the core of the question, generate a question by generating a questi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for generating a news event generative question answer data set, which is used to construct an event-oriented news scene generative question answer data set, and can realize event generative intelligent automatic question answer; including: extracting and obtaining all pages with corresponding links The event of the event; the core information of the generated question template and the generated question is spliced ​​to generate the question of the event; the news link pages in all references below the event page are extracted, and the news text in the news link pages in the references is used as a corpus Put into the corpus; use the first paragraph of the body part of the event page as the reference answer to the question of the generated event. The method of the invention is automatically generated without manual labeling, and the generated news scene data has high accuracy and effectiveness.

Description

technical field [0001] The invention belongs to the technical field of intelligent generation of a question answering system, relates to a method for extracting news event data sources and generating a data set, and in particular to a method for constructing an event-oriented news scene generating question answering data set. Background technique [0002] An intelligent automatic question answering system is a system that can respond to questions raised by users. At present, intelligent automatic question answering systems and technologies are used in many scenarios, such as Apple's Siri, Microsoft's Xiaoice, and Baidu's Dumi. In the most ideal state, all questions that humans want to ask can be answered by machines, and all instructions made by humans can be responded to by machines reasonably. A successful automatic question answering system requires many different types and aspects of technology as support. [0003] Currently, a similar task for an intelligent automatic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/332
CPCG06F16/3329
Inventor 沙磊穗志方
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products