Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method and apparatus for training data for natural language questioning and answering system

A training data, natural language technology, applied in the direction of electrical digital data processing, special data processing applications, digital data information retrieval, etc., can solve problems such as missing information, and achieve the effect of improving accuracy and rational use

Inactive Publication Date: 2019-11-01
NTT DOCOMO INC
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, training data is extremely precious in today's big data era, and not making full use of low-quality data for training means that a lot of valuable information is lost, resulting in the need to screen from an extremely large number of sample data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and apparatus for training data for natural language questioning and answering system
  • A method and apparatus for training data for natural language questioning and answering system
  • A method and apparatus for training data for natural language questioning and answering system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] In order to make the objects, technical solutions and advantages of the present invention more apparent, exemplary embodiments according to the present invention will be described in detail below with reference to the accompanying drawings. Apparently, the described embodiments are only some embodiments of the present invention, rather than all embodiments of the present invention, and it should be understood that the present invention is not limited by the exemplary embodiments described here. Based on the embodiments described in the present invention, all other embodiments obtained by those skilled in the art without creative efforts shall fall within the protection scope of the present invention.

[0020] First, the basic idea of ​​the technology for providing training data for a natural language question answering system according to an embodiment of the present invention is briefly introduced. As mentioned earlier, in the training phase of the existing natural lan...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and equipment for training data of a natural language question-answering system, a computer readable storage medium and the natural language question-answering system.The method comprises the following steps: receiving original training data, wherein the original training data comprises at least one question in a question-answer pair form and a plurality of corresponding answers; determining data quality of the plurality of answers; based on the data quality, marking the plurality of answers as a first type of instances or a second type of instances; selectinga first type of instances and a second type of instances from the plurality of answers for combination to obtain a plurality of instance combinations; sorting the plurality of instance combinations, wherein the sorted instance combinations respectively correspond to training data of each time of training of the natural language question-answering system in a time sequence, the proportion of the first class of instances in the ranked plurality of instance combinations monotonically increases and the proportion of the second class of instances in the ranked plurality of instance combinations monotonically decreases.

Description

technical field [0001] The present invention relates to the field of artificial intelligence, and more specifically, the present invention relates to a method and device for providing training data for a natural language question answering system, a computer-readable storage medium, and a natural language question answering system. Background technique [0002] In recent years, with the continuous development of computer technology, the application of artificial intelligence in many fields has become more and more extensive. Natural language question answering system is an application of artificial intelligence in human natural language processing, which can accept questions described by users in natural language, and can find or infer the answers to user questions from a large amount of heterogeneous data, and Provide answers in natural language. With the help of natural language question answering system, users can ask questions in natural language and get accurate and fl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/35G06F17/27
Inventor 张驰郭心语李安新陈岚赵军刘康何世柱
Owner NTT DOCOMO INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products