Candidate answer screening method for global machine reading comprehension modeling

A technology of candidate answers and reading comprehension, applied in the direction of instruments, computer components, electrical digital data processing, etc., can solve problems such as inability to process, omission of the best candidate answer fragments, etc., and achieve the effect of improving the effect

Active Publication Date: 2018-12-07
黑龙江省工研院资产经营管理有限公司
View PDF11 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Obviously, the existing answer paragraph screening method is a local greedy method, which cannot deal with the phenomenon that multiple paragraphs in a chapter are related to the question, and will generate too many or too few candidate answer...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Candidate answer screening method for global machine reading comprehension modeling
  • Candidate answer screening method for global machine reading comprehension modeling
  • Candidate answer screening method for global machine reading comprehension modeling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] This embodiment proposes a method for screening candidate answers in global-oriented machine reading comprehension modeling, such as figure 1 As shown, the method uses all the paragraphs corresponding to the question as the location range of candidate answer fragments. First, obtain the F1 value between the text fragments of the paragraphs, and use F1 to filter out the best candidate answer fragments. On the other hand, extract the paragraph and the question After using the logistic regression model to carry out the correlation scoring process, the candidate answer paragraph set after screening is obtained according to the score, and then it is judged whether the paragraph where the best candidate answer segment is located is in the candidate answer paragraph set, and The paragraph in which the best candidate answer segment is located is forcibly placed at the top of the candidate answer paragraph set, and finally the best candidate answer segment and the candidate answe...

Embodiment 2

[0053] This embodiment proposes a method for screening candidate answers in global machine reading comprehension modeling, and the specific process of the screening method for candidate answers is as shown in Table 1:

[0054] Table 1: Screening process of candidate answer paragraphs globally

[0055]

[0056]

[0057] The candidate answer screening method described in this embodiment, when training, mark the paragraph containing the answer as category 1, and the rest as category 0. When predicting, each paragraph will predict a probability value indicating that the paragraph contains the answer possibility. In this embodiment, the sample is randomly divided into 6:4 for parameter selection, the global screening strategy is adopted, and the number of selected paragraphs is set to a fixed value of 5.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a candidate answer screening method for global machine reading comprehension modeling, which belongs to the technical field of computer information screening. All the paragraphscorresponding to a question are taken as a candidate answer fragment locating range in the method. Firstly, the F1 values between the text fragments of paragraphs are obtained, and the best candidateanswer fragment is selected by using F1; and on the other hand, after the features between the paragraphs and the question are extracted, correlation scoring is carried out by using a logistic regression model, a selected candidate answer paragraph set is obtained according to the scores, whether the paragraph where the best candidate answer fragment is located is in the candidate answer paragraph set is determined, and the paragraph where the best candidate answer fragment is located is forcibly put in the first place of the candidate answer paragraph set. Finally, the best candidate answerfragment and the candidate answer paragraph set are output. The method has the advantage that the efficiency of training and prediction is improved.

Description

technical field [0001] The invention relates to a method for screening candidate answers in global-oriented machine reading comprehension modeling, and belongs to the technical field of computer information screening. Background technique [0002] Large-scale datasets play an extremely important role in advancing a field of research. Several datasets have also been released in the field of machine reading comprehension, which has greatly facilitated research in this area. For example, for the SQuAD dataset, several machine reading comprehension models have outperformed human annotations. The largest data set in Chinese is DuReader, which is a large-scale human-annotated reading comprehension data set for the open field of the real world. The questions and passages in it are collected from search engines, and the answers are all artificial. label. [0003] In reading comprehension tasks, for a question, there may be multiple articles that can provide the necessary answer i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06F17/27
CPCG06F40/216G06F18/22G06F18/214
Inventor 杨沐昀张越李亚慧赵铁军徐冰郑德权曹海龙朱聪慧马晶义
Owner 黑龙江省工研院资产经营管理有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products