Reading understanding choice question answering method based on data enhancement

A technology for reading comprehension and multiple-choice questions, which is applied in the field of answering multiple-choice questions for reading comprehension based on data enhancement. The effect of the dataset

Active Publication Date: 2021-05-11
SHANXI UNIV
View PDF9 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] At present, there are relatively few data enhancement methods proposed for machine reading comprehension, and there are no relevant literature introductions for data enhancement methods specifically for reading comprehension multiple-choice questions.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reading understanding choice question answering method based on data enhancement
  • Reading understanding choice question answering method based on data enhancement
  • Reading understanding choice question answering method based on data enhancement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] The specific embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0059] like figure 1 As shown, a method for answering multiple choice questions in reading comprehension based on data enhancement, including the following steps:

[0060] S1. Use the sliding window method to crop the background material of the multiple-choice reading comprehension questions;

[0061] S2. Standardize the background material, question stem and option data format of the reading comprehension multiple-choice questions;

[0062] S3. Using the TF-IDF method to extract candidate sentences from the perspective of word frequency, and obtain the sentence set X of the answer material;

[0063] S4. Use Bi-Attention to extract candidate sentences from the perspective of high-dimensional sentence vectors, and obtain the sentence set X of answer materials;

[0064] S5, merge and remove the sentence sets X and Y obtained by S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of natural language processing, in particular to a reading understanding choice question answering method based on data enhancement. The method comprises the following steps of: cutting a background material of a reading understanding choice question by utilizing a sliding window method; standardizing background materials, question stems and option data formats of reading understanding choice questions; using a TF-IDF method to extract answer candidate sentences from a word frequency perspective to obtain an answer material sentence set X; extracting answer candidate sentences from the angle of the high-dimensional sentence vector by utilizing Bi-Attention to obtain an answer material sentence set X; combining the sentence sets X and Y obtained in the steps S3 and S4 to obtain a candidate sentence set Z; expanding the candidate sentence set Z by utilizing an EDA strategy adaptive to reading understanding choice questions to obtain a final data enhancement candidate sentence set; and inputting the final data enhancement candidate sentence set into a BERT model for reading understanding choice question answer prediction.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to a method for answering multiple choice questions for reading comprehension based on data enhancement. Background technique [0002] In recent years, the task of machine reading comprehension has attracted extensive attention from scholars in the field of natural language processing at home and abroad, and has become one of the core tasks for evaluating intelligent systems based on natural language understanding. [0003] Machine reading comprehension mainly includes multiple choice questions and subjective question and answer questions. Among them, the reading comprehension multiple-choice questions are further divided into textual comprehension and fragment comprehension multiple-choice questions. The two aim to select the best answer from multiple options based on the "understanding" of the background material. Long and key information is extremely hidden, and the an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/216G06F40/279G06F40/289
CPCG06F40/216G06F40/279G06F40/289
Inventor 张虎张颖雷登斌潘邦泽杨陟卓李茹
Owner SHANXI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products