Machine reading understanding method based on iterative screening and pre-training enhancement

A reading comprehension and pre-training technology, applied in reasoning methods, instruments, electrical and digital data processing, etc., can solve problems such as ignoring the multi-step problem of document reasoning attributes, unable to make full use of single-step reasoning, and the inability of pre-trained models to enhance performance. To achieve the effect of rational use and enhance the experience

Pending Publication Date: 2022-07-01
NANJING UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Some existing methods are mainly aimed at the problem of single-step reading comprehension, that is, the question can be answered only based on a single document, but these methods are difficult to solve such problems as "the main literary works of the Nobel Prize winners who once taught at Nanjing University". Multi-step reasoning questions that require multiple documents to answer
For multi-step reasoning question answering, the current main method uses a document filter to filter out documents related to the current question, and then trains a reader to find answers and supporting sentences from related documents, but this method ignores the multi-step The reasoning attributes between question documents cannot make full use of the results of single-step reasoning. At the same time, when solving multi-step reasoning problems, most methods directly use pre-trained models for training, and do not perform on simple single-step reading comprehension datasets. Training, so that the performance of the pre-trained model cannot be enhanced in the field of machine reading comprehension

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Machine reading understanding method based on iterative screening and pre-training enhancement
  • Machine reading understanding method based on iterative screening and pre-training enhancement
  • Machine reading understanding method based on iterative screening and pre-training enhancement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] The invention discloses a machine reading comprehension method based on iterative screening and pre-training enhancement. The method is applied to the need to adopt automatic means to improve the question and answer effect and improve the user's search or language question and answer experience.

[0070] like figure 1 As shown, the present invention provides a machine reading comprehension method based on iterative screening and pre-training enhancement, comprising the following steps:

[0071] Step 1. Use the HotpotQA dataset (References: Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William Cohen, Ruslan Salakhutdinov and Christopher D. Manning. HotpotQA: A dataset for diverse, explainable multi-hop questionanswering [C]. In EMNLP, 2018) Training the relevant document model, using whether it is a supporting document as a label and using equal sampling to obtain training positive and negative samples;

[0072] Step 2, use the pre-trained model to fine-tune the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a machine reading understanding method based on iterative screening and pre-training enhancement. The method comprises the following steps: constructing an iterative screening network; training an initial screening network by using problems and document pairs of a HotpotQA data set; finding out the document most relevant to the current document from the original data containing the interference document; and an iterative screening process: screening a support document most related to the initial screening network and the document from the current document set according to the document obtained by the initial network and the original problem, and obtaining a two-step support document related to the problem. Constructing a reader network: finely adjusting the SQuAD data by using a pre-training model to obtain a single-step reading understanding model; and carrying out two times of joint training on the multi-step reading understanding data set to obtain answers and support facts about the questions and the documents. According to the method, a better multi-step machine reading understanding effect can be realized without constructing a complex graph neural network and decomposing a multi-step problem.

Description

technical field [0001] The invention relates to a machine reading comprehension method, in particular to a machine reading comprehension method based on iterative screening and pre-training enhancement. Background technique [0002] In recent years, the rapid development of pre-training models based on large-scale unsupervised datasets has greatly improved the effects of many natural language processing tasks, providing sufficient power for the development of natural language processing. At the same time, with the rapid development of the Internet, information is growing at an explosive rate. Electronic information such as encyclopedias, news, life, books, etc. has gradually become an indispensable part of our life. We contact the world, know the world and understand the world from these electronic information. However, due to the rapid growth of information, it has become increasingly difficult to quickly and efficiently obtain the information we want from the complex data...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/35G06F40/216G06N5/04
CPCG06F16/3344G06F16/35G06F40/216G06N5/04
Inventor 杨育彬雷伟俊李昕宜
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products