Researcher pre-training method based on KL regularization under open domain questions and answers

A pre-training and questioning technology, applied in the field of conversational open-domain question answering, can solve problems such as difficulty in improving training effect, large semantic difference between positive and negative samples, and large content gap, so as to improve semantic understanding ability, improve training effect, and improve stability. sexual effect

Pending Publication Date: 2022-06-24
HANGZHOU DIANZI UNIV +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the subsequent reasoning stage, the representation of the question is a combination of historical questions (which can contain answers) and the current question. During training and reasoning, the inconsistent representation of the question makes the model unable to truly understand the dialogue question.
[0005] (2) The semantic difference between positive and negative samples is large, and it is difficult to improve the training effect
Due to the large gap between the related articles of other questions and the related articles of the current question in the same batch, it is relatively simple to distinguish positive and negative samples during training, while in the inference stage, it may be necessary to find the most relevant one among many articles with similar content. Article, not enough difficulty in training will lead to poor training effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Researcher pre-training method based on KL regularization under open domain questions and answers
  • Researcher pre-training method based on KL regularization under open domain questions and answers
  • Researcher pre-training method based on KL regularization under open domain questions and answers

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to clarify the technical solutions and working principles of the present invention, the embodiments disclosed in the present invention will be further described in detail below with reference to the accompanying drawings. This embodiment provides a retrieval pre-training method based on KL regularization under open domain question answering, such as figure 1 As shown, the method mainly includes the following steps:

[0025] Step 1, build training data:

[0026] Get the current question q from the OR-QuAC training dataset c and historical question and answer pairs Take the size of the historical dialogue window as w, and take the splicing of the question and the answer as the question, denoted as

[0027] q or =[CLS]q 1 [SEP]a 1 [SEP]q c-w [SEP]a c-w [SEP]…[SEP]q c-1 [SEP]a c-1 [SEP]q c [SEP].

[0028] A rewrite of the current question q rw is provided by the CANARD dataset, which replaces some demonstrative pronouns in the current problem with sub...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a searcher pre-training method based on KL regularization under open domain questions and answers. According to the method, training is carried out by using two forms of rewriting of the current question and splicing of the historical question (including answers) and the current question, and the retrieval results of the two question forms are pulled in by using KL divergence, so that the training stability is improved, and the semantic understanding capability of a question encoder on the questions is improved. According to the method, the difficult negative samples, namely the articles retrieved by the TF-IDF but not containing the correct answers, are introduced, so that the training effect of the retriever is improved, and the retriever can discriminate articles more strongly associated with the current question from a batch of articles with similar contents.

Description

technical field [0001] The invention relates to the field of conversational open domain question answering, in particular to a retrieval pre-training method based on KL regularization. Background technique [0002] In the field of open-domain question answering, a two-stage system is mainly used, which consists of two components: a retriever and a reader. First, the retriever finds N articles with high relevance to the question from a large number of Wikipedia articles in advance according to the dialogue question, and secondly, the reader finds the correct answer from these N articles according to the dialogue question. [0003] The present invention focuses on the retrieval device, and the traditional retrieval device can be realized by retrieval algorithms such as TF-IDF and BM25. These retrieval algorithms are based on the overlapping frequency of words, it cannot handle highly semantically related but little lexical overlap, and it is not trainable. In recent years, w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/33G06N20/00
CPCG06F16/3329G06F16/3344G06N20/00
Inventor 殷昱煜江艺璇梁婷婷陶志伟胡海胖李尤慧子李玉
Owner HANGZHOU DIANZI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products