A machine reading understanding model training method and device based on answer sampling

A reading comprehension and machine technology, applied in the field of machine reading comprehension model training based on answer sampling, can solve problems such as inconsistencies in evaluation indicators, achieve the effect of reducing the amount of data processing and extracting accurate answers from the model

Active Publication Date: 2019-04-16
安徽省泰岳祥升软件有限公司
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, the above-mentioned method of using the likelihood function to maximize the probability of the observed training samples as the goal of the training phase will have the problem of inconsistency with the evaluation indicators used in the testing phase.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A machine reading understanding model training method and device based on answer sampling
  • A machine reading understanding model training method and device based on answer sampling
  • A machine reading understanding model training method and device based on answer sampling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatuses and methods consistent with aspects of the invention as recited in the appended claims.

[0046] The method for training a machine reading comprehension model based on answer sampling provided in this embodiment is mainly applied to a model whose output answer is presented in the form of a start position and an end position in a document. Take the reading comprehension model QAnet jointly launched by the Google Brain team and Carnegie Mellon University (CMU) as an example. The model ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a machine reading understanding model training method and device based on answer sampling. The method includes: Specifically, when machine reading understanding model training is carried out, utilizing a maximum likelihood estimation function to train a target machine reading understanding model, obtaining a basic machine reading understanding model , then utilizing a minimum risk training loss function to continue to train the basic machine reading understanding model so as to achieve fine adjustment of the basic machine reading understanding model parameters, and continuing to optimize the model. Due to the fact that the minimum risk training loss function is adopted to optimize the basic model trained through the maximum likelihood estimation function, answers canbe extracted through the trained model more accurately. In addition, when the minimum risk training loss function is used for loss calculation, k-is carried out on the output answer of the model; Andloss calculation is carried out on answers obtained by top dynamic sampling, so that the data processing amount can be reduced.

Description

technical field [0001] The present application relates to the technical field of natural language processing, in particular to a method and device for training a machine reading comprehension model based on answer sampling. Background technique [0002] At present, deep learning has achieved fruitful results in the fields of image recognition and speech recognition. Machine Reading Comprehension (MRC) has become a new hotspot in the field of artificial intelligence research and application. Its main function is to read and understand a given article or Context to automatically give answers to relevant questions. [0003] With the development of machine reading comprehension technology, the task of reading comprehension is also constantly upgrading, from the early "cloze form" to "single document reading comprehension" based on Wikipedia, such as Stanford SQuAD (Stanford Question Answering Dataset, Stanford Question Answering Dataset) as the task of the dataset. And further...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/08G06F16/332
CPCG06N3/08
Inventor 李健铨刘小康陈夏飞晋耀红杨凯程陈玮张乐乐董铭慆
Owner 安徽省泰岳祥升软件有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products