Global normalized reader systems and methods

A normalized and global technology, applied in inference methods, neural learning methods, instruments, etc., can solve problems such as high cost, no system performance training data, and limited document applicability.

Active Publication Date: 2018-11-02
BAIDU USA LLC
View PDF3 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, current extractive question answering methods face many limitations
First, computation is evenly distributed across the entire document regardless of answer position, making it impossible to ignore or focus computation on specific parts, limiting applicability to longer documents
Second, they generally rely on costly bidirectional attention mechanisms or must line up all possible answer spans (Span)
And third, although data extensions for question answering have been proposed, current methods still do not provide training data that can improve the performance of existing systems, nor do they allow explicit use of question nature or item types to control generation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Global normalized reader systems and methods
  • Global normalized reader systems and methods
  • Global normalized reader systems and methods

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0036] Given a document and a question, extractive question answering can be thought of as a search problem. figure 1 A question answering method according to an embodiment of the disclosure is depicted. In an embodiment, the sentence containing the answer span is selected (105); then, the first word of the answer span is selected (110), and finally the last word is selected (115).

[0037] The process described in figure 2 The example shown in uses the actual model's predictions. figure 2 A Global Normalized Reader (GNR) model implementation for answering questions according to an embodiment of the disclosure is depicted. In the depicted embodiment, the probabilities are global, normalized over the bundle, rather than local probabilities. It should be noted that the final predicted probability is greater than the first decision and the second decision, and the model is more reliable during the search process.

[0038] Initially, the higher score was attributed to the pi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Presented herein are systems and methods for question answering (QA). In embodiments, extractive question answering (QA) is cast as an iterative search problem through the document's structure: selectthe answer's sentence, start word, and end word. This representation reduces the space of each search step and allows computation to be conditionally allocated to promising search paths. In embodiments, globally normalizing the decision process and back-propagating through beam search makes this representation viable and learning efficient. Various model embodiments, referred to as Globally Normalized Readers (GNR), achieve excellent performance. Also introduced are embodiments of data-augmentation to produce semantically valid examples by aligning named entities to a knowledge base and performing swaps new entities of the same type. This methodology also improved the performance of GNR models and is of independent interest for a variety of natural language processing (NLP) tasks.

Description

technical field [0001] The present disclosure generally relates to systems and methods for computer learning that provide improved computer performance, features and usage. Background technique [0002] Question answering (QA) and information extraction systems have proven to be of great value in applications such as collection of medical information about drugs and genes, large-scale health impact research, or educational material development. [0003] Recent advances in neural network-based extractive question answering models are rapidly closing the gap with human performance in several benchmark QA tasks and providing a smarter, more responsive connection between information discovery and its usability in high-stakes decision-making , wherein the benchmark QA task is such as Stanford Question Answering Dataset (SQuAD), Microsoft Machine Reading Comprehension Dataset (MS MARCO) or NewsQA. [0004] However, current extractive question answering methods face many limitatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06N3/08G06N5/04G06F40/30G06N3/044G06N3/084G06N5/02G06N7/01
Inventor 乔纳森·赖曼约翰·米勒
Owner BAIDU USA LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products