Supercharge Your Innovation With Domain-Expert AI Agents!

Extraction type machine reading understanding method integrated with multiple paragraph information

A reading comprehension and extraction technology, applied in the field of extraction machine reading comprehension, can solve the problems of long scientific papers, cannot effectively handle the length, and needs to be improved, and achieves the effect of improving the effect and good generalization.

Pending Publication Date: 2021-12-24
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the technical defects that the existing machine reading comprehension model cannot effectively process long scientific papers due to the limitation of the input length, and the semantic similarity performance of the generated text needs to be improved even if the input length requirement is met. An extractive machine reading comprehension method for multiple paragraphs of information, the method can automatically read scientific and technological papers, and answer questions such as "what is the motivation of this paper (Motivation)", "what is the model like (Model)", "what is the experimental result?" Questions such as "Experiment", "What conclusion did the researchers draw (Conclusion)", and finally integrate all the answers into a complete paper explanation to help researchers obtain literature summaries with high semantic similarity, so as to quickly understand the paper Content, keeping up with the latest developments in the field

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extraction type machine reading understanding method integrated with multiple paragraph information
  • Extraction type machine reading understanding method integrated with multiple paragraph information
  • Extraction type machine reading understanding method integrated with multiple paragraph information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0037] This embodiment describes the steps of constructing the data set, the statistical information of the data set, the complete algorithm flow, model parameters and experimental results.

[0038] (1) Data construction stage

[0039] In order to better evaluate the performance of the present invention and prior art in answering scientific paper questions, the present invention constructs a data set for testing, which contains 200 pieces of data in total, and the construction process is divided into the following steps:

[0040] Step A: Use a crawler to crawl on the paperweekly website (the link to the paper, the link to the paper explanation) and save it in the database;

[0041] Step B: Formatting and processing the thesis explanation;

[0042] Step B.1: According to the link of the paper explanation, manually screen and remove the data with problems such as layout confusion, too short content, and incorrect text;

[0043] Step B.2: 8 senior students from the School of Co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an extraction type machine reading understanding method integrated with multiple paragraph information, and belongs to the technical field of reading understanding in natural language processing. A reading understanding system depended on the extraction type machine reading understanding method integrating the multiple pieces of paragraph information comprises a paragraph scoring device, a paragraph reader and an answer selector. The method comprises the following steps: S1, the paragraph scoring device obtains the possibility that paragraphs contain correct answers according to the correlation degree of questions and paragraphs; s2, the paragraph reader extracts the most possible N answers from the paragraphs according to the questions, and quantifies the possibility that the N answers are correct answers; and S3, results of the paragraph reader and the paragraph scoring device are fused by an answer selector, and thus multiplying the answer by the possibility of the paragraph where the answer is located to obtain the most possible answer in the whole article. The method gets rid of the limitation of the input length in the prior art, and can help the user to read and understand on the scientific research paper, so that the content of the paper is quickly known, and the latest progress in the field is closely followed.

Description

technical field [0001] The invention relates to an extractive machine reading comprehension method incorporating multiple paragraph information, and belongs to the technical field of reading comprehension in natural language processing. Background technique [0002] Machine reading comprehension is a technique that enables computer systems to understand the semantics of an input text and answer related questions. Because the task of reading comprehension can properly evaluate the computer system's ability to understand natural language, it has always been a research hotspot in the field of natural language processing technology. With the introduction of large-scale data sets, it is possible to train deep neural networks. For example, most mainstream machine reading comprehension methods use the SQUAD dataset proposed by Stanford University in 2016 for training and evaluation. [0003] On the other hand, since entering the 21st century, human science and technology have bee...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/211G06F40/242G06F40/253G06F40/284G06N3/04G06N3/08
CPCG06F40/211G06F40/284G06F40/253G06F40/242G06N3/084G06N3/047G06N3/044G06N3/045
Inventor 毛先领熊婧雯黄河燕
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More