Unlock instant, AI-driven research and patent intelligence for your innovation.

Machine reading understanding method for complex data

A technology for reading comprehension and complex data, applied in the field of natural language processing, it can solve problems such as long answers, long answer lengths, inconsistency of training objectives and evaluation indicators, etc., to achieve the effect of improving consistency and improving practicability

Active Publication Date: 2019-08-16
深圳智能思创科技有限公司
View PDF13 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The purpose of the present invention is to propose a solution to the problem that the length of the answer in the real data mentioned in the background technology is relatively long, and there is inconsistency between the training target and the evaluation index in the current machine reading comprehension prediction method for processing this type of data. A machine reading comprehension method for complex data, specifically an answer prediction method based on Gaussian distribution, to greatly improve the consistency between the two, and solve the noise problem and long answer problem in real application scenarios

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Machine reading understanding method for complex data
  • Machine reading understanding method for complex data
  • Machine reading understanding method for complex data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The technical solution of the present invention will be further described below in conjunction with the accompanying drawings.

[0030] The present invention is a machine reading comprehension method for complex data, such as figure 1 As shown, the specific steps are as follows:

[0031] S1. Preprocessing

[0032] The whole preprocessing module mainly preprocesses the original data, including word segmentation, counting word frequency and constructing vocabulary. like figure 2 As shown, the flow of the entire preprocessing module process is as follows:

[0033] S1.1 participle

[0034] The original data is mainly data expressed in natural language, and the data form is mainly an original article. In order to carry out follow-up work, the original article data needs to be segmented. Concretely, the present invention uses jieba word segmentation tool to carry out word segmentation to raw data;

[0035] S1.2 word frequency statistics

[0036] For the result after ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a machine reading understanding method for complex data, which comprises the following steps: S1, preprocessing: carrying out word segmentation on original data information, counting word frequencies in corpora, selecting vocabularies with higher word frequencies to form a vocabulary, and replacing words which are not in the vocabulary with special marks; S2, establishmentof a convolutional neural network-based paragraph ranking model, wherein the model is used for ranking a plurality of paragraphs in an article, and selecting the paragraph with the highest ranking foranswer extraction; and S3, an answer prediction method based on Gaussian distribution: improving an extraction method in a reading understanding task, and converting answer positioning using a classification method into a probability learning method based on Gaussian distribution. The invention provides a paragraph ranking model and Gaussian distribution answer prediction method for the noise problem and the long answer problem of a machine reading understanding task in a complex data set, and the practicability of the machine reading understanding task in a complex scene is effectively improved.

Description

technical field [0001] The invention relates to a machine reading comprehension method for complex data, which belongs to the technical field of natural language processing. Background technique [0002] In recent years, deep learning technology has developed rapidly, and has surpassed human level in areas such as image recognition, speech recognition and Go. Natural language processing is the core technology for realizing artificial intelligence. In the field of natural language processing, deep learning technology has also made important breakthroughs in machine translation and human-computer dialogue. As the core technology in the field of natural language processing, machine reading comprehension has achieved rapid development with the help of deep learning technology. [0003] Machine reading comprehension, as the name suggests, is to let the machine learn to read and understand articles, that is, to find answers from related articles for a given question. Machine rea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06N3/04
CPCG06F40/216G06F40/242G06F40/284G06N3/045Y02D10/00
Inventor 李舟军刘俊杰肖武魁王昌宝
Owner 深圳智能思创科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More