Interrogative sentence similarity measurement method based on deep convolutional neural network

A convolutional neural network and neural network technology, applied in the field of similarity measurement of questions, can solve problems such as low word segmentation accuracy, low accuracy, and damage to the overall semantics of questions, so as to avoid sentence separation and avoid value selection Inconsistent range effects

Inactive Publication Date: 2018-05-11
INSPUR FINANCIAL INFORMATION TECH CO LTD
View PDF5 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

In this new technology, it uses advanced techniques such as analyzers or machine learning algorithms to analyze data from large amounts of unstructured documents without any mistakes made during previous processing steps like tokenization. It achieves these technical results through specific methods called convolutional Neural Networks which can identify patterns within complex datasets with high accuracy.

Problems solved by technology

Technological Problem addressed in this patents relating to traditional search robot systems involves finding matches that match users' queries accurately despite their ambiguous or incorrect phrases being used frequently during conversations (such as technical problem). Current solutions involve complex algorithms like phrase analysis and machine learning techniques such as nearest neighbor searching.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Interrogative sentence similarity measurement method based on deep convolutional neural network
  • Interrogative sentence similarity measurement method based on deep convolutional neural network
  • Interrogative sentence similarity measurement method based on deep convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0047] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

[0048] Refer to Figure 1-Figure 3 The method for measuring the similarity of question sentences based on deep convolutional neural network of the present invention includes the following steps:

[0049] S1. Generate a raw corpus through the relevant pages of the knowledge domain, crawl the Chinese characters appearing in the raw corpus, and generate the corresponding character vector of each Chinese character;

[0050] S2. Replace each Chinese character in the question with a corresponding character vector to obtain the character vector set corresponding to the question; the character vector set is calculated by convolutional neural network to obtain the corresponding sentence meaning vector;

[0051] S3. The questions are combined in pairs, and the si...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an interrogative sentence similarity measurement method based on a deep convolutional neural network. The method comprises the following steps that: S1: through a knowledge domain related page, generating a raw corpus, crawling Chinese characters which appear in raw language materials, and generating a word vector corresponding to each Chinese character; S2: replacing each Chinese character in an interrogative sentence by the corresponding word vector to obtain a word vector set corresponding to the interrogative sentence, wherein the word vector set carries out calculation through the convolutional neural network to obtain a corresponding sentence meaning vector; and S3: carrying out pairwise combination on the interrogative sentences, and calculating the cosine function absolute values of the sentence meaning vectors corresponding to two interrogative sentences to obtain a similarity between two interrogative sentences. By use of the method, an individual-character analysis method is adopted to avoid an influence on subsequent analysis by word segmentation errors, the convolutional neural network takes the whole interrogative sentence as a whole to extractwhole sentence characteristics, and a sentence meaning splitting problem brought by using word similarity matrixes is avoided.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner INSPUR FINANCIAL INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products