Calculation method and device for English sentence similarity

A method for calculating sentence similarity, applied in the field of methods and devices for calculating English sentence similarity. It addresses the problems of slow training, complex models, and very large parameter counts, with the effects of shortening training time, simplifying the training model, and achieving good test results.

Active Publication Date: 2020-09-25
BEIJING UNIV OF POSTS & TELECOMM
Cites: 3 · Cited by: 0

AI Technical Summary

Problems solved by technology

[0003] However, existing models for processing sentence similarity, whether an LSTM model or a hybrid of LSTM and CNN, are all deep neural networks. Their common shortcomings are a very large number of parameters, slow training, and model complexity.
Complex, large-scale networks may achieve good results, but their complexity directly restricts the practical application of such models.

Embodiment Construction

[0034] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0035] Before the calculation method and device for English sentence similarity are introduced, the importance of sentence similarity calculation is briefly described.

[0036] In language processing and research such as question answering systems, information retrieval, plagiarism detection, and speech recognition, sentence similarity calculation is a basic and core task. At present, with the rise of neural networks, a large number of methods based on recurrent neural networks and convolutional neural networks have been applied t...

Abstract

The invention discloses a method and device for calculating the similarity of English sentences. The method includes the following steps: obtaining English sentences; obtaining a word vector sequence from each English sentence and expanding the dimension of the word vector sequence to obtain a three-dimensional tensor sentence feature model; inputting the three-dimensional tensor of the sentence feature model, as multi-channel image data, into a convolutional neural network model to extract feature vector groups; and calculating the similarity of the English sentence pair based on the cosine distance of the feature vector groups. The method calculates sentence similarity with a neural network model that has a simple structure, a short training time, and good convergence, so the training model is effectively simplified, the training time is shortened, and the test effect is good.
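
The steps in the abstract can be sketched in a few lines of Python. This is a minimal, untrained illustration under stated assumptions, not the patented implementation: the toy embedding table, the constants (`EMBED_DIM`, `CHANNELS`, `MAX_LEN`, `NUM_FILTERS`), the way each word vector is sliced into channels, and the single random-weight convolution layer are all hypothetical choices, since the text here does not specify them. It also produces one feature vector per sentence rather than a group of feature vectors.

```python
import math
import random

EMBED_DIM = 8        # hypothetical word-vector size
CHANNELS = 2         # hypothetical channel count after dimension expansion
MAX_LEN = 6          # sentences are padded or truncated to this many tokens
NUM_FILTERS = 4      # hypothetical number of convolution filters
KERNEL_H, KERNEL_W = 2, 2

_rng = random.Random(0)
_embeddings = {}     # toy lookup table standing in for pretrained word vectors

# Random, untrained kernels indexed as [filter][channel][row][col].
_kernels = [
    [[[_rng.uniform(-1.0, 1.0) for _ in range(KERNEL_W)]
      for _ in range(KERNEL_H)]
     for _ in range(CHANNELS)]
    for _ in range(NUM_FILTERS)
]


def word_vector(word):
    """Return a fixed pseudo-random vector per word (stand-in for real embeddings)."""
    if word not in _embeddings:
        _embeddings[word] = [_rng.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)]
    return _embeddings[word]


def sentence_to_tensor(sentence):
    """Word-vector sequence -> tensor of shape (CHANNELS, MAX_LEN, EMBED_DIM // CHANNELS)."""
    tokens = sentence.lower().split()[:MAX_LEN]
    rows = [word_vector(t) for t in tokens]
    rows += [[0.0] * EMBED_DIM] * (MAX_LEN - len(rows))  # zero-pad short sentences
    width = EMBED_DIM // CHANNELS
    # Assumed expansion: slice each word vector into CHANNELS equal parts.
    return [[row[c * width:(c + 1) * width] for row in rows] for c in range(CHANNELS)]


def extract_features(tensor):
    """One valid multi-channel convolution, then ReLU and global max pooling per filter."""
    height, width = len(tensor[0]), len(tensor[0][0])
    features = []
    for kernel in _kernels:
        best = 0.0  # starting at zero folds the ReLU into the max pooling
        for i in range(height - KERNEL_H + 1):
            for j in range(width - KERNEL_W + 1):
                total = sum(
                    kernel[c][di][dj] * tensor[c][i + di][j + dj]
                    for c in range(CHANNELS)
                    for di in range(KERNEL_H)
                    for dj in range(KERNEL_W)
                )
                best = max(best, total)
        features.append(best)
    return features


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def sentence_similarity(s1, s2):
    """Similarity of a sentence pair from the cosine of their feature vectors."""
    f1 = extract_features(sentence_to_tensor(s1))
    f2 = extract_features(sentence_to_tensor(s2))
    return cosine_similarity(f1, f2)
```

Calling `sentence_similarity("a man is playing a guitar", "a person plays a guitar")` returns a score between 0 and 1 in this sketch, because the ReLU features are non-negative. With untrained random filters the score carries no semantic meaning; the example only shows how data flows from word vectors to a tensor, through the convolution, and into the cosine comparison.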

Description

Technical field

[0001] The invention relates to the technical field of text processing, in particular to a method and device for calculating English sentence similarity.

Background

[0002] There are roughly three types of word-vector-based neural network models related to the sentence similarity problem. One uses an LSTM (Long Short-Term Memory) network as the main model to process sentence similarity: first, the words in the sentence are represented by their corresponding word vectors; then the word vectors are input in turn into the LSTM network for further processing; finally, the similarity of the sentences is obtained. Another uses a CNN (Convolutional Neural Network) to classify sentences. To handle the sentence classification problem, a category must be set for each sentence, and the labeled sentences are used to train the model. The word vector is a two-dimensional matrix co...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/205, G06F40/284, G06N3/04
CPC: G06F40/205, G06F40/284, G06N3/045
Inventor: 姚海鹏, 刘惠文, 张培颖
Owner: BEIJING UNIV OF POSTS & TELECOMM