Sentence similarity calculation method, device and storage medium

A calculation method and similarity technology, applied in calculation, computer parts, semantic analysis, etc., can solve the problems of low calculation efficiency, neglect of word order relationship, low credibility, etc., and achieve the effect of improving calculation efficiency and

Active Publication Date: 2019-01-01
GUANGZHOU DUOYI NETWORK TECH +2
View PDF8 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. The string-based method lacks consideration of the importance of each component in the sentence, resulting in low credibility for the calculation results of sentence similarity;
[0006] 2. The method based on the bag-of-words model ignores the word order relationship in the sentence, and as the size of the sentence increases, it is easy to cause the disaster of dimensionality and the operation efficiency is low;
[0007] 3. The knowledge-based method relies on the construction of the knowledge base. Since the information in the knowledge base needs to be guaranteed to be effective and reliable, the update efficiency of the knowledge base is slow, resulting in low credibility for the calculation results of the similarity of sentences containing emerging vocabulary.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sentence similarity calculation method, device and storage medium
  • Sentence similarity calculation method, device and storage medium
  • Sentence similarity calculation method, device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0047] see figure 1 , is a schematic flowchart of a method for calculating sentence similarity provided in Embodiment 1 of the present invention. The calculation method includes steps S110 to S140.

[0048] S110. Obtain a sentence pair to be calculated; wherein, the sentences in the sentence pair to be calculated include a user sentence input by a user and a standard sentence stored in a corpus in advance.

[0049] Specifically, the user sentence input by the user ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a calculation method of sentence similarity, which relates to the field of sentence analysis. The method comprises the following steps: acquiring a sentence pair to be calculated; calculating a semantic contribution degree of each word in the pair of sentences to be calculated to obtain a weight vector of the user statement and a weight vector of the standard statement; comparing each word in each sentence of the sentence pair to be calculated with the word of another sentence to obtain inter-word similarity, and calculating a similarity vector of the user sentence anda similarity vector of the standard sentence according to the inter-word similarity; calculating a sentence similarity between the user statement and the standard statement according to a weight vector and a similarity vector of the user statement and a weight vector and a similarity vector of the standard statement. The embodiment of the invention also provides a sentence similarity device and astorage medium, which can effectively improve the calculation efficiency of sentence similarity calculation and the credibility of the calculation result.

Description

technical field [0001] The invention relates to the field of sentence analysis, in particular to a calculation method, device and storage medium for sentence similarity. Background technique [0002] In the development of chat robots, obtaining the same or similar sentences according to the sentences entered by the user is one of the core processes, and the realization of this core process depends on the calculation method of sentence similarity. [0003] In the prior art, there are generally three different implementation methods for calculating sentence similarity, including a string-based method, a bag-of-words model-based method, and a knowledge-based method. Among them, the character string is used as an independent knowledge unit, and the sentence similarity is calculated according to the similarity of the characters; the bag-of-words model is used to learn through the neural network, and the text similarity is calculated according to the information obtained from the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06K9/62
CPCG06F40/30G06F40/289G06F18/22G06F18/24
Inventor 徐波
Owner GUANGZHOU DUOYI NETWORK TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products