Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Sentence similarity calculation method and system, electronic equipment and storage medium

A technology of sentence similarity and calculation method, applied in computing, computer parts, electrical and digital data processing, etc., can solve the problems of long response time, insufficient data mining depth, and large amount of calculation, so as to deepen the mining depth and shorten the response. effect of time

Pending Publication Date: 2022-01-28
BEIJING MININGLAMP SOFTWARE SYST CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The embodiment of the present application provides a sentence similarity calculation method, system, electronic equipment and storage medium, to at least solve the problem of limited generalization ability, insufficient depth of data mining and large amount of calculation in existing methods when calculating sentence similarity , the problem of long response time in the online environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sentence similarity calculation method and system, electronic equipment and storage medium
  • Sentence similarity calculation method and system, electronic equipment and storage medium
  • Sentence similarity calculation method and system, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the purpose, technical solutions and advantages of the present application clearer, the present application will be described and illustrated below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application. Based on the embodiments provided in the present application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.

[0032] Obviously, the accompanying drawings in the following description are only some examples or embodiments of the present application, and those skilled in the art can also apply the present application to other similar scenarios. In addition, it can also be understood that although such development efforts may be complex and lengthy, for those of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a sentence similarity calculation method and system, electronic equipment and a storage medium. According to the technical scheme, the method comprises the steps: carrying out word segmentationon a first sentence and a second sentence with the similarity to be calculated, and training a word vector model; traversing all vocabularies in the first sentence and the second sentence, calculating word vectors of the vocabularies through the word vector model, and further calculating the similarity between the word vectors of all the vocabularies in the first sentence and the word vectors of all the vocabularies in the second sentence; determining a similarity value of each vocabulary in the first sentence relative to the second sentence based on a preset rule; averaging the similarity values of all vocabularies in the first sentence relative to the second sentence, and judging whether the first sentence and the second sentence are similar sentences or not according to the average value and based on a preset threshold value. According to the method and system, the problems that the generalization ability is limited, the data mining depth is not enough, the calculation amount is large and the response time in an online environment is long when the sentence similarity is calculated in an existing method are solved.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a sentence similarity calculation method, system, electronic equipment and storage medium. Background technique [0002] In the field of natural language processing technology, especially in the construction of intelligent question answering systems, the calculation of sentence similarity is an important issue. The existing calculation methods of sentence similarity mainly include methods based on statistics and methods based on deep learning. Statistics-based methods mainly include Levenstein distance (edit distance), BM25 algorithm, TFIDF calculation method, and TextRank algorithm. Statistics-based methods have limited generalization ability and insufficient depth of data mining; deep learning-based methods mainly include: DSSM (Deep Structured Semantic Models) model, bert model, etc., the traditional deep learning-based models have a large amoun...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06F16/332G06F40/211G06F40/284
CPCG06F40/211G06F40/284G06F16/3329G06F18/22
Inventor 刘伟硕
Owner BEIJING MININGLAMP SOFTWARE SYST CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products