Sentence similarity calculation method and apparatus

A technology of sentence similarity and calculation method, which is applied in calculation, special data processing applications, instruments, etc., can solve the problems of large difference and low accuracy of sentence similarity, and achieve the effect of accurate calculation

Inactive Publication Date: 2015-12-23
BEIJING FOCUSEDU INT EDUCATION CONSULTATION
View PDF5 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If you don't consider the sentence structure, compared from the isolated words, there are only two important words "with" and "cat" between the two sentences. too low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sentence similarity calculation method and apparatus
  • Sentence similarity calculation method and apparatus
  • Sentence similarity calculation method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0031] figure 1 It is the flowchart of the sentence similarity calculation method of an embodiment of the present invention; figure 1 , the method includes:

[0032] S101: Obtain a vector corresponding to each word;

[0033] In the existing technology, the establishment of the synonym list is basically done manually by dictionaries. The workload is heavy, and there is a problem of limited coverage, and it is difficult to quantify the similarity between words. In order to overcome this problem and quickly obtain various The vector corresponding to the word, so that it is convenient to reflect the degree of approximation between the words by the cosine distance between th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a sentence similarity calculation method and apparatus and relates to the technical field of automatic correcting. The method comprises: acquiring a vector corresponding to each word; performing syntax analysis on two sentences to be compared so as to acquire words forming compositions of the two sentences; calculating a first cosine distance between the vectors corresponding to the words forming the corresponding compositions of the two sentences; and according to the first cosine distance, determining similarity between the two sentences. According to the method provided by the present invention, by performing syntax analysis on the sentences and structurally holding semantics of the sentences, similarity between the sentences are more accurately calculated; and in addition, the word vectors based on a neural network model are adopted to represent the words, thereby more accurately calculating similarity between the words and getting rid of restrictions of a near-synonym dictionary.

Description

technical field [0001] The invention relates to the technical field of automatic correction, in particular to a method and device for calculating sentence similarity. Background technique [0002] In the automatic correction of English composition, an important factor is to calculate the relevance of the composition. The existing technical method is mainly to match the English characters through the topic sentence of each paragraph of the composition and the topic of the composition. The number of identical English characters reflects the similarity between the topic sentence and the topic, thereby reflecting the degree of relevance of the composition. . But counting the same English characters is not general, and the same semantics can be expressed by different words. Therefore, a more cutting-edge method is to establish a list of synonyms and match them in combination with the list of synonyms. [0003] However, in the prior art, the matching between sentences is perform...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
Inventor 唐聪宋文略杨晓昊许轶肖迪
Owner BEIJING FOCUSEDU INT EDUCATION CONSULTATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products