Textual similarity calculation method and device, and intelligent robot

A text similarity and similarity calculation technology, applied in the field of text processing, can solve problems affecting the service quality of chat robots, user experience, inaccurate similarity, inaccuracy, etc., to improve service quality and user experience, overcome The accuracy is not high, the effect of improving the calculation accuracy

Inactive Publication Date: 2018-11-06
北京玄一科技有限公司
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The problem is that each individual vocabulary cannot accurately express the original meaning of the corresponding text, which results in inaccurate similarity between the texts calculated using each vocabulary, for example, there are two texts: I like you and you Like me, the meanings of these two texts are completely different, but the vocabulary after word segmentation of the two texts is exactly the same, then the similarity of the two texts calculated by using the existing technology is 1, which is obviously inaccurate
Furthermore, since the calculation of text similarity in the prior art is not accurate enough, the replies that the chat robot pushes for the user based on the text similarity must not be accurate enough, which seriously affects the service quality of the chat robot and the user experience.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Textual similarity calculation method and device, and intelligent robot
  • Textual similarity calculation method and device, and intelligent robot

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0056] In one embodiment, a text similarity calculation method is provided, such as figure 1 As shown, the method includes the following steps:

[0057] 110. Obtain the longest common subsequence of the first text and the second text;

[0058] In this step, the first text and the second text are two texts that need to calculate the similarity;

[0059] The longest common subsequence (LCS Longest Common Subsequence) refers to the longest subsequence a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the invention provide a textual similarity calculation method and a textual similarity calculation device, and an intelligent robot. The embodiments of the invention comprise the stepsof: first obtaining the longest common subsequences of two texts, then calculating the intersection and the union of the vocabulary sets corresponding to the two texts, and calculating according to the obtained intersection and union to obtain a first similarity, calculating a second similarity by using the vocabulary sets corresponding to the longest common subsequences and the previously obtained union, and finally calculating according to the first similarity and the second similarity to obtain a target similarity of the two text. The above technical solution combines the longest common subsequences and each vocabulary in the text to calculate the similarity of the two texts, thereby effectively improving the calculation accuracy of the text similarity. Progressively, chat robots or intelligent robots may provide users with more accurate answers by using accurate text similarity, which improves the service quality of the chat machines or intelligent robots, and the user experience.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of text processing, and more specifically, relate to a text similarity calculation method and device, and an intelligent robot. Background technique [0002] Chatbot is a popular application driven by big data and artificial intelligence technology. During use, the user enters the chat content, that is, the user enters the question raised by the user, and the chatbot automatically generates the corresponding reply according to the question entered by the user. and give feedback to users. This artificial intelligence processing method can greatly improve service efficiency and user experience. There are currently many types of chatbots, such as Apple’s Siri, Microsoft’s Cortana and Xiaoice, Baidu’s Dumi, and Jingdong’s JIMI (JD, Instant Messaging Intelligence), and many others Chatbots, such as children's educational robots, vehicle control robots, etc. [0003] In the actual applicat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F40/194G06F40/289
Inventor 杨凯程李健铨蒋宏飞
Owner 北京玄一科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products