Method for computing semantic similarities among short texts

A semantic similarity calculation technique, applied in computing, special data processing applications, instruments, and related fields, that addresses problems such as poor capture of text keywords and neglect of word weights.

Active Publication Date: 2014-10-15
XIAMEN TUITE INFORMATION TECH

AI Technical Summary

Problems solved by technology

The prior-art method fully considers semantic ambiguity but ignores the weights of words in the text, so it captures the text's keywords poorly.

Embodiment Construction

[0043] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0044] Figure 1 is a flow chart of the method for short text semantic similarity calculation in the present invention.

[0045] The embodiment of the present invention provides a method for calculating the semantic similarity of short texts, which includes the following steps:

[0046] 1) Extract the features of the short text;

[0047] 2) Mat...

Abstract

The invention provides a method for computing semantic similarities among short texts. The method includes the steps of: 1) extracting features of the short texts; and 2) matching the extracted features with one another and computing the semantic similarities of the short texts. The method takes full account of both semantic ambiguity and the weights of terms in the texts, and can therefore capture the keywords of the texts accurately.
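The two steps named in the abstract can be pictured with a minimal Python sketch. The excerpt does not give the patent's actual feature extraction or matching formulas, so everything here is an assumption made for illustration: whitespace tokenization, smoothed TF-IDF term weights, cosine similarity as the matching score, and the hypothetical helper names `build_idf`, `extract_features`, and `similarity`. It only shows how term weights can enter a short-text similarity score; it does not model the semantic (meaning-level) matching the patent refers to.

```python
import math
from collections import Counter


def build_idf(corpus):
    """Compute smoothed inverse document frequencies from a small corpus.

    Assumption: the patent's real weighting scheme is not shown in the
    excerpt; smoothed IDF is used here only as a stand-in for term weights.
    """
    n_docs = len(corpus)
    doc_freq = Counter()
    for doc in corpus:
        # Count each term once per document.
        doc_freq.update(set(doc.lower().split()))
    return {t: math.log((1 + n_docs) / (1 + d)) + 1.0 for t, d in doc_freq.items()}


def extract_features(text, idf):
    """Step 1 (illustrative): map a short text to a {term: weight} dict."""
    counts = Counter(text.lower().split())
    # Weight = term frequency * IDF; terms missing from the corpus get IDF 1.0.
    return {term: tf * idf.get(term, 1.0) for term, tf in counts.items()}


def similarity(feat_a, feat_b):
    """Step 2 (illustrative): cosine similarity of two weighted feature dicts."""
    shared = set(feat_a) & set(feat_b)
    dot = sum(feat_a[t] * feat_b[t] for t in shared)
    norm_a = math.sqrt(sum(w * w for w in feat_a.values()))
    norm_b = math.sqrt(sum(w * w for w in feat_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # An empty text is treated as similar to nothing.
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    corpus = ["travel to xiamen", "travel plans", "movie review"]
    idf = build_idf(corpus)
    a = extract_features("travel to xiamen", idf)
    b = extract_features("xiamen travel", idf)
    print(round(similarity(a, b), 3))
```

In this sketch, a rarer term such as "xiamen" contributes more to the score than a common one such as "travel", which is one simple way for word weights to influence which keywords dominate the comparison.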

Description

Technical field

[0001] The invention relates to the technical field of text mining, and in particular to a method for calculating the semantic similarity of short texts.

Background technique

[0002] Every day, people of different ages and occupational backgrounds comment on or share domestic and foreign news, film and television entertainment, personal life, and other topics on Weibo. At present, the classification of microblog topics depends entirely on users manually adding hashtags to microblog content with the "#" symbol, and posts are grouped into common topics by the simplest string matching method. In this scenario, any two strings that do not match exactly are treated as different topics. For example, two hashtags with the same semantics, "to travel" and "to travel", will be treated as different topics because the strings cannot be matched. Or, if the user does not add hashtags to the Weibo content, then this Weibo becomes an isol...
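The limitation of exact string matching described in the background can be illustrated with a short Python sketch. The `#topic#` delimiter pattern, the sample posts, and the helper name `group_by_exact_hashtag` are assumptions made for illustration and do not come from the patent text.

```python
import re
from collections import defaultdict

# Assumption: hashtags are written as #topic#, delimited by a pair of "#" symbols.
HASHTAG = re.compile(r"#([^#]+)#")


def group_by_exact_hashtag(posts):
    """Group posts into topics by exact hashtag string (hypothetical helper)."""
    topics = defaultdict(list)
    untagged = []
    for post in posts:
        tags = HASHTAG.findall(post)
        if not tags:
            # A post without any hashtag cannot be assigned to a topic.
            untagged.append(post)
        for tag in tags:
            topics[tag.strip()].append(post)
    return dict(topics), untagged


if __name__ == "__main__":
    posts = [
        "#travel# Xiamen this weekend",
        "#go traveling# packing now",
        "nice weather today",
    ]
    topics, untagged = group_by_exact_hashtag(posts)
    print(sorted(topics))
    print(untagged)
```

Under exact matching, the two tagged posts fall into separate topics even though their tags mean the same thing, and the untagged post belongs to no topic at all.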

Application Information

IPC(8): G06F17/27
Inventor: 洪志令, 吴梅红
Owner: XIAMEN TUITE INFORMATION TECH