Text semantic similarity analysis method

A technology of text similarity and analysis method, which is applied in the field of text similarity analysis based on semantic features, can solve the problems of inability to truly reflect the semantic relationship of words and the high complexity of matrix singular value decomposition, and achieve the effect of accurate and effective semantic relationship.

Active Publication Date: 2017-03-29
TONGJI UNIV
View PDF6 Cites 40 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First of all, the complexity of matrix singular value decomposition is very high. Secondly, the latent semantic model does not consider the real semantic relationship, but the potential semantics obtained by using the co-occurrence of words, which cannot truly reflect the semantic relationship of words in the text.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text semantic similarity analysis method
  • Text semantic similarity analysis method
  • Text semantic similarity analysis method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the purpose, technical solution and advantages of the present invention clearer, the text semantic similarity analysis method implemented according to the present invention will be further described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, and are not intended to limit the present invention, that is, the protection scope of the present invention is not limited to the following embodiments, on the contrary, according to the inventive concept of the present invention, those skilled in the art Appropriate changes can be made by those skilled in the art, and these changes can fall within the scope of the invention defined by the claims.

[0037] Such as figure 1 As shown in the structural block diagram, according to a specific embodiment of the present invention, the following steps are included:

[0038] 1) Term extrac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the text analysis field, particularly to a semantic characteristic-based text semantic similarity analysis method. According to the technical scheme, the similarity degree between texts is analyzed more accurately and effectively by calculation based on semantic relations of internal words of the texts. According to the method, shallow analysis on association relation between texts and between lexical items is performed through singular value decomposition; a lexical item-theme set is constructed by a bayesian network; the semantic similarity between the lexical items is calculated by mutual information and context; and finally, the text similarity is calculated through a graph structure. By adoption of the text semantic similarity analysis method, the semantic relation between texts can be measured and recognized more accurately and effectively.

Description

technical field [0001] The invention relates to the field of text analysis, in particular to a text similarity analysis method based on semantic features. Background technique [0002] With the rapid development of computer Internet, text similarity calculation has been widely used in many fields. For example, in machine translation, semantic similarity measures the quality of a machine translation by measuring how equivalent a reference translation is to the machine translation output. In addition, text similarity calculation is a basic and important work in the fields of information retrieval, sentiment analysis, text classification, text clustering, automatic question answering, semantic disambiguation, etc. [0003] As the carrier of natural language, text usually exists in an unstructured or semi-structured form. To calculate its similarity, it is necessary to convert unstructured text into structured information that is easy for computer recognition and processing. S...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/30
Inventor 王俊丽周晴杨亚星
Owner TONGJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products