Text similarity processing method

A text similarity processing method, applied in the field of text similarity processing, that addresses the low accuracy, complex calculation process, and high labor cost of existing methods.

Status: Inactive. Publication date: 2016-03-09
ZHEJIANG WANLI UNIV

AI Technical Summary

Problems solved by technology

[0005] The purpose of the embodiments of the present invention is to provide a text similarity processing method that solves the low accuracy, complicated calculation process, and high labor cost of existing text similarity calculation methods.

Embodiment Construction

[0040] The foregoing and other technical contents, features, and effects of the present invention will be presented clearly in the following detailed description of the preferred embodiments, with reference to the drawings. The description of specific implementations allows the technical means adopted by the present invention, and the effects it achieves, to be understood in more depth and detail. The attached drawings are provided for reference and description only and are not intended to limit the present invention.

[0041] The present invention proposes a text similarity processing method which, referring to Figure 1, includes the following steps:

[0042] S11. Collect Internet information, and generate metadata feature vectors.

[0043] S12. Construct a concept tree according to the generated metadata feature vector, and generate a subtree based on the constructed concept tree.

[0044] S13. Calculate the similarity between texts in the colle...
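
The three steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented procedure: the available text does not say how the metadata feature vector is built, how the concept tree is derived, or which measure compares subtrees. Here the feature vector is assumed to be a term-frequency count, the concept tree is a small hand-written child-to-parent map (`PARENT`, hypothetical), a text's subtree is taken to be the nodes on the paths from its terms to the root, and the similarity is assumed to be the Jaccard overlap of two subtrees.

```python
from collections import Counter

# Hypothetical concept hierarchy (child -> parent). The source does not
# specify how the concept tree is derived from the feature vectors.
PARENT = {
    "car": "vehicle",
    "automobile": "vehicle",
    "bicycle": "vehicle",
    "vehicle": "entity",
    "apple": "fruit",
    "banana": "fruit",
    "fruit": "entity",
}


def feature_vector(text):
    """S11 (assumed form): term-frequency vector over known concepts."""
    tokens = text.lower().split()
    return Counter(t for t in tokens if t in PARENT)


def subtree(vector):
    """S12 (assumed form): nodes on the paths from each term up to the root."""
    nodes = set()
    for term in vector:
        node = term
        while node is not None:
            nodes.add(node)
            node = PARENT.get(node)  # None once the root has been passed
    return nodes


def similarity(text_a, text_b):
    """S13 (assumed measure): Jaccard overlap of the two subtrees."""
    a = subtree(feature_vector(text_a))
    b = subtree(feature_vector(text_b))
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)
```

With this toy hierarchy, `similarity("car", "automobile")` is higher than `similarity("car", "apple")` because the first pair shares the `vehicle` ancestor, even though neither pair shares a keyword.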

Abstract

An embodiment of the invention provides a text similarity processing method comprising the following steps: collecting Internet information and generating a metadata feature vector; constructing a concept tree according to the generated metadata feature vector, and generating subtrees based on the constructed concept tree; and calculating the similarity between texts in the collected Internet information by means of the subtrees. The method avoids manual operation, which reduces labor cost, and achieves high accuracy.

Description

Technical field

[0001] The invention relates to an information processing method, in particular to a text similarity processing method.

Background

[0002] With the rapid growth of Internet information, obtaining the information users need from massive amounts of data is an important research topic. Text similarity calculation has been widely used, at home and abroad, in research on text mining, artificial intelligence, information retrieval, knowledge management, and related areas. At present, similarity calculation methods fall mainly into two groups: traditional methods based on statistics over large-scale text sets, and methods based on semantic calculation.

[0003] Methods based on large-scale text set statistics usually use the vector space model (VSM), the latent semantic indexing (LSI) model, and similar models. These methods are based on keywords for similarity calculation, without considering the semantic information of f...
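
The keyword-based limitation of the vector space model noted above can be seen in a small sketch. The code below is an illustrative assumption rather than anything taken from the patent: it represents each text as a raw term-frequency vector (no weighting such as TF-IDF) and compares texts by cosine similarity. The function name `cosine_similarity` and the whitespace tokenization are choices made for this example.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity of two texts under a plain term-frequency VSM."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    # Only terms present in both texts contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Because only shared keywords contribute, `cosine_similarity("buy a car", "purchase an automobile")` is zero even though the two phrases mean nearly the same thing.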

Application Information

IPC(8): G06F17/22
CPC: G06F40/194
Inventor: 梁丰, 王遵义, 翁时锋
Owner: ZHEJIANG WANLI UNIV