Internet forum message content similarity measuring method and system

A measurement method and similarity technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as non-compliance with network control, inaccurate measurement results, etc.

Inactive Publication Date: 2018-07-24
中国人民解放军火箭军工程大学
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Therefore, the content similarity between two texts is measured according to the cosine distance. Due to the symmetry of the cosine distance, the influence of the r...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Internet forum message content similarity measuring method and system
  • Internet forum message content similarity measuring method and system
  • Internet forum message content similarity measuring method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts all belong to the protection scope of the present invention.

[0044] The purpose of the present invention is to provide a method and system for measuring content similarity of network forum messages that can improve measurement accuracy.

[0045] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0046] Such as fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an internet forum message content similarity measuring method and system. In the method, the content similarity between test text message dk and reference text message d is calculated according to the test state vector Sk={s1k, s2k, ..., sMk} and the reference state vector S={s1, s2, ..., sM}. The content similarity between the test state vector Sk={s1k, s2k, ..., sMk} andthe reference state vector S={s1, s2, ..., sM} is not symmetrical, that is to say, the value of the content similarity between the two text messages is related to semantic features of the two text messages and also related to the selection of a reference message, the internet forum sensitive information control requirements are better met, and the measurement accuracy of the internet forum messagecontent similarity is improved.

Description

technical field [0001] The invention relates to the field of network public opinion management and control, in particular to a method and system for measuring content similarity of network forum messages. Background technique [0002] The method for measuring the similarity of text content in network forum messages in the prior art, the main technical idea is to establish a vector space model of the text to describe the content characteristics of the text, and measure the content by calculating the cosine distance between two text feature vectors similarity. [0003] In the prior art, the method for measuring the similarity of content by calculating the cosine distance between two text feature vectors is mainly characterized in that the cosine distance is symmetric, for example, there are two contents of text message A and text message B, and the text The content of message A is C A , the content of text message B is C B , when taking text message A as the benchmark, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/3347G06F40/216G06F40/30
Inventor 姚俊萍李晓军沈涛李新社
Owner 中国人民解放军火箭军工程大学
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products