Method for detecting and tracking topics of online forum

An online discussion and topic detection technology, applied in the field of computer networks, can solve the problems of complex discussion areas, high real-time requirements of algorithms, and failure of detection and tracking methods to achieve good results, so as to achieve broad application prospects and reduce impact Effect

Inactive Publication Date: 2010-06-23
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] So far, no research has proposed a more effective solution algorithm for topic detection and tracking in the discussion area. Based on the above analysis and experiments, the existing topic detection and tracking methods for news reports cannot achieve good results in the content of the discussion area. Effect
At the same time, due to the extensive and complex content of the discussion area, the requirements for the real-time performance of the algorithm are also very high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for detecting and tracking topics of online forum
  • Method for detecting and tracking topics of online forum
  • Method for detecting and tracking topics of online forum

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The following is a detailed description of each detail problem involved in the technical solution of the invention.

[0019] Main features of the present invention are:

[0020] 1) A post information classifier is used to filter out invalid posts. A large number of uninformative posts in the discussion area will bring a lot of noise to topic detection and tracking, and the information classifier can filter out such posts to a large extent and improve the operation effect of the system;

[0021] 2) Analyze user behavior. In addition to using traditional content text analysis, the method of the present invention analyzes the behavioral characteristics of users in the discussion area in combination with the characteristics of the discussion area;

[0022] 3) Results of content text and user behavior analysis using a two-layer fusion framework. Aiming at the differences between content text analysis and user behavior analysis, the method of the present invention uses a t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of computer network and discloses a method for detecting and tracking topics of an online forum. The method comprises the following steps: adopting an HTML parsing module to pretreat posts in the forum and reconstruct clews; utilizing the information measurement module of the posts and clews to check the information degree of a new post and associated clews and update the eigenvector of the clews; analyzing the content text of the clews in a clew database; analyzing the user behavior of the clews in the clew database; and integrating the analytic results of the content text and the user behavior of the clews to judge the topic category of the clews. Given the complexity of the online forum, the invention greatly solves the detecting and tracking problems of the online forum by the method for integrating the content and the user behavior and has good application prospects.

Description

technical field [0001] The invention relates to the technical field of computer networks, in particular to information retrieval technology in online discussion areas. Background technique [0002] With the rapid development of the Internet (the Internet), it has gradually become an important part of people's lives. In the era of Web 2.0, users of the Internet have changed from information receivers to publishers of information. The interaction of the Internet is becoming stronger and stronger, and the online discussion area is currently one of the most popular interactive applications on the Internet. , various forums, BBS, etc. on the Internet are typical examples of online discussion areas. Usually, users can speak freely and express their opinions in the online discussion area, so the information in the discussion area is in a state of mixed and disordered content semantically, which brings great difficulties to information processing and retrieval. Big challenge: On t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30H04L12/18
Inventor 胡卫明朱明亮吴偶
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products