Internet forum user interest modeling method based on uncertainty treatment

A technology of user interest and uncertainty, applied in the field of modeling of user interest in Internet forums, can solve problems such as loss, failure to meet the requirements of user interest modeling and analysis, and inability to explain the ambiguity of interest, etc., to achieve accurate calculation Effect

Inactive Publication Date: 2010-06-02
FUDAN UNIV
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. Describing user interests purely in a probabilistic manner can only explain the randomness of users’ interest in a certain topic, but not the ambiguity of this interest, and ambiguity is an important aspect for people to analyze and understand user interests
[0005] 2. There are great differences in the number of times and lengths of postings or replies by users of online forums, which reflects the differences in user interests to a certain extent. However, the existing models only express relevant t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Internet forum user interest modeling method based on uncertainty treatment
  • Internet forum user interest modeling method based on uncertainty treatment
  • Internet forum user interest modeling method based on uncertainty treatment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] (1) Download all page files in a period of time from Internet forums. Usually these files contain information such as the user's posting time, post title, post content, etc., but these information are surrounded by various HTML tags.

[0023] Preprocessing of page files: use Web information extraction technology to analyze these files, so as to convert the information of a post page into a structured set of user post records, each record contains (post time, post title, post person, post content, reply flag).

[0024] (2) Select the original post (the reply flag is false) from the user post record set, and all reply records under this post (the reply flag is true) to form a temporary post set. The basic requirement is that the temporary post set must have a specified user.

[0025] Extract the content of the original post and title, and use the existing word segmentation method with part-of-speech tagging to segment these contents into individual words, and only keep ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of internet user behavioral analysis, in particular to an internet forum user modeling method based on certainty, which comprises the steps of: introducing a subordinate function for expressing user interests during the modeling, computing parameters of the function by adopting a similar Gaussian type subordinate function based on behavioral characteristics of a user in the forum; extracting a user interest text by adopting a text processing method, computing user interest text vectors according to different weight configurations; establishing a user interest model in a higher-dimensional space formed by the text vectors and a subordinate function discourse; and describing interest distribution of the user in different topic spaces by adopting a probability density function. The model established by the invention can reflect the vagueness of the user interests and express the randomness of the user interests, and enable the expression of the user interests to more approach the user requirement, thereby being more reasonable and being used for various analysis occasions based on internet user interests.

Description

technical field [0001] The invention belongs to the technical field of network user behavior analysis, and in particular relates to a modeling method oriented to user interests of network forums. Background technique [0002] With the rapid promotion of the application of Web2.0 on the Internet, many interactive forum websites have appeared. These websites gather a large number of Internet users, where they publish posts, reply to posts, and show different interests in posts on different topics. For many commercial applications, accurately discovering user interests and discovering more interest groups is the primary condition for successful business development. Therefore, it is an effective way for commercial applications to acquire user groups by making full use of the behaviors of users posting or replying to posts in these forum websites to tap users' interests. [0003] At present, people's research on the interests of Internet users mainly focuses on the user's sear...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06Q30/00G06Q30/02
Inventor 曾剑平吴承荣
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products