Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method for determining the online behavior category of network users

A network user and behavior technology, applied in the Internet field, can solve problems such as difficulty in user labeling, infeasibility, difficulty in class probability distribution, etc.

Inactive Publication Date: 2019-03-05
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, based on the behavior data of network users, it is very difficult to accurately predict the user's category probability distribution by relying on existing techniques.
On the one hand, the traditional classification method is to label the sample set, but because it is very difficult or even infeasible to label some users, the classifier based on it is often not accurate enough and the prediction effect is not ideal
On the other hand, the traditional clustering method can only divide users into one cluster, and a user may have multiple categories of behavior tendencies, so the classification method based on traditional clustering cannot reflect the real category distribution of network users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for determining the online behavior category of network users
  • A method for determining the online behavior category of network users
  • A method for determining the online behavior category of network users

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0049] The first embodiment of the present invention provides a method for determining the type of online behavior of a network user, such as figure 1 As shown, the method specifically includes the following steps:

[0050] Step S101: within a preset period of time, extract the online behavior characteristics of each network user to be tested, and form a user behavior characteristic matrix X through the quantitative method of the document vector space model according to the online behavior characteristics of all network users to be tested;

[0051] Specifically, the online behavior feature includes: a feature word marked based on the online behavior of the network user to be tested; the online behavior of the network user to be tested includes: the URL link clicked by the network user to be tested and the online search keywords.

[0052] According to the online behavior characteristics of all network users to be tested, the user behavior characteristic matrix X is formed by t...

no. 3 example

[0151] The third embodiment of the present invention provides a method for determining the type of online behavior of a network user, the method specifically includes the following steps:

[0152] Step S301: within a preset period of time, extract the online behavior characteristics of each network user to be tested, and form a user behavior characteristic matrix X through the quantitative method of the document vector space model according to the online behavior characteristics of all network users to be tested;

[0153]

[0154] Step S302: According to the user behavior characteristic matrix X, through the probabilistic latent semantic analysis method PLSA and EM algorithm, the behavior tendency set T and the "user-propensity" probability distribution matrix D are obtained;

[0155]

[0156] Each element vector in the behavior tendency set T represents each behavior tendency;

[0157]

[0158] Each row vector in the "user-propensity" probability distribution matrix ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention proposes a method for determining the online behavior category of network users, the method comprising: extracting the online behavior characteristics of each network user to be tested, and forming a user behavior characteristic matrix X through the quantization method of the document vector space model; The user behavior feature matrix X, through the probabilistic latent semantic analysis method PLSA and EM algorithm, obtains the behavior tendency set T and the "user-tendency" probability distribution matrix D; according to the user behavior feature matrix X, through the support vector machine SVM algorithm , to get the probability distribution matrix C of "characteristic word-category"; run T×C through matrix multiplication to get the mapping matrix M of "propensity-category"; run D×M through matrix multiplication to get the probability distribution matrix Y of "user-category"; according to any A probability distribution of network users to be tested in each category, and any network user to be tested is classified into the category with the highest probability value.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a method for determining the online behavior category of network users. Background technique [0002] A large number of cases show that the level of network security management can be effectively improved by using user behavior category information. However, based on the behavior data of web users, it is very difficult to accurately predict the class probability distribution of users by relying on existing techniques. On the one hand, the traditional classification method is to label the sample set, but because it is very difficult or even infeasible to label some users, the classifier based on it is often not accurate enough and the prediction effect is not ideal. On the other hand, traditional clustering methods can only divide users into one cluster, and a user may have multiple types of behavior tendencies. Therefore, traditional clustering-based classification methods ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L12/24H04L12/26H04L29/06H04L29/08
CPCH04L41/14H04L43/08H04L63/20H04L67/535
Inventor 李鹏霄杜翠兰任彦易立钮艳佟玲玲段东圣刘晓辉查奇文
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products