Webpage junk detection method based on dynamic Bayesian model

A dynamic Bayesian, detection method technology, applied in the field of information security, can solve problems such as user rejection

Inactive Publication Date: 2011-11-16
NANJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, the current search engine optimization technology is used by many short-sighted people, using some improper means of search engine optim

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Webpage junk detection method based on dynamic Bayesian model
  • Webpage junk detection method based on dynamic Bayesian model
  • Webpage junk detection method based on dynamic Bayesian model

Examples

Experimental program
Comparison scheme
Effect test

example

[0056] The session instance is as follows: (URL number, whether it was clicked)

[0057] 011021

[0058] 002130

[0059] 002131

[0060] The first line of the conversation instance represents the first conversation, and 3 results are returned, which are 011021. Every two numbers form a group, the first number in each group indicates the number of the website, the second number indicates whether the corresponding website is clicked, "0" means not clicked, and "1" means clicked. The second line represents the second conversation, and the third line represents the third conversation, in the same way as above.

[0061] 3. Calculate attractiveness and satisfaction based on the session file and the dynamic Bayesian model proposed by the present invention

[0062] Step 1) Calculate formula 1-4 from the session file;

[0063] Step 2) Calculate formulas 5 and 6 of the former term and the latter term;

[0064] α i ( e ) = P ( C 1 j , . . . C i - 1 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a webpage junk detection method based on a dynamic Bayesian model, which relates to a method for detecting a cheating webpage. The webpage junk detection method mainly uses an improved dynamic Bayesian network model for modeling for click actions of users, and judges and identifies the cheating webpage; and a search engine query log records interactive information of the users and a search engine, wherein the content of the interactive information comprises the information including query terms, websites returned by the search engine, websites clicked by the users, timestamp and the like. Information including the clicked websites, a clicking order thereof and the like in the log reflects user preference. The webpage junk detection method models for the log click actions, and excavates a clicking causal relationship between the websites in a list sequence returned back by the search engine, thereby explaining which websites are considered to be associated with the query terms from the view of the users, and obtaining the relativity between the websites and the query from the view of the users; and the relativity is a connotative feedback, so that the cheating website is ranked low, and related websites are ranked higher.

Description

Technical field [0001] The invention relates to a method for detecting cheating webpages, which mainly adopts an improved dynamic Bayesian network model to model user click behaviors, and judge and identify cheating webpages, belonging to the field of information security. Background technique [0002] Search engines are a bridge to the current Internet, and a tool for netizens to find information of interest in a large number of web pages. Due to the huge user traffic on the Internet, this provides a huge potential market for advertising. The click-through rate of online advertisements as high as 3% or more can turn this potential object into a realistic advertising target object, which in turn leads to direct or indirect commodity purchase behavior. Compared with traditional advertising, the cost of this type of advertising is relatively low. As a result, a large number of small and medium-sized manufacturers who are eager to open the market but are unable to provide huge adv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 张卫丰常成成田先桃张迎周周国强许碧欢陆柳敏
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products