An intelligent detection method for bad web pages based on deep belief network algorithm

A deep belief network-based intelligent detection technology, applied in the fields of network data retrieval, network data navigation, website content management, etc., that addresses the problems of degraded search engine user experience, reduced trust in search engines, and threats to Internet security.

Active Publication Date: 2022-04-12
INFORMATION & COMMNUNICATION BRANCH STATE GRID JIANGXI ELECTRIC POWER CO +1

AI Technical Summary

Problems solved by technology

[0004] Bad webpages greatly degrade the user experience of search engines and thereby reduce users' trust in them. They also pose a series of threats to the security of the entire Internet. For example, some websites carry viruses: after a user opens the corresponding webpage, the user's device may be infected or important personal information may be stolen.



Examples


Embodiment Construction

[0049] The technical solutions in the embodiments of the present invention are described clearly below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0050] Referring to Figures 1-4, an intelligent detection method for bad web pages based on a deep belief network algorithm comprises the following steps:

[0051] Step 1: Build a hierarchical structure model of discriminant indicators. To enrich the types of web page features and identify bad web pages more accurately, the content, link, quality, and hidden features of web pages are extracted and a corresponding bad web page discriminant index system is es...
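The hierarchy described in Step 1 can be pictured as a two-level index: the four feature categories named in the text, each holding individual indicators. The sketch below is only an illustration under assumptions. The category names come from the text, but every indicator name (`keyword_density`, `hidden_text_ratio`, and so on) and the helper `to_feature_vector` are hypothetical, since the source does not list the actual indicators.

```python
# Two-level index system: category -> indicator names.
# The categories follow the text; the indicator names are hypothetical placeholders.
INDEX_SYSTEM = {
    "content": ["keyword_density", "title_body_similarity"],
    "links": ["outlink_count", "external_link_ratio"],
    "quality": ["text_to_markup_ratio"],
    "hidden": ["hidden_text_ratio"],
}


def feature_names():
    """Return the flattened indicator names in a fixed order."""
    return [f"{category}.{name}"
            for category, names in INDEX_SYSTEM.items()
            for name in names]


def to_feature_vector(page_indicators):
    """Flatten per-category indicator values into one ordered vector.

    Indicators missing from `page_indicators` default to 0.0.
    """
    vector = []
    for category, names in INDEX_SYSTEM.items():
        values = page_indicators.get(category, {})
        for name in names:
            vector.append(float(values.get(name, 0.0)))
    return vector


if __name__ == "__main__":
    page = {"content": {"keyword_density": 0.3},
            "hidden": {"hidden_text_ratio": 0.5}}
    for name, value in zip(feature_names(), to_feature_vector(page)):
        print(name, value)
```

Keeping the hierarchy in one dictionary means the order of the flattened vector is defined in a single place, so the same ordering could be reused when building the sample set.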


Abstract

The invention discloses an intelligent detection method for bad web pages based on a deep belief network (DBN) algorithm, comprising the following steps: constructing a hierarchical structure model of discriminant indicators, in which, to enrich the types of web page features and identify bad web pages more accurately, the content, link, quality, and hidden features of web pages are extracted and a corresponding bad web page discriminant index system is established; building a sample set of bad web page discriminant indicators; and balancing the indicator set with the SMOTE algorithm. SMOTE is applied to the sample data set first so that the classification performance of the classifier is not dominated by the majority class in the sample data set. A DBN is then used as the classifier: the processed samples are fed to it as input to obtain the detection results, which verifies the efficiency of the classifier.
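The balancing step can be sketched as follows. This is a minimal, standard-library illustration of the general SMOTE idea (interpolating between a minority sample and one of its nearest minority neighbours), not the patent's implementation. The function names `smote` and `balance`, the neighbour count `k=5`, and the fixed `seed` are assumptions made for the example.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic minority samples by linear interpolation.

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority neighbours of `base`, excluding itself.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


def balance(majority, minority, k=5, seed=0):
    """Oversample the minority class until it matches the majority class size."""
    n_new = len(majority) - len(minority)
    if n_new <= 0:
        return list(minority)
    return list(minority) + smote(minority, n_new, k, seed)


if __name__ == "__main__":
    bad_pages = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]  # minority class
    normal_pages = [[5.0, 5.0]] * 10                  # majority class
    print(len(balance(normal_pages, bad_pages)))
```

Only the minority class is oversampled here, so the original samples stay untouched. The balanced set would then be passed to the classifier, which the abstract identifies as a DBN.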

Description

Technical field

[0001] The invention relates to an intelligent detection method for bad webpages, in particular to an intelligent detection method for bad webpages based on a deep belief network algorithm.

Background technique

[0002] With the development of science and technology, the Internet has grown rapidly, and search engines, as one of the main applications through which people use the Internet, have become indispensable for querying information. According to a report released by the China Internet Network Information Center, as of June 2017 the number of Internet users in China had grown to 751 million. As an essential part of the Internet, search engines are among the most widely used Internet applications and have gradually become an important channel for users to obtain and access Internet resources.

[0003] The user can send a query request through the browser. According to the user's request, the se...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/954, G06F16/958
Inventor: 邱日轩, 肖子洋, 付晨
Owner: INFORMATION & COMMNUNICATION BRANCH STATE GRID JIANGXI ELECTRIC POWER CO