Unlock instant, AI-driven research and patent intelligence for your innovation.

A bad webpage intelligent detection method based on a deep belief network algorithm

A deep belief network and intelligent detection technology, which is applied in network data retrieval, network data navigation, website content management, etc., can solve problems such as reduced search engine user experience, reduced search engine trust, and theft of important information

Active Publication Date: 2019-04-09
国网江西省电力有限公司信息通信分公司 +1
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Bad webpages greatly reduce the user experience of search engines, thereby reducing the trust of search engines, and also bring a series of threats to the security of the entire Internet, for example: some websites that carry viruses, when users open the corresponding Viruses or important personal information will be stolen after the webpage

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A bad webpage intelligent detection method based on a deep belief network algorithm
  • A bad webpage intelligent detection method based on a deep belief network algorithm
  • A bad webpage intelligent detection method based on a deep belief network algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The technical solutions in the embodiments of the present invention will be clearly described in conjunction with the accompanying drawings in the embodiments of the present invention; it is obvious that the described embodiments are only a part of the embodiments of the present invention, not all embodiments, based on The embodiments of the present invention and all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0050] see Figure 1-4 , a bad web page intelligent detection method based on deep belief network algorithm, comprising the following steps;

[0051] Step 1: Build a hierarchical structure model of discriminant indicators: In order to enrich the types of web page features and identify bad web pages more accurately, the content, links, quality and hidden features of web pages are extracted and a corresponding bad web page discriminant index system is es...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a bad webpage intelligent detection method based on a deep belief network algorithm. The method include the following steps: Constructing a hierarchical structure model of thediscrimination indexes: in order to enrich the types of webpage characteristics and more accurately identify bad webpages, extracting content, links, quality and hidden characteristics of the webpages, and establishing a corresponding bad webpage discrimination index system; setting the judgement index sample set of the bad webpage; Carrying out index set balancing processing based on an SMOTE algorithm. content of web page, links, quality and hidden features are extracted, and a corresponding bad webpage discrimination index system is established; the bad webpage discrimination index is reduced; A sample data set is subjected to balance processing operation by adopting the SMOTE technology, so that the classification effect of the classifier is not influenced by a plurality of types of samples in the sample data set, the DBN is used as the classifier, the processed samples are used as the input of the classifier to obtain the detection result, and the high efficiency of the classifieris verified.

Description

technical field [0001] The invention relates to an intelligent detection method for bad webpages, in particular to an intelligent detection method for bad webpages based on a deep belief network algorithm. Background technique [0002] With the development of science and technology, the Internet is also showing a trend of rapid development, and search engines, as one of the important applications for users to use the Internet, have become an indispensable and important component for users to query information. According to a report released by the China Internet Network Information Center, in June 2017, the number of Internet users in China had increased to 751 million. As an essential part of the Internet, search engines are one of the most widely used Internet applications, and have gradually become an important channel for users to obtain and access Internet resources. [0003] The user can send a query request through the browser. According to the user's request, the se...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/954G06F16/958
Inventor 邱日轩肖子洋付晨
Owner 国网江西省电力有限公司信息通信分公司