Personalized web page filtering method

A web page filtering technology, applied in the field of computer networks, that addresses two problems: the time complexity of existing methods cannot support real-time filtering, and existing web filtering technology cannot meet practical needs.

Active Publication Date: 2012-06-20
INST OF AUTOMATION CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

Because the training database is large, the similarity between the target web page and every web page in the library must be calculated each time a page is filtered, so the time complexity may not meet the needs of real-time filtering.
In addition, the filtering threshold in this method is domain-dependent, so it must be carefully tuned against specific filtering test results when implementing personalization, which is another limitation on the practical application of this method.
[0007] Despite extensive research, web filtering technology still cannot meet practical needs.
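
To make the cost described above concrete, the following is a minimal sketch of a similarity-based filter of the kind being criticized, not code from the patent. The bag-of-words representation, the cosine measure, and the names `term_vector`, `cosine`, and `should_block` are all assumptions chosen for illustration. The point is only that every target page triggers one similarity computation per library page, and that the decision depends on a hand-tuned `threshold`.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term counts (whitespace tokenization is an assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * \
        math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def should_block(target_text, library, threshold):
    """library is a list of (page_text, is_unwanted) pairs.

    The loop visits every library page, so filtering one target page
    costs as many similarity computations as there are library pages.
    """
    target = term_vector(target_text)
    best_score, best_label = 0.0, False
    for text, is_unwanted in library:
        score = cosine(target, term_vector(text))
        if score > best_score:
            best_score, best_label = score, is_unwanted
    # The threshold is the domain-dependent knob mentioned above.
    return best_label and best_score >= threshold
```

With a library of N pages, each call performs N similarity computations, which is the scaling behaviour that makes real-time use difficult for large N.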



Examples


Embodiment Construction

[0065] Hereinafter, various details involved in the technical solution of the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the described embodiments are only intended to facilitate the understanding of the present invention, and do not have any limiting effect on it.

[0066] The overall framework of the system in the embodiment of the present invention is shown in Figure 1. It consists of two parts: a personalized customization unit 1 and a web page browsing unit 2. The personalized customization unit 1 includes: an input module 11, first and second web page preprocessing and feature extraction modules 12 and 15, the Internet 13, an unlabeled training web page library 14, a supervised learning module 16, and a feature extraction and feature selection module 17. The web page browsing unit 2 includes: a target web page 21, a third web page preprocessing and feature extraction module 22, and a Bayesian classifier 23.
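
The two-unit structure can be sketched as follows. This is only an illustrative wiring under stated assumptions: the data flow (example pages and the unlabeled library go into the customization unit, which hands a classifier to the browsing unit) is inferred from the module list, and `extract_features`, `toy_learner`, `CustomizationUnit`, and `BrowsingUnit` are hypothetical names. The learner here is a deliberately trivial placeholder, not the learning procedure of the invention.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Dict, List

Features = Dict[str, int]
Classifier = Callable[[Features], bool]


def extract_features(page_text: str) -> Features:
    """Stand-in for preprocessing and feature extraction (modules 12, 15, 22)."""
    return dict(Counter(page_text.lower().split()))


def toy_learner(examples: List[Features], library: List[Features]) -> Classifier:
    """Placeholder for modules 16 and 17 plus classifier 23.

    Collects the words seen in the user examples and accepts a page that
    shares at least two of them. The unlabeled library is ignored by this
    toy; a real learner would use it.
    """
    interest_words = set().union(*examples)
    return lambda feats: len(interest_words & set(feats)) >= 2


@dataclass
class CustomizationUnit:
    """Unit 1: turns user example pages into a personalized classifier."""
    learner: Callable[[List[Features], List[Features]], Classifier]

    def build_classifier(self, example_pages: List[str],
                         unlabeled_pages: List[str]) -> Classifier:
        examples = [extract_features(p) for p in example_pages]
        library = [extract_features(p) for p in unlabeled_pages]
        return self.learner(examples, library)


@dataclass
class BrowsingUnit:
    """Unit 2: applies the classifier to each target page (21)."""
    classifier: Classifier

    def matches_user_interest(self, target_page: str) -> bool:
        return self.classifier(extract_features(target_page))
```

Keeping the learner as a pluggable callable separates the one-off customization step from the per-page browsing step, so only feature extraction and a single classifier call run while the user browses.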

[0067] The system autom...


Abstract

The invention relates to a web page filtering method that can be individually customized. It comprises the following steps: extracting features from the user's example web pages and from a training web page library; mining the attributes of the user's interest class through semi-supervised learning; performing feature extraction and feature selection for the user's interest class; and filtering web pages in a personalized way with a Bayesian classifier. The invention provides a novel example-driven web page filtering framework: users express their filtering needs through example web pages, which may be of any single type or a mixture of several types, and semi-supervised learning is used to build a web page filter that matches each user's individual needs. This overcomes the shortcomings of traditional web page filtering methods, which are limited to a single type or a few types of web pages and cannot be individually customized. The method is accurate, robust, and fast, and has excellent application prospects.
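
As a rough illustration of how user examples, an unlabeled library, and a Bayesian classifier can fit together, here is a minimal sketch. It is not the procedure claimed in the patent: the self-training loop (start by treating every unlabeled page as "not of interest", train, relabel, repeat) is one common semi-supervised heuristic swapped in for illustration, the feature selection step is omitted, and `tokenize`, `NaiveBayes`, and `self_train` are hypothetical names.

```python
import math
from collections import Counter


def tokenize(text):
    """Whitespace tokenization, assumed here for simplicity."""
    return text.lower().split()


class NaiveBayes:
    """Two-class multinomial naive Bayes with add-one smoothing."""

    def fit(self, docs, labels):
        self.counts = {True: Counter(), False: Counter()}
        self.n_docs = {True: 0, False: 0}
        for doc, label in zip(docs, labels):
            self.counts[label].update(tokenize(doc))
            self.n_docs[label] += 1
        self.vocab = set(self.counts[True]) | set(self.counts[False])
        return self

    def log_score(self, doc, label):
        total = sum(self.counts[label].values())
        # Smoothed log prior plus smoothed log likelihood of each known token.
        score = math.log((self.n_docs[label] + 1) /
                         (sum(self.n_docs.values()) + 2))
        for tok in tokenize(doc):
            if tok in self.vocab:
                score += math.log((self.counts[label][tok] + 1) /
                                  (total + len(self.vocab)))
        return score

    def predict(self, doc):
        """True if the page looks like the user's interest class."""
        return self.log_score(doc, True) > self.log_score(doc, False)


def self_train(examples, unlabeled, rounds=5):
    """Self-training: user examples are positives; unlabeled pages start
    as negatives and are relabeled by the current model each round."""
    labels = [False] * len(unlabeled)
    for _ in range(rounds):
        model = NaiveBayes().fit(examples + unlabeled,
                                 [True] * len(examples) + labels)
        new_labels = [model.predict(d) for d in unlabeled]
        if new_labels == labels:
            break
        labels = new_labels
    # Refit so the returned model reflects the final labels.
    return NaiveBayes().fit(examples + unlabeled,
                            [True] * len(examples) + labels)
```

Once trained, classifying a target page costs time proportional to the page's length rather than to the size of the training library, which is the practical appeal of a Bayesian classifier at browsing time.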

Description

Technical field

[0001] The invention relates to the field of computer network technology, in particular to web page filtering technology.

Background art

[0002] With its rapid development, the Internet has gradually become an important part of people's lives. People are becoming more and more dependent on the Internet, and the demand for web filtering is increasing accordingly. On the one hand, because the Internet is open, harmful information such as pornography, drugs, and violence also spreads on it. Such information seriously affects people's physical and mental health, especially that of young people, and endangers social stability. On the other hand, with the explosion of information and the rapid development of the Internet, the amount of information online is growing geometrically, yet for any given Internet user most of that information is useless or even spam. Therefore, how to keep what you are interested i...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 胡卫明, 朱明亮, 李玺, 吴偶
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI