Personalized web page filtering method

A web filtering and web technology, applied in the field of computer networks, can solve the problems that web filtering technology cannot meet actual needs, and the time complexity cannot meet real-time filtering.

Active Publication Date: 2009-07-01
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF0 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the large scale of the training database, it is necessary to calculate the similarity between the target webpage and all the webpages in the library when filtering each webpage, and its time complexity may not meet the needs of real-time filtering
In addition, the filtering threshold in this method is domain-dependent, so it needs to be carefully tuned according to the specific filtering test results when implementing personalization, which is another limitation of the practical application of this method.
[0007] Although after a lot of research, web filtering technology still can not meet the actual needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Personalized web page filtering method
  • Personalized web page filtering method
  • Personalized web page filtering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] Various details involved in the technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be pointed out that the described embodiments are only intended to facilitate the understanding of the present invention, rather than limiting it in any way.

[0066] The overall framework of the scheme system of the embodiment of the present invention is attached figure 1 , consists of two parts: Personalization Customization Unit 1 and Web Page Browsing Unit 2. Personalized customization unit 1 includes: input module 11, first and second webpage preprocessing and feature extraction modules 12 and 15, Internet 13, no label training webpage library 14, supervised learning module 16, feature extraction and feature selection module 17; The webpage browsing unit 2 includes: a target webpage 21 , a third webpage preprocessing and feature extraction module 22 , and a Bayesian classifier 23 .

[0067] The syst...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a web page filtering method which can be individually customized, which comprises steps: extracting the characteristics of a user example web page and a training web page base, excavating the attribute of a user interest class based on semi-supervised learning, conducting the characteristic extraction of the user interest class and the characteristic selection, and filtering the personalized web pages based on a Bayesian classifier. The invention provides a novel web page filtering frame driven by examples, the filtering demands of the users can be expressed through web page examples, the user examples can be any type web pages or multi-type composite web pages, and a web page filter in line with the individual demands of the user can be constructed by means of the semi-supervised learning, thereby overcoming the disadvantages of the limitation to single filtering or limited type web page, and unavailable realization of individual customization in a traditional web page filtering method. The method has the advantages of high accuracy, robustness and operation speed, and has excellent application prospects.

Description

technical field [0001] The invention relates to the technical field of computer networks, in particular to web page filtering technology. Background technique [0002] With the rapid development of the Internet (the Internet), it has gradually become an important part of people's lives, and people's dependence on the Internet is becoming stronger and stronger, and the demand for web page filtering is also increasing. On the one hand, due to the openness of the Internet, some bad information is also spread on the Internet, such as pornography, drugs, violence and so on. These bad information have a great impact on the physical and mental health of people, especially young people, and endanger the stability of society. On the other hand, due to the information explosion and the rapid development of the Internet, the amount of information on the Internet is increasing geometrically, but for specific Internet users, most of the information is useless or even spam. Therefore, h...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 胡卫明朱明亮李玺吴偶
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products