Method and device for filtering Internet web page information

A technology for Internet web page information, applied in the field of Internet web page information filtering, that addresses problems such as pollution of the Internet environment.

Status: Inactive; Publication Date: 2018-06-12
佛山市车品匠汽车用品有限公司
8 Cites, 1 Cited by

AI Technical Summary

Problems solved by technology

In daily life and work, people obtain all kinds of information from the Internet, but harmful information also quietly enters their field of view and pollutes the entire Internet environment.

Examples

Embodiment Construction

[0027] The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the scope of protection of the present invention.

[0028] FIG. 1 is a flow chart of an Internet web page information filtering method.

[0029] In a first aspect, a method for filtering Internet web page information is provided. The method includes:

[0030] Step S101: Preprocess the web page information and extract the valid information text from the page;

[0031] Step S102: Perform Chinese word segmentation on the text, adopting the forward iterative maximum matching algorithm i...
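Steps S101 and S102 can be illustrated with a minimal sketch. It assumes that "valid information text" means the visible text of the page with script and style bodies discarded, and it uses plain greedy forward maximum matching against a word dictionary. The source text is truncated before it explains the "iterative" variant, so this is not necessarily the patent's exact algorithm. The names `TextExtractor`, `extract_text`, `forward_max_match`, and the `max_len` parameter are hypothetical.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> bodies (an assumption)."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    """S101 sketch: reduce an HTML page to its visible text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "".join(parser.chunks)


def forward_max_match(text, dictionary, max_len=4):
    """S102 sketch: scan left to right, always taking the longest dictionary word.

    A single character is emitted when no dictionary word starts at the
    current position, so the loop always advances.
    """
    words = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


page = "<html><head><style>p{}</style></head><body><p>研究生命起源</p><script>var a=1;</script></body></html>"
text = extract_text(page)
words = forward_max_match(text, {"研究", "研究生", "生命", "起源"})
print(text)   # 研究生命起源
print(words)  # ['研究生', '命', '起源']
```

The example output shows a known limitation of greedy forward matching: the longest prefix "研究生" wins, even though "研究 / 生命 / 起源" is the intended reading.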

Abstract

The invention provides a method and device for filtering Internet web page information. The method comprises: preprocessing web page information to extract the valid information text; performing Chinese word segmentation on the text using the forward iterative maximum matching algorithm, a string matching algorithm; performing feature extraction simultaneously on the word segmentation results and on the texts in a prepared corpus to obtain feature vectors; and performing text classification. If a text is classified as containing unhealthy information, the system indicates that the web page contains unhealthy information and adds the URL of the web page to a URL blacklist, so that on the next access the system immediately identifies the page as unhealthy and blocks the user from accessing it. Because the method filters unhealthy information before the user views it, it purifies the network environment to some extent and reduces the routes by which netizens can obtain unhealthy information, which is particularly important for the healthy physical and mental development of young people.
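The classify-then-blacklist flow described in the abstract can be sketched as follows. The abstract does not say how feature vectors are built or which classifier is used, so this sketch assumes simple term counts over a fixed vocabulary and takes the classifier as a caller-supplied function. `to_feature_vector`, `PageFilter`, `allow`, and the example URLs are all hypothetical names.

```python
from collections import Counter


def to_feature_vector(words, vocabulary):
    """Assumed feature extraction: count how often each vocabulary term occurs."""
    counts = Counter(words)
    return [counts[term] for term in vocabulary]


class PageFilter:
    """Blocks pages classified as unhealthy and remembers their URLs."""

    def __init__(self, classify, vocabulary):
        # classify is a hypothetical callable: feature vector -> True if unhealthy.
        self.classify = classify
        self.vocabulary = vocabulary
        self.blacklist = set()

    def allow(self, url, words):
        # A blacklisted URL is rejected immediately, without re-classification.
        if url in self.blacklist:
            return False
        vector = to_feature_vector(words, self.vocabulary)
        if self.classify(vector):
            self.blacklist.add(url)
            return False
        return True


# Toy classifier: flag a page if the first vocabulary term appears at all.
page_filter = PageFilter(lambda v: v[0] >= 1, ["赌博", "新闻"])
print(page_filter.allow("http://a.example/x", ["新闻", "新闻"]))  # True
print(page_filter.allow("http://b.example/y", ["赌博", "新闻"]))  # False
print(page_filter.allow("http://b.example/y", ["新闻"]))          # False (blacklisted)
```

Keeping the blacklist as a set makes the repeat-visit check a constant-time lookup, which matches the abstract's point that a previously flagged page is identified immediately on the next access.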

Description

Technical field

[0001] The invention relates to the technical field of the Internet, and in particular to a method and device for filtering Internet web page information.

Background art

[0002] The Internet is one of the most important ways for people today to obtain information, and an important window for communicating with the outside world. In daily life and work, people obtain all kinds of information from the Internet, but harmful information also quietly enters their field of view and pollutes the entire Internet environment. Harmful information on the Internet refers to any information that violates human morality and the law, or that incites, misleads, promotes superstition, or erodes people's mental health.

[0003] Therefore, network information identification and filtering technology is particularly important. By preprocessing harmful information...

Application Information

IPC(8): G06F17/30, G06F17/27
CPC: G06F16/353, G06F16/9535, G06F40/289
Inventor: 胡静
Owner: 佛山市车品匠汽车用品有限公司