Method for webpage semantic module partition

A semantic and webpage technology, applied in the field of dividing webpage semantic modules, can solve the problems of inability to determine the size and granularity of semantic modules, and inability to automatically determine which layer the semantic module is divided into, so as to improve the display speed of webpages, the processing speed, and the system resources. less effect

Active Publication Date: 2014-08-13
袁运来
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Similarly, this method cannot determine the size granularity of semantic modules
[0014] In short, the use of existing methods for bad information filtering is faced with the problem of not being able to automatically determine which layer the semantic module is divided into

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for webpage semantic module partition
  • Method for webpage semantic module partition
  • Method for webpage semantic module partition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0051] The method of dividing the semantic modules of web pages, the working principle of this method is based on the following two rules:

[0052] Rule 1: The content of each web page is different and semantically independent, and each web page corresponds to a URL, so each URL represents an independent semantic unit.

[0053] Rule 2: Each semantic module on a web page will contain one or more hyperlinks. If a semantic module is complete (news, news comments, forum main posts, forum repl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for webpage semantic module partition. In the process of semantic module partition, nodes on the deepest layer where webpage semantic module partition should reach is estimated and a top-layer node set of a semantic module is calculated from the nodes on the deepest layer from bottom to top layer by layer according to similarity calculation. The deletion missing and fault deleting rate is low; the processing speed is high and the webpage display speed is increased under certain circumstances instead of being influenced; needed system resources are small in number and the method can be widely applied to various systems comprising platforms of a mobile device and a route with consideration of the cost of hardware and the like.

Description

technical field [0001] The invention relates to a method for dividing semantic modules of webpages, which determines the upper-level nodes where sensitive words are located by calculating weights, and iteratively expands node sets through methods such as similarity from bottom to top to realize the division of semantic modules of webpages, thereby shielding sensitive information. Background technique [0002] At present, there are more and more advertisements and bad information on the Internet, permeating most websites, which are acceptable to adults, but have a great negative effect on minors, so it is necessary to block them. [0003] The current bad information filtering software is mainly aimed at the entire webpage or website, by searching the URL database of the bad website to identify whether the url to be accessed is allowed to visit, and to detect whether the webpage contains bad words or pictures, and to block the webpage, which means that the software has IE's "C...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/9577
Inventor 袁运来
Owner 袁运来
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products