Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Method for Dividing Semantic Modules of Web Pages

A technology for semantics and web pages, applied in the field of dividing web page semantic modules, can solve the problems of inability to automatically determine which layer of semantic modules to be divided into, and inability to determine the size and granularity of semantic modules, so as to improve web page display speed, processing speed, and system resources. less effect

Active Publication Date: 2017-02-22
袁运来
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Similarly, this method cannot determine the size granularity of semantic modules
[0014] In short, the use of existing methods for bad information filtering is faced with the problem of not being able to automatically determine which layer the semantic module is divided into

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Dividing Semantic Modules of Web Pages
  • A Method for Dividing Semantic Modules of Web Pages
  • A Method for Dividing Semantic Modules of Web Pages

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0051] The method of dividing the semantic modules of web pages, the working principle of this method is based on the following two rules:

[0052] Rule 1: The content of each web page is different and semantically independent, and each web page corresponds to a URL, so each URL represents an independent semantic unit.

[0053] Rule 2: Each semantic module on a web page will contain one or more hyperlinks. If a semantic module is complete (news, news comments, forum main posts, forum repl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for webpage semantic module partition. In the process of semantic module partition, nodes on the deepest layer where webpage semantic module partition should reach is estimated and a top-layer node set of a semantic module is calculated from the nodes on the deepest layer from bottom to top layer by layer according to similarity calculation. The deletion missing and fault deleting rate is low; the processing speed is high and the webpage display speed is increased under certain circumstances instead of being influenced; needed system resources are small in number and the method can be widely applied to various systems comprising platforms of a mobile device and a route with consideration of the cost of hardware and the like.

Description

technical field [0001] The invention relates to a method for dividing semantic modules of webpages, which determines the upper-level nodes where sensitive words are located by calculating weights, and iteratively expands node sets through methods such as similarity from bottom to top to realize the division of semantic modules of webpages, thereby shielding sensitive information. Background technique [0002] At present, there are more and more advertisements and bad information on the Internet, permeating most websites, which are acceptable to adults, but have a great negative effect on minors, so it is necessary to block them. [0003] The current bad information filtering software is mainly aimed at the entire webpage or website, by searching the URL database of the bad website to identify whether the url to be accessed is allowed to visit, and to detect whether the webpage contains bad words or pictures, and to block the webpage, which means that the software has IE's "C...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/9577
Inventor 袁运来
Owner 袁运来
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products