Unlock instant, AI-driven research and patent intelligence for your innovation.

Webpage Analysis Method and System Based on Webpage Difference

A web page analysis and web page technology, applied in the field of network information analysis, can solve the problems of insufficient analysis ability, low efficiency, poor versatility, etc., and achieve the effect of good analysis effect, high efficiency and efficiency improvement.

Active Publication Date: 2019-01-22
NINGBO ZHONGQING CYYUN NEW MEDIA TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0013] The purpose of the embodiments of the present invention is to provide a webpage analysis method and system based on webpage differences, so as to solve the problems of low efficiency, insufficient analysis ability, and poor versatility in the analysis of existing webpage content

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Webpage Analysis Method and System Based on Webpage Difference
  • Webpage Analysis Method and System Based on Webpage Difference
  • Webpage Analysis Method and System Based on Webpage Difference

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] The aforementioned and other technical contents, features and effects of the present invention will be clearly presented in the following detailed description of the preferred embodiments with reference to the drawings. Through the description of specific implementation methods, the technical means and effects of the present invention to achieve the intended purpose can be understood more deeply and specifically, but the attached drawings are only for reference and description, and are not used to explain the present invention limit.

[0068] See figure 2 , which is a flowchart of a web page analysis method based on web page differences in an embodiment of the present invention, which includes the following steps:

[0069] S21. Collect web page information to be analyzed.

[0070] S22. Cluster the webpage information according to the collected node structure of the webpage information.

[0071] S23. According to the clustering result, extract the visualized web page...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention proposes a webpage analysis method and system based on webpage differences. The method includes: collecting webpage information to be analyzed; clustering the webpage information according to the node structure of the collected webpage information; classify the results, and extract the visualized webpage elements whose content changes in each node; classify the extracted visualized webpage elements, so as to identify the visualized webpage elements. The present invention can automatically realize the analysis of webpage content without manual operation, and realizes the analysis of the core webpage information concerned by users by extracting the visualized changing webpage elements, which has strong pertinence, good analysis effect, and saves labor costs , high efficiency, strong analytical ability, and strong versatility.

Description

technical field [0001] The invention relates to a network information analysis technology, in particular to a web page analysis method and system based on web page differences. Background technique [0002] The public opinion system needs to continuously crawl a large number of web pages on the Internet, and correctly analyze the content of the web pages, and analyze the release time, author, title and other information. For example, in volunteer network public opinion service, we need to collect volunteer-related network public opinion information from many sites. However, the formats of webpages of different sites vary widely, and even different layouts and pages of the same site may have different formats. These formats will be revised from time to time. These differences and modifications have brought great difficulties to automated web page analysis, and therefore often require a large amount of human intervention, manual configuration of new web pages and constant pa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/9535G06F16/906G06K9/62
CPCG06F16/951G06F18/23G06F18/24
Inventor 冯建兴张云刚翁时锋梁丰王遵义
Owner NINGBO ZHONGQING CYYUN NEW MEDIA TECH CO LTD