System and method for analyzing information of websites

A technology for website information and analysis methods, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problems of page omission, complex website structure, large manual workload, etc., to improve work efficiency and accurate information analysis. , Eliminate the effect of page omission

Active Publication Date: 2011-05-25
XIAMEN FUYUN INFORMATION TECH CO LTD
View PDF6 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] The information analysis process for the website of the prior art mainly adopts manual access to the website through a browser (if the website that needs permission also needs a user name and password to log in), and then adopts a screen capture method to analyze the screenshot information of the website page; if the web page is relatively long If it is necessary to use a special screen capture tool to capture the entire page; at the same time, there are still problems such as the complex structure of the website, and the workload of manual methods is heavy and there is no way to accurately complete the information analysis of the entire website.
Therefore, the prior art has the following

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for analyzing information of websites
  • System and method for analyzing information of websites
  • System and method for analyzing information of websites

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Embodiment, referring to the accompanying drawings, a website information analysis system of the present invention includes:

[0036] An authority verifier 1, which is used to detect the authority of the target website, send a login command to the target website to extract the page information of the website, and determine whether the target website needs a user name and password to log in and visit. For the target website that needs a user name and password to log in and visit , using the web crawler technology analysis module to simulate the login process of the website to automatically log in to the target website;

[0037] A link extractor 2, which is used to obtain all link data information of the target website, and extract all link data information from the pages of the target website by using the link extraction algorithm module;

[0038] A link analyzer 3, which is used to analyze and analyze all link data information, and classify and analyze all link data inf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a system and method for analyzing information of websites. The system comprises an authority verifier, a link extractor, a link analyzer, a link page imager, an incremental information analyzer and a report generator. The method comprises the following steps: the authority verifier is utilized to detect the authority of the target websites; the link extractor is utilized to obtain all the link data information of the target websites; the link analyzer is utilized to analyze all the link data information; the link page imager is utilized to analyze memory and imaging ofthe classified link data information; the incremental information analyzer is utilized to traverse all the links of the target websites again and form images of and memorize link queues which do not appear in a specified memory; and the report generator is utilized to generate the report in preset format. The system and the method have the following beneficial effects: by adopting the system and method disclosed by the invention, not only can automatic information analysis be realized, but also page omission existing in manual information analysis can be eliminated; and the system and the method have the characteristics of accurate, rapid and omission-free information analysis and easy operation.

Description

technical field [0001] The invention relates to an information analysis system for a website, in particular to a website information analysis system and a method thereof. Background technique [0002] The information analysis process for the website of the prior art mainly adopts manual access to the website through a browser (if the website that needs permission also needs to log in with a user name and password), and then adopts a screen capture method to analyze the screenshot information of the website page; if the web page is relatively long If you want to use a special screen capture tool to capture the entire page; at the same time, there are problems such as the complex structure of the website, and the manual method is heavy and there is no way to accurately complete the information analysis of the entire website. Therefore, the prior art has the following problems when solving website information analysis: one is that a special screen capture tool is required to co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 陈奋吴世雄张永光
Owner XIAMEN FUYUN INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products