Method, device and browser for collecting web pages

A web page collection and web page technology, applied in the computer field, can solve the problems of inability to provide and realize long-term collection and preservation of web page content, and achieve the effect of extending functions and ensuring long-term effectiveness.

Active Publication Date: 2017-12-15
TENCENT TECH (SHENZHEN) CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the embodiments of the present invention is to provide a method for collecting webpages, which aims to solve the problem that long-term collection and preservation of webpage content cannot be realized due to the inability of the prior art to provide an effective method for collecting webpages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and browser for collecting web pages
  • Method, device and browser for collecting web pages
  • Method, device and browser for collecting web pages

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0022] figure 1 The implementation flow of the method for collecting webpages provided by Embodiment 1 of the present invention is shown, and the details are as follows:

[0023] In step S101, an instruction to bookmark a webpage is received, and a webpage link corresponding to the webpage is acquired.

[0024] In the embodiment of the present invention, the instruction to bookmark a webpage includes a corresponding webpage link, and when the instruction to bookmark a webpage is received, the corresponding webpage link is obtained.

[0025] In step S102, the webpage crawling server in the cloud server group is invoked according to the webpage link to grab the webpage content corresponding to the webpage link.

[0026] In the embodiment of the present invention, in order to improve the crawling speed of the webpage content and realize the load balancing of the webpage crawling servers in the cloud server group, as an example, call the webpage crawling server in the cloud serve...

Embodiment 2

[0030] figure 2 It shows the implementation flow of the method for collecting webpages provided by Embodiment 2 of the present invention, and the details are as follows:

[0031] In step S201, an instruction to bookmark a webpage is received, and a webpage link corresponding to the webpage is acquired.

[0032] In the embodiment of the present invention, the instruction to bookmark a webpage includes a corresponding webpage link, and when the instruction to bookmark a webpage is received, the corresponding webpage link is obtained.

[0033] In step S202, the webpage crawling server in the cloud server group is invoked according to the webpage link to grab the webpage content corresponding to the webpage link.

[0034] In the embodiment of the present invention, in order to improve the crawling speed of the webpage content and realize the load balancing of the webpage crawling servers in the cloud server group, as an example, call the webpage crawling server in the cloud serv...

Embodiment 3

[0048] image 3 The structure of the web page collection device provided by the third embodiment of the present invention is shown. For the convenience of description, only the parts related to the embodiment of the present invention are shown, including:

[0049] The link acquiring unit 31 is configured to receive an instruction to bookmark a webpage, and acquire a webpage link corresponding to the webpage.

[0050] The webpage content capture unit 32 is configured to call a webpage capture server in the cloud server group to capture the webpage content corresponding to the webpage link according to the webpage link.

[0051] The webpage content saving unit 33 is configured to save the webpage content to a cloud storage server in the cloud server group.

[0052] In the embodiment of the present invention, in order to improve the crawling speed of the webpage content and realize the load balancing of the webpage crawling servers in the cloud server group, as an example, call ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is suitable for the technical field of computers and provides a web page collecting method and device as well as a browser. The web page collecting method comprises the following steps of receiving a web page collecting command, and obtaining a web page link corresponding to a web page; calling a web page grabbing server in a cloud server farm to grab web page content corresponding to the web page link according to the web page link; saving the web page content in a cloud storage server in the cloud server farm. According to the web page collecting method and device as well as the browser, which are disclosed by the invention, the cloud storage of the collected web page content is realized, and the long-term validity of the collected web page is ensured, so that the collected web page content is not limited by time and access addresses, and the function of a bookmark of the browser is expanded.

Description

technical field [0001] The invention belongs to the technical field of computers, and in particular relates to a web page collection method, device and browser. Background technique [0002] Favorites is a basic application in the browser, which is used to save the website / webpage link that the user needs to visit frequently on the local computer terminal, and the corresponding website / page can be opened directly by clicking the link) to access the corresponding resource, However, after the operating system of the computer terminal is reinstalled, the local user data in the previous favorites will be lost, and the website page can no longer be opened. In addition, the website link stored in the local computer terminal cannot be obtained on other computer terminals. [0003] In order to ensure that the link can also be opened on the computer terminal after the operating system is reinstalled or on other computer terminals, the network favorite folder is proposed, so that the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/9562
Inventor 刘刚
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products