Method and system for grabbing web pages from servers with different IPs (Internet Protocols) in website
A server and web crawling technology, applied in transmission systems, instruments, computing, etc., can solve problems such as low collection efficiency, crawling failure, and denial of service, and achieve the effect of improving fault tolerance and efficiency.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0016] The present invention will be described in detail below in conjunction with specific embodiments and accompanying drawings.
[0017] figure 1 It shows the system structure of grabbing webpages from multiple servers with different IPs in the website according to the present invention. Such as figure 1 As shown, the system includes a distributing device 11 , a judging device 12 connected to the distributing device 11 , and a grabbing device 13 connected to the judging device 12 .
[0018] The allocating device 11 is used for allocating the IP of the target website server for the web page crawling task of the client. The webpage grabbing task includes the URL (webpage address) of the webpage to be grabbed; the target website refers to the website where the webpage to be grabbed is located.
[0019] The judging means 12 is used for judging whether the webpage crawling task meets the polite access condition of the server. The polite access conditions include the followin...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com