Distributed crawler task scheduling system and method
A scheduling system and scheduling method technology, which are applied in the field of scheduling methods and systems for distributed crawler tasks, can solve the problems of inability to meet the individual needs of customers and the inability to obtain data effectively and quickly with their own efficiency, achieving fast collection speed and universal use of crawlers. high sex effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0049] figure 1 It is a flow chart of a method for scheduling distributed crawler tasks provided by an embodiment of the present application; figure 1 As shown, the scheduling method of the distributed crawler task provided by this embodiment includes at least the following steps:
[0050] S101, acquiring a user-defined crawler task;
[0051] In practical application scenarios, crawler tasks are generated based on crawler task scripts. The crawler task script is created by the user and combined in sequence according to various operations of the user on the browser. The crawler task script may contain loop operations, indicating that the same processing is performed on the urls in the loop list in turn. Therefore, for a crawler task created by a user, the application splits the outermost loop parameters of the task to form subtasks, so as to facilitate parallel crawling of the task. Among them, the processing method for the circular list is described in more detail below. ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


