Web crawler system based on big data
A web crawler and big data technology, applied in the field of crawler, can solve the problems of inability to identify valid data, low crawling efficiency, difficulty in applying information crawling, etc., to reduce the amount of invalid crawling data and improve crawling efficiency.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] In order to make the purpose, technical solution and advantages of the present application clearer, the technical solution of the present application will be described in detail below. Apparently, the described embodiments are only some of the embodiments of this application, not all of them. Based on the embodiments in the present application, all other implementation manners obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present application.
[0031] figure 1 A functional structure diagram of a web crawler system based on big data provided for an embodiment of the present application, such as figure 1 As shown, the big data-based web crawler system includes:
[0032] The configuration module 11 is used to configure crawler parameters;
[0033] In some embodiments, crawler parameters include, but are not limited to, network URLs and data fields in web pages.
[0034] By configuring the URL of the da...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


