Crawler method for configuring and collecting APP information and storage medium
A configuration and configuration item technology, applied in the field of crawler, can solve the problems of low development efficiency, high development cost, inability to realize APP information data, etc., and achieve the effect of high development efficiency and low coding and development cost.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0049] Please refer to figure 1 , this embodiment provides a crawler method for configuring APP information collection, which can adapt to various application markets to collect APP information, and reduce development costs while improving development efficiency.
[0050] The method includes the following steps:
[0051] 1. Crawler site configuration
[0052] S1: Construct a content parsing dictionary, the content parsing dictionary includes a primary parsing configuration key and its corresponding primary parsing configuration item, a secondary parsing configuration key and its corresponding secondary parsing configuration item;
[0053]The content parsing dictionary consists of key / item associations. Specifically, the content analysis dictionary in this embodiment records the association relationship between the first-level analysis configuration key and the corresponding first-level analysis configuration item, and the relationship between the second-level analysis config...
Embodiment corresponding Embodiment 1
[0076] This embodiment corresponds to Embodiment 1, and provides a specific application scenario:
[0077] Take the crawler crawling APP information in the App Store as an example to illustrate:
[0078] 1. Configure the initial domain name address of the App Store application market "https: / / itunes.apple.com / cn"; in addition, configure multiple relative path addresses corresponding to it. Here, "action games" and "adventure games" are classified as For example, configure the corresponding relative path and configure the corresponding parsing configuration key as "list-parse", the configuration is as follows:
[0079]
[0080]
[0081] 2. The crawler downloads the page content of "Action Game" and "Adventure Game" respectively; reads the parsing configuration item 'items.app_info'.name' corresponding to the parsing configuration key (list-parse), which is after the following list-parse According to the content in the first {}, the APP name in the list can be extracted a...
Embodiment 3
[0090] This embodiment corresponds to Embodiment 1 and Embodiment 2, and provides a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, it can implement the above-mentioned embodiment or one of the embodiments provided by the embodiment. Configure all the steps included in the crawler method for collecting APP information. The specific steps will not be repeated here, please refer to the description of Embodiment 1 or Embodiment 2 for details.
[0091] Wherein, the storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM) or a random access memory (Random Access Memory, RAM) and the like.
[0092] To sum up, the crawler method and storage medium for configuring APP information provided by the present invention are not only applicable to crawlers crawling in different types of application market layouts, but also do not require a lot of coding to realize, and do not need cu...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


