HTTP message collection method based on proxy, terminal equipment and storage medium
A collection method and message technology, which are applied in the Internet field, can solve the problems of increasing the burden on the server of the collected site and the loss of network virtual property, and achieve the effects of avoiding the loss of network virtual property, avoiding re-collection, and reducing the burden.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0024] The embodiment of the present invention provides a proxy-based HTTP packet collection method, such as figure 1 As shown, the method includes the following steps:
[0025] S1: Build an HTTP message proxy module, and receive the HTTP request message sent by the crawler module through the HTTP message proxy module.
[0026] The HTTP message proxy module provides proxy services of protocols such as HTTP, HTTPS or SOCKS for crawler modules that can be configured with proxy, and global proxy for the system for crawler modules that do not support proxy configuration.
[0027] S2: After the HTTP message agent module receives the HTTP request message, it judges whether there is an HTTP request message identical to the received HTTP request message in the HTTP message database, and if so, enters S4; otherwise, enters S3.
[0028] When the HTTP request message is transmitted using the HTTPS protocol, the crawler module uses the TLS certificate issuing authority corresponding to t...
Embodiment 2
[0042]The present invention also provides a proxy-based HTTP message collection terminal device, including a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor executes the computer program The steps in the above method embodiment of Embodiment 1 of the present invention are realized at the same time.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
