Website state reconnaissance method and device
A reconnaissance device and status technology, applied in the field of network information, can solve problems such as time-consuming, reduce unavailability, improve comprehensive management capabilities, and reduce invalid collection operations
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0026] Such as figure 1 As shown, the method for detecting website status in this embodiment includes:
[0027] Step S10: According to a preset reconnaissance cycle, periodically send an access request to the collection target webpage, and receive response information returned by the server of the collection target webpage;
[0028] Step S20: Process the response information;
[0029] Step S30: Determine whether the response information indicates that the collection target webpage is accessible, and when the collection target webpage is not accessible, perform step S40:
[0030] Step S40: Send first alarm information, the first alarm information is used to indicate that the collection target webpage is not accessible; the collection target webpage is an item in the task webpage list.
[0031] Preferably, webpage information corresponding to each collection target webpage in the task webpage list is collected periodically according to a preset collection period.
[0032] Due to the natura...
Embodiment 2
[0036] The method for detecting the status of the website in this embodiment is based on Embodiment 1, and further describes the structure of the reconnaissance webpage.
[0037] Such as figure 2 As shown, the method for detecting website status in this embodiment further includes:
[0038] Step S50: when the response information indicates that the collection target webpage is accessible, access the collection target webpage to obtain the webpage structure information of the collection target webpage;
[0039] Step S60: When it is detected that the webpage structure of the collection target webpage has changed according to the webpage structure information, second alarm information is issued, and the second alarm information is used to indicate that the webpage structure of the collection target webpage has changed.
[0040] Crawler programs usually analyze information based on collection templates customized for the web page. Therefore, if the web page structure changes, causing the...
Embodiment 3
[0043] The method for detecting website status in this embodiment is based on Embodiment 2, and further describes the method for detecting changes in the webpage structure.
[0044] Such as image 3 As shown, according to the web page structure information in the method for detecting website status in this embodiment, it is detected that the web page structure of the collection target web page has changed, which may include one or more of the following:
[0045] Step S61: It is detected that the frame information of the collected target webpage has changed;
[0046] Step S62: It is detected that the content information of the collection target webpage has changed;
[0047] Step S63: It is detected that the rendering information of the collection target webpage has changed;
[0048] Step S64: It is detected that the format information of the collection target webpage has changed.
[0049] A webpage is a kind of compound file that carries content displayed in a certain layout. Regular web...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com