Method for rapidly collecting dynamic script website data
A dynamic script website technology in the field of electrical digital data processing, special data processing applications, instruments, etc. It addresses problems of existing approaches, such as slow speed and unsuitability for practical applications, and achieves the effect of reducing the number of times and improving the collection speed.
Embodiment Construction
[0012] The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.
[0013] The prior art either does not process dynamic script websites at all or handles them with hard-coded methods. In contrast, the method of the present invention executes in two parts: the first part is training, and the second part is crawling. Similarity training on pages determines which events should be triggered on which page elements for each type of page. Crawling can begin once training is complete, and it may adopt any of several crawling strategies. This embodiment uses breadth-first crawling: each time an event is triggered, the crawler falls back to the original page, and other pages are processed only after all the events that need to be triggered on the original page have been triggered.
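The breadth-first strategy of paragraph [0013] can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy site `SITE`, the rule table `TRAINED_RULES` (standing in for the output of the training part), the `SimulatedBrowser` class and the `crawl` function are all hypothetical names introduced here, and falling back to the original page is simulated by resetting the current page rather than by driving a real browser.

```python
from collections import deque

# Hypothetical toy site: page id -> (page type, {(element, event): resulting page id}).
SITE = {
    "home": ("list", {("#next", "click"): "page2", ("#item1", "click"): "item1"}),
    "page2": ("list", {("#next", "click"): "home", ("#item1", "click"): "item2"}),
    "item1": ("detail", {("#more", "click"): "item1_full"}),
    "item2": ("detail", {("#more", "click"): "item2_full"}),
    "item1_full": ("leaf", {}),
    "item2_full": ("leaf", {}),
}

# Assumed result of the training part: page type -> events to trigger on that type.
TRAINED_RULES = {
    "list": [("#next", "click"), ("#item1", "click")],
    "detail": [("#more", "click")],
}


class SimulatedBrowser:
    """Stand-in for a script-capable browser; it only tracks the current page."""

    def __init__(self, site):
        self.site = site
        self.current = None

    def load(self, page_id):
        self.current = page_id

    def trigger(self, element, event):
        # Firing an event moves the browser to the resulting page, if there is one.
        _, transitions = self.site[self.current]
        target = transitions.get((element, event))
        if target is not None:
            self.current = target
        return target


def crawl(browser, start, rules):
    """Breadth-first crawl: fire every trained event on a page before moving on."""
    visited = [start]
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        browser.load(page)
        page_type = browser.site[page][0]
        for element, event in rules.get(page_type, []):
            target = browser.trigger(element, event)
            if target is not None and target not in seen:
                seen.add(target)
                visited.append(target)
                queue.append(target)
            # Fall back to the original page before triggering the next event.
            browser.load(page)
    return visited


if __name__ == "__main__":
    print(crawl(SimulatedBrowser(SITE), "home", TRAINED_RULES))
```

Because each page's events are exhausted before the queue advances, pages are discovered level by level. With a real browser, the fallback step would be a history-back or reload operation, whose cost this sketch does not model.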
[0014] As shown in Figure 1, the training steps of the prese...