Fine-grained webpage information acquisition method
A collection method in the field of web information technology, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the problems of high construction cost, narrow application area, and low accuracy rate, achieving high collection efficiency and reducing manual operations.
Embodiment Construction
[0023] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.
[0024] As shown in Figures 1, 2, and 3.
[0025] A manually assisted method for collecting fine-grained webpage information, comprising the following steps:
[0026] a. Use the roaming (crawling) method of traditional web robots to collect well-structured or semi-structured webpage content and the corresponding URLs from the Internet;
[0027] b. Distinguish the templates of the collected webpages by their URLs (i.e. webpage addresses), using the part of the URL before the symbol "?" as the identification mark (some webpages use POST or cookies to pass parameters);
[0028] c. Manually collect the information elements required for one or more purposes from the content of the above-mentioned webpages, and store them in the local database together with the identification mark of the corresponding webpage; the information elements mentioned here refer to fine-grained collection information, ...
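Steps b and c can be illustrated with a minimal Python sketch. It is not the patent's implementation: the function names `template_key`, `open_store`, and `save_elements`, the `elements` table layout, the use of SQLite as the "local database", and the example URL are all assumptions made for illustration. The only behavior taken from the text is that the part of the URL before "?" serves as the identification mark and that manually collected elements are stored together with that mark.

```python
import sqlite3


def template_key(url: str) -> str:
    """Return the part of the URL before '?' (the identification mark).

    If the URL contains no '?', the whole URL is returned unchanged.
    """
    return url.partition("?")[0]


def open_store(path: str) -> sqlite3.Connection:
    """Open a local SQLite database with a hypothetical 'elements' table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS elements ("
        "template_key TEXT NOT NULL, "
        "name TEXT NOT NULL, "
        "value TEXT NOT NULL)"
    )
    return conn


def save_elements(conn: sqlite3.Connection, url: str, elements: dict) -> None:
    """Store manually collected elements together with the page's mark."""
    key = template_key(url)
    conn.executemany(
        "INSERT INTO elements (template_key, name, value) VALUES (?, ?, ?)",
        [(key, name, value) for name, value in elements.items()],
    )
    conn.commit()


if __name__ == "__main__":
    # Hypothetical page and hand-picked elements, for illustration only.
    conn = open_store(":memory:")
    save_elements(
        conn,
        "http://example.com/news/show.asp?id=42",
        {"title": "Hello", "date": "2024-01-01"},
    )
    for row in conn.execute("SELECT template_key, name, value FROM elements"):
        print(row)
```

Under this sketch, pages whose URLs differ only in their query string share one identification mark. Pages that pass parameters via POST or cookies would not be distinguished by this key alone, and the sketch does not handle them.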