Web information extraction system
A technology of information extraction and information points, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of failure to extract data analysis and prediction, lack of versatility, etc., to achieve easy understanding, save time and physical strength , the effect of simple interface operation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach
[0088] 1. Single-slot extraction rules page
[0089] Taking the webpage "Sina-Weather-Dalian" as an example, this paper introduces the definition method of single-slot extraction rules. Assume that the information of interest to this webpage is the weather conditions of the day, including the following information points: city name, day of the week, temperature, and wind. The process of defining a rule is as follows:
[0090] (1) First, the user enters the inspection rules (corresponding URL), and chooses to define the extraction rule method (script web page and DOM tree); select the rule storage path according to the prompt interface, and enter the rule file name (the extension is rul).
[0091] (2) If the scripted webpage mode is selected, the system will automatically download the webpage corresponding to the URL and parse it to obtain the scripted webpage. The system automatically starts the browser to open the script webpage, and the user can click on the weather inform...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com