Method for extracting Web information required by a user from a Web page
A Web page information extraction technology, applied in the Internet field, that addresses problems such as high time complexity and poor scalability and makes information extraction algorithms more flexible.
Embodiment Construction
[0054] The present invention provides an information extraction method based on the HTML grammar standard. It decouples Web page information segmentation and information extraction algorithms from the specific information content, so that the segmentation and extraction algorithms are more versatile and flexible.
[0055] As shown in Figure 2, the method provided by the present invention for extracting the Web information required by a user from a Web page includes:
[0056] Step S11: Following the order of the HTML text corresponding to the Web page, select a number of HTML tags as tag ruler elements, generate a tag ruler from them, and store the tag ruler in the system;
[0057] Step S12: The system matches the HTML text in sequence against the HTML tag elements in the tag ruler, divides the Web information at the matched HTML tags, and stores each divided Web information block together with the position in the text of the HTML tag that contains it;
[0058] Step S1...
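Steps S11 and S12 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that a tag ruler is simply an ordered list of tag names, that matching means finding the next start tag equal to the next ruler element, and that an information block is the text enclosed by the matched tag. The names `RulerSegmenter` and `segment` are hypothetical. Steps after S12 are truncated in the text above and are not covered.

```python
from html.parser import HTMLParser


class RulerSegmenter(HTMLParser):
    """Hypothetical helper: match start tags against a tag ruler, in order."""

    def __init__(self, ruler):
        super().__init__()
        self.ruler = list(ruler)  # S11: ordered tag names, e.g. ["h1", "p"]
        self.next_index = 0       # index of the next ruler element to match
        self.open_tag = None      # tag whose block is currently being captured
        self.depth = 0            # nesting depth of same-named tags
        self.buffer = []          # text pieces of the current block
        self.position = None      # (line, column) of the matched start tag
        self.blocks = []          # S12 result: (tag, position, text) tuples

    def handle_starttag(self, tag, attrs):
        if self.open_tag is not None:
            # Inside a block: only track nesting of the same tag name.
            if tag == self.open_tag:
                self.depth += 1
            return
        if self.next_index < len(self.ruler) and tag == self.ruler[self.next_index]:
            self.open_tag = tag
            self.depth = 1
            self.buffer = []
            self.position = self.getpos()  # where the tag starts in the text
            self.next_index += 1

    def handle_endtag(self, tag):
        if self.open_tag is None or tag != self.open_tag:
            return
        self.depth -= 1
        if self.depth == 0:
            # Collapse runs of whitespace so the stored block is clean text.
            text = " ".join("".join(self.buffer).split())
            self.blocks.append((self.open_tag, self.position, text))
            self.open_tag = None

    def handle_data(self, data):
        if self.open_tag is not None:
            self.buffer.append(data)


def segment(html, ruler):
    """Return the information blocks matched by the ruler, in document order."""
    parser = RulerSegmenter(ruler)
    parser.feed(html)
    parser.close()
    return parser.blocks


if __name__ == "__main__":
    page = "<html><body><h1>Title</h1><div>skip</div><p>First</p><p>Second</p></body></html>"
    for tag, position, text in segment(page, ["h1", "p"]):
        print(tag, position, text)
```

Because the ruler holds only tag names and their order, the same segmentation code can be reused on pages with different content by supplying a different ruler, which is the kind of flexibility paragraph [0054] describes.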