Method and device for extracting webpage information
A technology of web page information and page information, applied in the network field, can solve the problems of reducing the amount of data storage, long setting period of extraction rules, and inability to extract web page information at the same time, so as to achieve reduced impact, stability, universal applicability, reliability and accuracy Extraction effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0057] In order to enable those skilled in the art to better understand the solution of the present invention, the embodiments of the present invention will be further described in detail below with reference to the accompanying drawings and embodiments.
[0058] The following first introduces the application scenarios of the present invention and the preparation work before information extraction.
[0059] Webpage information extraction is an important work of search engine page analysis. The webpage content that users are interested in is extracted and organized into structured data, which is conducive to more effective indexing and searching of webpages by search engines. The invention provides an automatic and reliable Web page information extraction scheme.
[0060] For a website, its HTML (Hyper Text Markup Language, is a markup language used to describe web documents) web pages are not all built by manual editing, mainly by website creation tools and template code. Become. T...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2023 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap