Method and device for extracting webpage content
A technology of webpage content and webpage, applied in the field of devices for extracting webpage content
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0035] Exemplary embodiments of the present invention will be described in detail below with reference to the accompanying drawings. In the drawings, like reference numerals refer to like elements throughout.
[0036] figure 1 is a block diagram showing an exemplary structure of the webpage content extracting apparatus 100 according to the embodiment of the present invention. According to an exemplary embodiment of the present invention, the webpage content extraction device 100 includes an input unit 110, a DDA webpage content extraction unit 120, a webpage to image conversion unit 130, a DIR webpage content extraction unit 140, and a DDA and DIR extraction result fusion unit 150. The input unit 110 is used to input web pages. In an exemplary embodiment of the present invention, the input web page may be, for example, a web page file in Hypertext Markup Language (HTML) format. The DDA webpage content extraction unit 120 performs webpage content extraction processing based ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com