System and a method for detecting the key content of a web page based on visual characteristics
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- 中共中央办公厅电子科技学院
- Publication Date
- 2019-02-15
- Estimated Expiration
- Not applicable · inactive patent
Smart Images

Figure 1
Abstract
Description
technical field
[0001] The invention relates to the technical field of the Internet, in particular to an automatic detection system and method for key content of webpages based on visual features. Background technique
[0002] With the widespread application of the Internet, web pages have become an important carrier for users to obtain information. When search engines use web crawler software to crawl web pages, they need to analyze the key content, remove non-key content such as advertisements, navigation bars, and user comments in the web page, and provide users with a summary of the target web page. On the other hand, with the complexity and diversification of web design and the further popularization of web page dynamic rendering technology, a lot of key content is often added dynamically through JavaScript code, while the traditional tag analysis based on static HTML code is used to analyze key content The detection method has been unable to adapt to the increasingly ...