Method for judging whether web page content is identical or not
A web page content technology, applied to identifying web pages with the same content. It addresses the lack of identification and filtering of such web pages and the resulting inconvenience in use, making viewing quicker and more convenient and reducing redundant results pages.
Embodiment Construction
[0077] The present invention is described in further detail below with reference to the accompanying drawings and examples.
[0078] Take the following two web pages, A and B, as an example, where:
[0079] For the URL of web page A, see 410 in Figure 4; for how web page A is displayed in the IE browser, see 420 in Figure 4; for the abridged content of the HTML source file of web page A, see Figure 6;
[0080] For the URL of web page B, see 510 in Figure 5; for how web page B is displayed in the IE browser, see 520 in Figure 5; for the abridged content of the HTML source file of web page B, see Figure 7;
[0081] As shown in Figure 1, we first calculate the title similarity of web pages A and B.
[0082] 110 is the step that extracts the title content of the web page from its HTML source file. The extraction method is to find the opening and closing title tags in the source file (matched case-insensitively), and the content between these two ...
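The title-extraction step described in [0082] can be sketched as follows. This is only an illustration, not the patent's implementation: the tag names were lost from the text above, so the sketch assumes they are `<title>` and `</title>`. The function name `extract_title` and the whitespace collapsing are likewise assumptions added for the example.

```python
import re
from typing import Optional

# Assumption: the tags referred to in [0082] are <title> and </title>.
# re.IGNORECASE gives the case-insensitive matching the text calls for;
# re.DOTALL lets the title content span several lines.
_TITLE_RE = re.compile(r"<title\b[^>]*>(.*?)</title\s*>", re.IGNORECASE | re.DOTALL)


def extract_title(html_source: str) -> Optional[str]:
    """Return the text between the first pair of title tags, or None."""
    match = _TITLE_RE.search(html_source)
    if match is None:
        return None
    # Collapsing runs of whitespace is an illustrative choice,
    # not something the source text specifies.
    return " ".join(match.group(1).split())


if __name__ == "__main__":
    sample = "<HTML><HEAD><TITLE> Example\n page </TITLE></HEAD></HTML>"
    print(extract_title(sample))
```

A regular expression is enough for a sketch like this. A production system would more likely use a real HTML parser so that malformed markup, comments, and scripts are handled correctly.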