A web data similarity detection method based on two-level filtering of structure and content
A detection method and secondary filtering technology, applied in the field of similarity detection of Web data structure and content, can solve the problems of not making full use of the characteristics of Web data distribution area, difficult to find approximate content blocks efficiently, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0097] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the implementation examples described here are only used to illustrate and explain the present invention, and are not intended to limit this invention.
[0098] please see figure 1 , the technical solution adopted in the present invention is: a Web data similarity detection method based on two-level filtering of structure and content, on the basis of the traditional general similarity detection method, the characteristics of Web data structure and content distribution are excavated, Two-stage filtering is performed on the detected document set; the invention considers that the documents containing similar web data should be similar in structure at first, and if the structures of the two documents are very diff...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com