Set similarity calculation method and system based on minhash
A set similarity and similarity calculation technology, applied in the minhash-based set similarity calculation method and system field, can solve the problems of long calculation time, complicated calculation process, and long time-consuming minhash signature process, and achieve speed and speed improvement Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 2
[0104] figure 2 A schematic structural diagram of a minhash-based set similarity calculation system provided in Embodiment 2 of the present invention, as shown in figure 2 As shown, the set similarity calculation system is used to implement the set similarity calculation method in the first embodiment above, and the set similarity calculation system includes: hash mapping module 1, class group establishment module 2, allocation module 3, minimum hash A hash value determination module 4, a minimum hash signature generation module 5, and a similarity calculation module 6.
[0105] Wherein, the hash mapping module 1 is configured to use a hash function to map each element in the set to a first hash value with a length of m bits, where m is an integer.
[0106] Class group build module 2 for build 2 k class groups, each class group corresponds to a label, and the tag is a second hash value with a length of k bits, and the tags corresponding to different class groups are differ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com