Method for estimating top-n cardinal number data in high-speed data flow
A high-speed data flow, top-n technology, applied in data classification, processing input data, electrical digital data processing, etc., to achieve stable time efficiency, space efficiency optimization, and simple methods
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0027] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. In the data stream, neither the data type of the non-top-n data nor the actual cardinality of the non-top-n data type is concerned, and the cardinality of the non-top-n data type is relatively small compared to the cardinality of the top-n data type Many, even if they are added together by mistake, the cardinality precision of the top-n data type will not be damaged much.
[0028] In the present invention, a data structure used is called "HyperLogLog Sketch matrix", which is set as S, with a width of m and a height of n, and each element is an HLL counter. Correspondingly, there are n hash functions that are independent of each other and have a hash value of 1~m, set f 1 , f 2 ,...,f n . Such as figure 2 As shown, when new data D appears, follow the steps below:
[0029] 1. Classify by business and set it as type X;
[0030] 2. ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com