Carry-based data frequency estimation method for Sketch data structure
A data structure and data technology, applied in the direction of electrical digital data processing, special data processing applications, computing, etc., can solve the problems of limited data storage limit, sensitive use of space, and Count-MinSketch performance limitations, etc., to reach the upper limit of counting The effect of improving and improving the degree of accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific example
[0062] Suppose there are 5 different query strings, namely a, b, c, d, e, and the frequencies are 1000, 300, 200, 1200, 400. In the original CM Sketch, a and c are mapped to the same position, and the count of this position is 1000+200=1200. b and d map to the same location, which has a count of 300+1200=1500.
[0063] Now suppose we traverse these strings in the order of edcba, trying to find top-3, and we have found 3 with a maximum value of 350, 340, 330 before. Find e, the query value 400 is large enough, and then go to the hash table to query its real value 400, then the current top-3 are 400, 350, 340 respectively. Find d, query the value of 1500, and then find the real value of 1200, then the current top-3 are 1200, 400, and 350 respectively. Find c, query the value 1200, and then find the real value 200, ignore it. Similarly, b is also ignored. Finally find a, query the real value of 1000, and finally get the top-3 as 1200, 1000, 400. In this process, a total of 5...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


