A Data Frequency Estimation Method Based on Carry-Based Sketch Data Structure
A data structure and data technology, applied in the direction of electronic digital data processing, digital data information retrieval, special data processing applications, etc., can solve the problems of limited data storage limit, sensitive use of space size, space size restricts accuracy, etc. Achieve the effect of improving accuracy and increasing the upper limit of counting
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific example
[0062] Suppose there are 5 different query strings, namely a, b, c, d, e, and the frequencies are 1000, 300, 200, 1200, 400. In the original CM Sketch, a and c are mapped to the same position, and the count of this position is 1000+200=1200. b and d map to the same location, which has a count of 300+1200=1500.
[0063] Now suppose we traverse these strings in the order of edcba, trying to find top-3, and we have found 3 with a maximum value of 350, 340, 330 before. Find e, the query value 400 is large enough, and then go to the hash table to query its real value 400, then the current top-3 are 400, 350, 340 respectively. Find d, query the value of 1500, and then find the real value of 1200, then the current top-3 are 1200, 400, and 350 respectively. Find c, query the value 1200, and then find the real value 200, ignore it. Similarly, b is also ignored. Finally find a, query the real value of 1000, and finally get the top-3 as 1200, 1000, 400. In this process, a total of 5...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


