
Multiple Cuckoo Filters Based on a Streaming Computing Model

A multiple cuckoo filter for streaming computing, applied in computing, instrumentation, database indexing, etc. It addresses problems such as declining index performance and growing memory use, and achieves low space occupancy, a reduced false positive rate, and a smaller amount of computation.

Active Publication Date: 2021-01-08
HANGZHOU INST OF ADVANCED TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

Distributed data management systems for multi-dimensional data suffer a sharp decline in performance, for example in indexing, and the memory they occupy grows rapidly as the number of dimensions increases.



Examples


Embodiment Construction

[0029] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0030] The present invention proposes an adaptive cuckoo filter that changes dynamically with the number of data streams under a streaming computing model. To make the purpose, technical solutions, and effects of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings and examples.

[0031] 1. Data structure

[0032] As shown in Figure 1, the multiple cuckoo filter is composed of standard cuckoo filters whose number equals the total number of data streams. Each data stream corresponds to one standard cuckoo filter; that is, the representation and query of multiple data sets are decomposed into the representation and query of multiple single data sets. As many data streams as there are generated, there are as many standard cucko...
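The one-filter-per-stream layout described in [0032] can be sketched as follows. This is a minimal illustration and not the patent's implementation: the class names `CuckooFilter` and `MultiCuckooFilter`, the method `streams_containing`, the bucket count, bucket size, 16-bit fingerprints, and the use of SHA-256 are all assumptions chosen for readability.

```python
import hashlib
import random


class CuckooFilter:
    """Minimal standard cuckoo filter (partial-key cuckoo hashing). Hypothetical sketch."""

    def __init__(self, num_buckets=1024, bucket_size=4, max_kicks=500):
        # A power-of-two bucket count keeps the XOR-based alternate index reversible.
        assert num_buckets & (num_buckets - 1) == 0
        self.num_buckets = num_buckets
        self.bucket_size = bucket_size
        self.max_kicks = max_kicks
        self.buckets = [[] for _ in range(num_buckets)]

    @staticmethod
    def _hash(data: bytes) -> int:
        return int.from_bytes(hashlib.sha256(data).digest()[:8], "big")

    def _fingerprint(self, item: str) -> int:
        # Fingerprint in the range 1..65535 (assumed 16-bit width).
        return self._hash(b"fp:" + item.encode()) % 65535 + 1

    def _index(self, item: str) -> int:
        return self._hash(item.encode()) % self.num_buckets

    def _alt_index(self, index: int, fp: int) -> int:
        # Applying this twice with the same fingerprint returns the original index.
        return (index ^ self._hash(fp.to_bytes(2, "big"))) % self.num_buckets

    def insert(self, item: str) -> bool:
        fp = self._fingerprint(item)
        i1 = self._index(item)
        i2 = self._alt_index(i1, fp)
        for i in (i1, i2):
            if len(self.buckets[i]) < self.bucket_size:
                self.buckets[i].append(fp)
                return True
        # Both candidate buckets are full: evict a resident fingerprint and relocate it.
        i = random.choice((i1, i2))
        for _ in range(self.max_kicks):
            slot = random.randrange(self.bucket_size)
            fp, self.buckets[i][slot] = self.buckets[i][slot], fp
            i = self._alt_index(i, fp)
            if len(self.buckets[i]) < self.bucket_size:
                self.buckets[i].append(fp)
                return True
        return False  # the filter is considered full

    def contains(self, item: str) -> bool:
        fp = self._fingerprint(item)
        i1 = self._index(item)
        i2 = self._alt_index(i1, fp)
        return fp in self.buckets[i1] or fp in self.buckets[i2]


class MultiCuckooFilter:
    """One standard cuckoo filter per data stream, as in [0032]."""

    def __init__(self, stream_ids):
        self.filters = {sid: CuckooFilter() for sid in stream_ids}

    def insert(self, stream_id, item: str) -> bool:
        return self.filters[stream_id].insert(item)

    def streams_containing(self, item: str):
        # A query over many data sets becomes one query per single-stream filter.
        return [sid for sid, f in self.filters.items() if f.contains(item)]


multi = MultiCuckooFilter(["s1", "s2", "s3"])
multi.insert("s1", "alice")
multi.insert("s2", "alice")
multi.insert("s3", "bob")
print(multi.streams_containing("alice"))  # includes "s1" and "s2"
```

Because each filter stores only fingerprints, a query may occasionally report a stream that never saw the item (a false positive), but it never misses a stream into which the item was successfully inserted.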


Abstract

The invention discloses a multiple cuckoo filter under a streaming computing model. The multiple cuckoo filter is mainly composed of a plurality of standard cuckoo filters whose number equals the total number of data streams. A standard cuckoo filter is set for each data stream; each standard cuckoo filter processes its own data stream, so that the representation and query of the multiple data sets of the data streams are decomposed into the representation and query of a plurality of single data sets. A sliding window is established for each cuckoo filter, all standard cuckoo filters perform filtering queries at the same time, and the sliding windows segment the data streams along time boundaries in order to query whether the same specified objects exist in different data streams at the same time. The multiple cuckoo filter inherits the advantages of the cuckoo filter, greatly reduces the number of operations and the space occupancy when processing big data streams, and lowers the false positive rate, which facilitates accurate data queries.
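The sliding-window query described in the abstract can be sketched as below. This is only an illustration under stated assumptions: an exact multiset (`ExactMembership`) stands in for each standard cuckoo filter so the example stays short, relying on the fact that a cuckoo filter offers the same three operations (insert, delete, lookup). The names `WindowedStream`, `WindowedMultiFilter`, and `co_occurs`, the time-based window, and the requirement that timestamps arrive in non-decreasing order are all hypothetical choices, not details taken from the patent.

```python
from collections import Counter, deque


class ExactMembership:
    """Stand-in for a standard cuckoo filter: same operations, but exact (assumption)."""

    def __init__(self):
        self.counts = Counter()

    def insert(self, item):
        self.counts[item] += 1

    def delete(self, item):
        self.counts[item] -= 1
        if self.counts[item] <= 0:
            del self.counts[item]

    def contains(self, item):
        return item in self.counts


class WindowedStream:
    """One data stream with its own filter and its own sliding window."""

    def __init__(self, window):
        self.window = window          # window length in time units
        self.events = deque()         # (timestamp, item), oldest first
        self.members = ExactMembership()

    def _expire(self, now):
        # Drop every event that has fallen behind the window's time boundary.
        while self.events and self.events[0][0] <= now - self.window:
            _, old_item = self.events.popleft()
            self.members.delete(old_item)

    def insert(self, item, now):
        self._expire(now)
        self.events.append((now, item))
        self.members.insert(item)

    def contains(self, item, now):
        self._expire(now)
        return self.members.contains(item)


class WindowedMultiFilter:
    """Queries every stream's filter for the same object at the same time."""

    def __init__(self, stream_ids, window):
        self.streams = {sid: WindowedStream(window) for sid in stream_ids}

    def insert(self, stream_id, item, now):
        self.streams[stream_id].insert(item, now)

    def co_occurs(self, item, now):
        return [sid for sid, s in self.streams.items() if s.contains(item, now)]


multi = WindowedMultiFilter(["a", "b", "c"], window=10)
multi.insert("a", "x", now=0)
multi.insert("b", "x", now=5)
print(multi.co_occurs("x", now=8))   # ['a', 'b']
print(multi.co_occurs("x", now=12))  # ['b']  (stream a's event at t=0 has expired)
```

Deleting expired items is what lets the window slide; this is one reason a cuckoo filter, which supports deletion, fits this role better than a plain Bloom filter, which does not.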

Description

Technical field

[0001] The invention relates to a cuckoo filter in the field of computer big data, and in particular to a multiple cuckoo filter for indexing massive multi-dimensional data under a streaming computing model.

Background

[0002] With the rapid development of the mobile Internet, Web 2.0, smart devices, and related industries, the amount of data generated by humans is growing exponentially. Massive data increasingly shows big data characteristics such as huge scale, diverse types, and high-speed flow. The multi-dimensional nature of data is becoming more and more pronounced, and the storage of massive multi-dimensional data, real-time calculation and analysis, and large-scale data indexing and searching pose severe challenges to information systems.

[0003] Different from low-dimensional data, multi-dimensional data enables the system to record a large amount of comprehensive information and provide users with richer services through ...


Application Information

Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/22, G06F16/2455
Inventor 范小朋, 吴梦露
Owner HANGZHOU INST OF ADVANCED TECH CHINESE ACAD OF SCI