Searching method based on classified file BloomFilter structure

A query method and collection technology, applied in the field of distributed computing, can solve the problems of different operation costs, no consideration of query costs, lack of query costs, etc.

Inactive Publication Date: 2006-02-22
HUNAN UNIV
View PDF0 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] However, the current Bloom Filter structure does not consider the query cost of the collection. They think that the I / O operation cost of the collection elements is the same when querying. In practice, the elements in the collection are invalid due to query (false positives occur), Query invalidation cost of elements Due to the different roles and positions of elements in the collection, the extra operation cost when querying invalid is not the same
Because the previous Bloom Filter structure did not consider the query cost, they treated the elements in the collection uniformly, assigned the same number of hash mapping functions to each element, and the query failure rate of each element was the same, resulting in a comparison of the overall query cost of the collection. high
The lack of query cost consideration and the lack of differentiated treatment of elements are common drawbacks in the current Bloom Filter, which often consumes more resources in current practical applications.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Searching method based on classified file BloomFilter structure
  • Searching method based on classified file BloomFilter structure
  • Searching method based on classified file BloomFilter structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0127] This embodiment is an application of BBF in a system using a file agent.

[0128] image 3 Is the use of proxy file system. The bottom layer is the workstation host, the middle layer is the file agent, and finally the ultimate network file server. Agents at the same level adopt a cooperative method, and regularly express their stored file directories in BBF into summaries and then pass them on to other agents to maintain the consistency of file lists on the network. In this system, due to security reasons, the workstation host will update the operating system patch or update the virus library of the anti-virus software from time to time, and at the same time, there may be some common file exchanges between workstations. Therefore, from the perspective of security, we divide the files into four categories according to their impact on the machine: critical system patch files, virus database update files, system running files, and ordinary user files. When a serious sec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

This invention discloses one requiring method based on bloom filter structure, which divides different prices of different elements into different sub set and through looking each sub set lowest failure rate relationship establishes each sub set lowest positive rate to represent set integration lowest requirement invalid cost aim function. The invention uses category function step inherit formula to get each optimized Hars function number to fulfill set mapping and finding to vector.

Description

technical field [0001] The invention is a query method based on a graded Bloom Filter structure that supports set query starting from cost, belongs to the technical field of distributed computing, and in particular relates to applications that generate a large amount of data in a distributed system and require interactive query. Background technique [0002] In recent years, with the rapid development of computers, the scale of data collections in databases, networks and other applications has grown geometrically. Collection element queries are the most common operations on data collections. When the collection becomes larger and larger, access and representation become more and more difficult. How to represent large data collections and complete queries under large data collections has become a challenge for academic circles at home and abroad. It is an urgent need to design a simplified data structure representation and support data query of large collections. [0003] B...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 谢鲲张大方闵应骅谢高岗文吉刚
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products