A method and system for implementing a string hash table

A method and system for implementing a string hash table, applied in the computer field, that addresses problems such as the performance bottlenecks of string hash tables

Active Publication Date: 2021-09-21
INST OF COMPUTING TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

[0006] The present invention proposes a string hash table implementation method that stores character strings of different lengths in multiple heterogeneous hash tables, which addresses the performance bottleneck of string hash tables in the prior art.
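As a rough illustration of routing strings to different tables by length, the following minimal sketch may help. The thresholds `SHORT_MAX` and `MEDIUM_MAX`, the class name `LengthRoutedTable`, and the use of plain dictionaries as stand-ins for the heterogeneous tables are all assumptions made for this example; the source does not give concrete length boundaries.

```python
from typing import Dict, Optional

# Hypothetical length thresholds in bytes; the source gives no concrete values.
SHORT_MAX = 16
MEDIUM_MAX = 64


class LengthRoutedTable:
    """Routes each key to one of several sub-tables based on its byte length."""

    def __init__(self) -> None:
        # Plain dicts stand in for the heterogeneous hash tables (assumption).
        self.tables: Dict[str, Dict[bytes, int]] = {
            "short": {},
            "medium": {},
            "long": {},
        }

    @staticmethod
    def route(key: bytes) -> str:
        """Name the sub-table responsible for a key of this length."""
        n = len(key)
        if n <= SHORT_MAX:
            return "short"
        if n <= MEDIUM_MAX:
            return "medium"
        return "long"

    def put(self, key: str, value: int) -> None:
        raw = key.encode("utf-8")
        self.tables[self.route(raw)][raw] = value

    def get(self, key: str) -> Optional[int]:
        raw = key.encode("utf-8")
        return self.tables[self.route(raw)].get(raw)
```

Because a key's length never changes, a lookup consults exactly one sub-table, so each sub-table can be tuned for its own length range without affecting the others.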


Examples

Embodiment Construction

[0024] In order to make the object, technical solution and advantages of the present invention clearer, the implementation method of the string hash table proposed by the present invention will be further described in detail below in conjunction with the accompanying drawings. It should be understood that the specific implementation methods described here are only used to explain the present invention, and are not intended to limit the present invention.

[0025] While developing the string hash table, the inventors found that existing string hash tables had a performance bottleneck, so they tested the performance of strings in different hash tables, as shown in Figure 1. Among them, link_map is the hash table implementation of libstdc++, which uses the traditional chained hashing method; array_map builds on chained hashing by aggregating and storing the strings of a conflict chain together to optimize cache performance; linear_map is an open-addressing hash table that uses a linear probin...
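For readers unfamiliar with the open-addressing baseline, a minimal sketch of a linear-probing table is given here. It is not the linear_map implementation that was benchmarked; the class name `LinearProbeMap`, the 0.5 load-factor limit, and the doubling growth policy are assumptions chosen for illustration.

```python
from typing import List, Optional, Tuple


class LinearProbeMap:
    """Open-addressing hash table: collisions step to the next slot in the array."""

    def __init__(self, capacity: int = 8) -> None:
        self.slots: List[Optional[Tuple[str, int]]] = [None] * capacity
        self.size = 0

    def _probe(self, key: str) -> int:
        """Return the index holding key, or the first empty slot on its probe path."""
        cap = len(self.slots)
        i = hash(key) % cap
        while self.slots[i] is not None and self.slots[i][0] != key:
            i = (i + 1) % cap  # linear probing: try the neighbouring slot
        return i

    def _grow(self) -> None:
        """Double the array and reinsert every entry."""
        old = [entry for entry in self.slots if entry is not None]
        self.slots = [None] * (len(self.slots) * 2)
        self.size = 0
        for key, value in old:
            self.put(key, value)

    def put(self, key: str, value: int) -> None:
        # Keep the load factor at or below 0.5 so a probe always finds a free slot.
        if (self.size + 1) * 2 > len(self.slots):
            self._grow()
        i = self._probe(key)
        if self.slots[i] is None:
            self.size += 1
        self.slots[i] = (key, value)

    def get(self, key: str) -> Optional[int]:
        entry = self.slots[self._probe(key)]
        return entry[1] if entry is not None else None
```

Unlike a chained table, all entries live in one contiguous array, so a probe sequence touches adjacent slots rather than following pointers.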

Abstract

The invention relates to a method for implementing a string hash table, comprising: distributing each character string to a corresponding hash table according to the length of the string, wherein the hash tables include an array hash table, a numerical hash table and a character double hash table. The invention stores character strings in several heterogeneous hash tables and selects an appropriate hash table for each string length. Short strings are divided into several fixed-length intervals to improve memory space utilization, and, taking advantage of the variable length of the strings, 1 byte is reserved at the end of the hash slot of each interval to store metadata in place. Long strings use a two-level hash table structure: the first-level hash table computes the string hash from only part of the prefix, which reduces the cost of computing the hash value, and the second-level hash table serves as the conflict-chain storage table of the first-level table, which resolves the increase in conflicts caused by the simplified hash computation of the first-level table.
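The two mechanisms in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the slot widths in `SLOT_WIDTHS`, the meaning given to the trailing metadata byte (key length plus one, with 0 marking an empty slot), the prefix length `PREFIX_LEN`, and the use of a Python dict as the second-level table are all hypothetical choices.

```python
from typing import Dict, List, Optional

# Hypothetical slot widths in bytes; the last byte of every slot is metadata.
SLOT_WIDTHS = (8, 16, 24)


def slot_width_for(key: bytes) -> int:
    """Pick the narrowest slot whose payload area (width - 1 bytes) fits the key."""
    for width in SLOT_WIDTHS:
        if len(key) <= width - 1:
            return width
    raise ValueError("key is too long for the short-string slots")


def pack_slot(key: bytes) -> bytes:
    """Lay out key + zero padding + 1 metadata byte (assumed: len(key) + 1)."""
    width = slot_width_for(key)
    padding = bytes(width - 1 - len(key))
    return key + padding + bytes([len(key) + 1])


def unpack_slot(slot: bytes) -> Optional[bytes]:
    """Return the stored key, or None when the metadata byte marks the slot empty."""
    meta = slot[-1]
    if meta == 0:
        return None
    return slot[: meta - 1]


class PrefixTwoLevelTable:
    """First level hashes only a key prefix; second level holds the colliding keys."""

    PREFIX_LEN = 8  # hypothetical number of prefix bytes fed to the first-level hash

    def __init__(self, buckets: int = 64) -> None:
        self.level1: List[Optional[Dict[bytes, int]]] = [None] * buckets

    def bucket_of(self, key: bytes) -> int:
        # Only the prefix is hashed, so the cost does not grow with key length.
        return hash(key[: self.PREFIX_LEN]) % len(self.level1)

    def put(self, key: bytes, value: int) -> None:
        i = self.bucket_of(key)
        if self.level1[i] is None:
            self.level1[i] = {}  # second-level table, created on first use
        self.level1[i][key] = value

    def get(self, key: bytes) -> Optional[int]:
        second = self.level1[self.bucket_of(key)]
        return None if second is None else second.get(key)
```

Keys that share a prefix, such as URLs on the same host, land in the same first-level bucket by construction; the second-level table is what keeps lookups among them from degrading into a long linear scan.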

Description

Technical field

[0001] The invention belongs to the technical field of computers, and in particular relates to a method and system for implementing a string hash table.

Background technique

[0002] The hash table is a classic data structure that is widely used in many areas of computing. In database applications it is one of the core data structures, supporting aggregation, data deduplication, and join operations. With the explosive growth of Internet data, the proportion of character data in databases has gradually increased, including user browsing logs, web page metadata, and so on. Because such data is variable in length, it presents new challenges for hash table design.

[0003] Current hash table implementations for character strings fall mainly into two categories: (1) storing pointers as hash keys in hash tables implemented with open addressing or chaining. This type...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F16/22
CPC: G06F16/2255, G06F16/2282
Inventor: 郑天祺, 程学旗, 李冰, 王征, 张志斌, 刘悦, 赵鹏, 郭嘉丰
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI