Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

A method of deduplication when data without business primary key is stored in database

A business primary key and business data technology, applied in database indexing, electronic digital data processing, structured data retrieval, etc., can solve the problems of externally input data without a business primary key and low efficiency, and achieve fast, comprehensive, efficient and accurate query deduplication , improve efficiency, and quickly query the effect of deduplication

Active Publication Date: 2021-12-31
政采云有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Usually, when designing the database table structure, the business primary key field is designed, and the uniqueness of the data is judged through the business primary key field, but sometimes some externally input data does not have a business primary key, and we need to judge before storing these data Whether the same data already exists to determine the subsequent processing method. When the business primary key does not exist, it is necessary to query whether there is the same data by using each field of the data as a query condition. In this way, the amount of data in the table is very large The efficiency will be very low, especially when the stored fields are not suitable for adding database indexes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method of deduplication when data without business primary key is stored in database
  • A method of deduplication when data without business primary key is stored in database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention.

[0037] Such as Figure 1-2 As shown, a method for deduplicating data without a business primary key when storing the database includes the following steps:

[0038] S1: Receive external data, import business data outside the original database system into the original database through the database receiving module, to ensure the speed of importing business data;

[0039] S2: Data field conversion, through the data conversion module, the external business data imported into the database is spliced ​​into strings according to the rules, so as to ensure that each piece of business data is spliced ​​into strings according to the same rules;

[0040] S3: Data string opera...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for deduplicating data without a business primary key when storing a database, which is characterized in that it comprises the following steps: a data conversion module splices external business data imported into the database into character strings, and a data hash operation module splices and combines The resulting string is calculated using the sha256 algorithm to obtain a byte array. The message digest conversion module converts the message digest in byte array format into a string H1. The message digest hash module performs hash operations on the converted string H1 again to obtain a For the integer value H2, the deduplication processing module mainly uses the values ​​H1 and H2 obtained by the two hash operations in the above steps as conditions to query the database. In the present invention, the method for deduplicating the non-business primary key data when storing the database adopts the characteristic of extremely low collision rate according to the result of the message digest algorithm, and can judge whether the data are equal by only comparing two fields, and effectively utilizes the database index to Improve efficiency.

Description

technical field [0001] The invention relates to the technical field of database query deduplication, in particular to a method for deduplication of data without a business primary key when storing databases. Background technique [0002] Usually, when designing the database table structure, the business primary key field is designed, and the uniqueness of the data is judged through the business primary key field, but sometimes some externally input data does not have a business primary key, and we need to judge before storing these data Whether the same data already exists to determine the subsequent processing method. When the business primary key does not exist, it is necessary to query whether there is the same data by using each field of the data as a query condition. In this way, the amount of data in the table is very large The efficiency will be very low, especially when the stored fields are not suitable for adding database indexes. Contents of the invention [00...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/215G06F16/22
CPCG06F16/215G06F16/2255
Inventor 杨建华陈洁
Owner 政采云有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products