Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Method for duplicate removal of service-free primary key data during database storage

A business primary key and business data technology, which is applied in database indexing, electronic digital data processing, structured data retrieval, etc., can solve problems such as low efficiency and externally input data without a business primary key, so as to improve efficiency, be fast, comprehensive, efficient and accurate Query the effect of deduplication

Active Publication Date: 2019-10-22
政采云有限公司
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Usually, when designing the database table structure, the business primary key field is designed, and the uniqueness of the data is judged through the business primary key field, but sometimes some externally input data does not have a business primary key, and we need to judge before storing these data Whether the same data already exists to determine the subsequent processing method. When the business primary key does not exist, it is necessary to query whether there is the same data by using each field of the data as a query condition. In this way, the amount of data in the table is very large The efficiency will be very low, especially when the stored fields are not suitable for adding database indexes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for duplicate removal of service-free primary key data during database storage
  • Method for duplicate removal of service-free primary key data during database storage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention.

[0037] Such as Figure 1-2 As shown, a method for deduplicating data without a business primary key when storing the database includes the following steps:

[0038] S1: Receive external data, import business data outside the original database system into the original database through the database receiving module, to ensure the speed of importing business data;

[0039] S2: Data field conversion, through the data conversion module, the external business data imported into the database is spliced ​​into strings according to the rules, so as to ensure that each piece of business data is spliced ​​into strings according to the same rules;

[0040] S3: Data string opera...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for duplicate removal of service-free primary key data during database storage. The method comprises the following steps: the data conversion module splices the external service data imported into the database into a character string; the data hash operation module calculates the spliced and combined character strings by using a sha256 algorithm to obtain a byte array; the message digest conversion module converts the message digest in the byte array format into a character string H1; the message digest hash module performs hash operation on the character string H1 converted from the message digest again to obtain an integer value H2, and the duplicate removal processing module queries a database mainly by taking the values H1 and H2 obtained by two times of hash operation in the above steps as conditions. According to the method for duplicate removal of service-free primary key data during database storage, whether the data are equal or not can be judged by comparing only two fields according to the characteristic that the collision rate of a message digest algorithm result is extremely low, and the database index is effectively utilized to improvethe efficiency.

Description

technical field [0001] The invention relates to the technical field of database query deduplication, in particular to a method for deduplication of data without a business primary key when storing databases. Background technique [0002] Usually, when designing the database table structure, the business primary key field is designed, and the uniqueness of the data is judged through the business primary key field, but sometimes some externally input data does not have a business primary key, and we need to judge before storing these data Whether the same data already exists to determine the subsequent processing method. When the business primary key does not exist, it is necessary to query whether there is the same data by using each field of the data as a query condition. In this way, the amount of data in the table is very large The efficiency will be very low, especially when the stored fields are not suitable for adding database indexes. Contents of the invention [00...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/215G06F16/22
CPCG06F16/215G06F16/2255
Inventor 杨建华陈洁
Owner 政采云有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products