Implementation method and implementation device for data duplication elimination query

An implementation method and data technology, applied in the field of database storage, can solve problems such as inability to deduplicate queries in database clusters, and the efficiency of eliminating duplicate queries is not very ideal, so as to avoid double calculations and improve efficiency

Inactive Publication Date: 2013-11-20
DAWNING INFORMATION IND BEIJING
View PDF4 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the problems in the related technologies that the elimination of repeated queries of arbitrary data columns cannot be solved, and that there is double calculation when the query results are summarized, resulting in the unsatisfactory efficiency of eliminating repeated queries, the present invention proposes a method and realization of data deduplication query The device can solve the problem that existing related technologies cannot perform de-duplication query on large-scale data on the database cluster, and realize de-duplication query of any data column

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Implementation method and implementation device for data duplication elimination query
  • Implementation method and implementation device for data duplication elimination query
  • Implementation method and implementation device for data duplication elimination query

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039]The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention belong to the protection scope of the present invention.

[0040] According to an embodiment of the present invention, a method for implementing data deduplication query is provided.

[0041] Such as figure 1 As shown, the implementation method of data deduplication query according to the embodiment of the present invention includes:

[0042] Step S101, querying each of the multiple database nodes to obtain query results, wherein, for each database node that has obtained multiple query results, the multiple query results obtained from the query of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an implementation method and an implementation device for data duplication elimination query. The implementation method comprises the following steps: querying all database nodes in a plurality of database nodes to obtain query results, wherein for all the database nodes querying the query results, the query results queried by the database nodes are subjected to the duplication elimination operation, and the query results subjected to the duplication elimination operation are taken as the query results of the database nodes; merging the query results of the database nodes. The implementation method and the implementation device achieve the duplication elimination query operation of large-scale data in a database cluster, avoid the condition that duplication elimination query operation is confined to the duplication elimination division column, achieve the duplication elimination query to any data columns, and in addition, avoid the problem of repeated calculation in duplication elimination query, and improve the duplication elimination query efficiency.

Description

technical field [0001] The invention relates to the field of database storage, in particular to a method and device for implementing data deduplication query. Background technique [0002] Eliminating duplicate records is a common query operation type in current database systems, and this type of query is usually also called deduplicated query. For example, a database application system usually needs to list all different records, or count different records, or count the number of different records. [0003] In a single database system, currently mature methods for eliminating duplication mainly include sorting and merging methods and hash merging methods. However, in a database cluster composed of multiple independent database systems, duplicate records may be distributed on different database servers, and due to the overhead of network transmission and communication between database nodes, the cross-node data deduplication query is increased. Due to the difficulty of pro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 宋怀明王勇苗艳超刘新春邵宗有
Owner DAWNING INFORMATION IND BEIJING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products