Multi-core synchronization data query optimization method based on column storage

A data query and optimization method technology, applied in the direction of electrical digital data processing, special data processing applications, multi-programming devices, etc., can solve the impact of output results, the impact of bad nodes, and the parallel execution of query tasks without too much consideration. problem, to shorten return time and improve efficiency

Inactive Publication Date: 2013-09-25
XIDIAN UNIV
View PDF2 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, there are still some problems in these studies: the distributed parallel data query optimization technology utilizes the distributed system architecture to improve the efficiency of parallel query, but the shortcomings of the architecture also make this method susceptible to the influence of individual bad nodes, thus af

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-core synchronization data query optimization method based on column storage
  • Multi-core synchronization data query optimization method based on column storage
  • Multi-core synchronization data query optimization method based on column storage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0066] In this embodiment, the present invention is described by using a 4-core processor and inquiring about a student information table stuInfo in a university database.

[0067] figure 2 shows the logical structure diagram of the student information table, image 3 A column storage structure diagram of the student information table is shown.

[0068] Step 1, read the sql query statement.

[0069] It should be noted that, in this embodiment, the following simple query is preset to stuInfo: find out the names, genders and specific ages of students in a college whose age is younger than a certain fixed value. The SQL statement for this query is as follows.

[0070] Select name,sex,age

[0071] from stuInfo

[0072] where age<24 and college='Computer Science'

[0073] Step 2, generate query plan: for the query statement input by the user, use the delayed materialization strategy to generate a query plan for the query statement.

[0074] The delayed materialization strat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-core synchronization data query optimization method based on column storage. A map-reduce execution flow is constructed aiming at a query plane, and is composed of a plurality of sub-map-reduce flows; the query operation of a database is finished through the execution of the plurality of sub-map-reduce flows; when the plurality of sub-map-reduce flows are executed, a map-reduce model is responsible for distributing each mapping task to different cores and executing, each core is used for orderly determining input-output key value pair of the mapping function and simplifying function according to the operation in the distributed mapping task so as to finish the execution of the query plan. The feature that the tasks can be executed synchronously on the multi-core processor is well utilized, the return time of the great data query result is shortened, and the data query efficiency is improved.

Description

technical field [0001] The present invention relates to the fields of large data parallel computing and column storage-related architecture data storage and query in the field of computers, in particular to a multi-core parallel data query optimization method based on column storage, in which a mapping-simplification parallel programming model is used to realize a database under a multi-core environment Partial parallelization of query tasks. Background technique [0002] Data query refers to the process and technology of storing the selected, organized and evaluated data into the database, and querying the accurate data that can meet the needs of users from the database. It is one of the core issues of the database system. With the development of Internet technology, data is growing at an unimaginable rate. In the face of these big data, how to optimize the data query and improve the query efficiency of the database is an important research direction in the database field....

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F9/46
Inventor 刘慧刘志镜袁通王韦桦邱龙滨强波曹志高王梓赵宏伟李雨楠姚勇屈鉴铭张璐萱张慧泽徐曾
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products