Hive data processing method and device

A data processing and target data technology, applied in the field of data processing, can solve problems such as not being able to satisfy users

Inactive Publication Date: 2016-03-23
INSPUR GROUP CO LTD
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Hive's SQL-like language brings a lot of convenience to data mining workers. Massive data can be analyzed through simple SQL statements, but the existing functions provided by Hive are only the extraction, conversion and loading functions of massive data. The convenience of query requires sorting the queried data, and the functions provided by the existing Hive cannot meet the needs of users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hive data processing method and device
  • Hive data processing method and device
  • Hive data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0022] The embodiment of the present invention provides a method for Hive data processing, such as figure 1 shown, including:

[0023] Step 101, acquire a data request message.

[0024] Wherein, the data request message carries information related to the sorting of the target data.

[0025] It should be noted that the sorting-related information of the data carried in the data request message may be information indicating how to sort which data. For example, inf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An embodiment of the invention provides a Hive data processing method and device and relates to the technical field of data processing. By means of the Hive data processing method and device, a user can sequence inquired data, demands of the user are met and the user experiment is improved. The method comprises steps as follows: a data request message which carries related sequencing information of target data is acquired; the target data are obtained through Hive according to the data request message; a target sequencing function is determined from Hive according to the data request message, and the target data are sequenced according to the target sequencing function; sequencing rules of the data are recorded in the sequencing function; the sequenced target data are output through Hive. The Hive data processing method and device are applicable to data sequencing scenes in Hive.

Description

technical field [0001] The present invention relates to the technical field of data processing, in particular to a method and device for Hive data processing. Background technique [0002] With the massive increase of data, a single computer can no longer store massive data. Therefore, distributed clusters have received extensive attention. In a distributed cluster, data can be distributed to multiple computers for storage and distributed computing can be implemented. Hadoop is the infrastructure of distributed systems. Users can develop distributed programs without knowing the underlying details of the distribution, and make full use of the capabilities of cheap computer clusters to perform high-speed calculations and storage of data. [0003] Hive is a data warehouse tool for Hadoop. It can map structured data files into a data table, provide a complete structured query language (SQL, StructuredQuery Language) query function, and convert SQL statements into MapReduce task...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2471G06F16/245G06F16/283G06F2216/03
Inventor 宗栋瑞郭美思
Owner INSPUR GROUP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products