Unlock instant, AI-driven research and patent intelligence for your innovation.

Hive-based data aging method, device and apparatus

A technology of data and data areas, applied in database indexing, database models, electrical digital data processing, etc., can solve problems such as complex screening, low accuracy and reliability, and disk space occupation

Pending Publication Date: 2019-03-12
HANGZHOU DT DREAM TECH
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the physical disk on which the distributed file system depends is full, not only the new data cannot be written, but also the calculation of the original data needs to occupy a certain amount of disk space due to the problem of temporary file generation, which also leads to great impact. Impact
[0004] At present, when the storage space is full, if the hardware resources cannot be increased, the space can only be released by manually selecting the storage table for drop (a deletion operation), but for a distributed file system as a big data warehouse , Manually select which tables to drop from the massive data tables, not only the screening is more complicated, but also the data that is still in use may be deleted by mistake, the accuracy and reliability are low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hive-based data aging method, device and apparatus
  • Hive-based data aging method, device and apparatus
  • Hive-based data aging method, device and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The core of the present invention is to provide a hive-based data aging method, which can realize automatic scanning of hive data areas, aging judgment and deletion of aging areas, with high efficiency, higher accuracy and reliability; another aspect of the present invention The object is to provide a device, device and computer-readable storage medium based on the above method.

[0036] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data aging method based on hive, which comprises the following steps: when receiving read-write access to data area of hive, refreshing access time record of data area corresponding to read-write access in metadata database of hive; Judging whether the preset scanning condition is satisfied at present; If the preset scanning condition is satisfied, the access time recordsof all the data areas of the hive are scanned, and the data area satisfying the preset aging condition is taken as the aging area, and the contents of the aging area are deleted. The invention can realize automatic scanning and aging judgment of hive data area and deletion of aging area, and has high efficiency, accuracy and reliability. Another object of the present invention is to provide an apparatus, device and computer-readable storage medium based on the method described above.

Description

technical field [0001] The invention relates to the technical field of data aging processing, in particular to a hive-based data aging method. The invention also relates to a hive-based data aging device, equipment and computer-readable storage medium. Background technique [0002] Hive is a data warehouse tool based on Hadoop, which can map structured data files into a database table, and provides a simple SQL query function, which can convert SQL statements into distributed computing tasks for execution. [0003] At present, when hive is used for big data processing, due to the huge storage capacity of big data itself, especially in many actual production systems, there is a large amount of new data every day, and the increasingly expanding data has a negative impact on the storage resources of the system. huge challenge. When the physical disk on which the distributed file system depends is full, not only the new data cannot be written, but also the calculation of the o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22G06F16/28G06F16/182
Inventor 郑艳涛袁益梦林锋
Owner HANGZHOU DT DREAM TECH