Processing method and processing device of mass data

A method and technique for processing massive data, applied in the field of data processing. It addresses two problems: databases cannot meet high efficiency requirements, and conventional file I/O struggles with massive data and large files. It achieves high access efficiency and memory savings.

Active Publication Date: 2009-09-16
NAVINFO
Cites: 1; Cited by: 32

AI Technical Summary

Problems solved by technology

[0006] Method 1) is a popular way to manage mass data. Although a database has certain advantages in managing large files and mass data, its biggest bottleneck with mass data is efficiency: for systems with relatively high efficiency requirements, a database cannot deliver the required performance.
[0007] Method 2) only works for small data volumes and cannot meet the storage requirements of massive data.
[0008] Fast access to large data files has therefore increasingly become a technical bottleneck in the industry, and the traditional approach of reading and writing files through I/O struggles to meet the requirements of massive data and large files.

Method used



Examples


Embodiment Construction

[0040] To make the technical problems to be solved, the technical solutions, and the advantages of the present invention clearer, a detailed description follows with reference to the drawings and specific embodiments.

[0041] Figure 1 is a schematic flowchart of a massive data processing method in an embodiment of the present invention. As shown in Figure 1, the massive data processing method of this embodiment comprises:

[0042] Step 101: set up a data file and an index file. The data file is used to store data objects and includes at least one file data block; within the data file, all file data blocks have equal length. The index file corresponds to the data file and includes the address offset, within the data file, of each data object stored in it.
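The layout set up in Step 101 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the block length of 4096 bytes, the 64-bit little-endian offset encoding, and the function names `new_data_file`, `encode_index`, and `decode_index` are all assumptions introduced here. The source only requires that every file data block has the same length and that the index file holds one address offset per stored object.

```python
import struct

BLOCK_SIZE = 4096      # assumed value; the source only says all blocks are equal in length
OFFSET_FORMAT = "<Q"   # assumed encoding: one unsigned 64-bit little-endian offset per object


def new_data_file(block_count=1, block_size=BLOCK_SIZE):
    """Return the bytes of a fresh data file made of equal-length, zero-filled blocks."""
    if block_count < 1:
        raise ValueError("the data file includes at least one file data block")
    return bytearray(block_count * block_size)


def encode_index(offsets):
    """Serialize the address offsets of the stored objects, in storage order."""
    return b"".join(struct.pack(OFFSET_FORMAT, off) for off in offsets)


def decode_index(raw):
    """Recover the list of address offsets from the bytes of an index file."""
    return [off for (off,) in struct.iter_unpack(OFFSET_FORMAT, raw)]
```

A fixed-width encoding is one possible choice: it lets the offset of the n-th object be located in the index file by simple arithmetic, without scanning.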

[0043] Step 102: when storing a data object into the data file, determine whether the remaining space of the file data block currently pointed to by the cursor in t...
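The text of Step 102 is cut off here, so the following is only a hedged sketch of the space check it begins to describe, filled in from the Abstract of this document: if the object does not fit in the remaining space, one or more new equal-length blocks are requested. The function name `blocks_to_request` and its parameters are hypothetical, and the sketch assumes that blocks are allocated on demand, so the free space of the current block is the gap between the cursor and the end of the allocated file.

```python
def blocks_to_request(object_size, cursor, allocated_size, block_size):
    """Return how many new equal-length blocks are needed to store an object.

    Assumption: blocks are only allocated on demand, so the space between
    the cursor and the end of the allocated file is the free space of the
    block the cursor currently points to.
    """
    free = allocated_size - cursor
    if object_size <= free:
        return 0                       # the object fits in the current block
    overflow = object_size - free      # bytes left over after filling the free space
    return -(-overflow // block_size)  # ceiling division
```

For example, with 8-byte blocks, one allocated block, and the cursor at offset 5, a 10-byte object leaves 7 bytes over after the 3 free bytes are filled, so one new block is requested.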


Abstract

The invention provides a method and a device for processing mass data. The method comprises the following: a data file and an index file are set up, wherein the data file comprises at least one file data block and all file data blocks are of equal length; the index file corresponds to the data file and comprises the address offset of each data object in the data file. When a data object is stored, if the free space of the file data block currently pointed to by a cursor is not enough to hold the data object, the free space is first filled with part of the data object, one or more new file data blocks are then requested from the system for the data file, the remaining data of the object that has not yet been stored is written into the new file data blocks, and the address offset of the newly stored data object is recorded in the index file. A stored data object is read from the data file by pointing the cursor to the address offset of the data object to be read. Applying this technical solution improves the access efficiency for mass data.
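The store and read paths summarized in the abstract can be sketched end to end as below. This is an illustration under stated assumptions, not the patented implementation: the class name `BlockStore`, the 4096-byte default block length, the 64-bit offset encoding, and growing the file by appending zero bytes are all choices made here. The source also does not say how an object's length is known at read time; this sketch assumes it is derived from the next object's offset, or from the write cursor for the last object.

```python
import os
import struct


class BlockStore:
    """Hypothetical sketch: a data file grown in equal-length blocks plus an offset index."""

    def __init__(self, data_path, index_path, block_size=4096):
        self.block_size = block_size     # assumed default; the source fixes no value
        self.index_path = index_path
        self.offsets = []                # in-memory copy of the index file
        self.cursor = 0                  # next write position in the data file
        self.data = open(data_path, "w+b")
        self._grow(1)                    # the data file holds at least one block
        open(index_path, "wb").close()   # start with an empty index file

    def _grow(self, n_blocks):
        # Stand-in for requesting new blocks from the system: extend with zero bytes.
        self.data.seek(0, os.SEEK_END)
        self.data.write(b"\x00" * (n_blocks * self.block_size))

    def _allocated(self):
        self.data.seek(0, os.SEEK_END)
        return self.data.tell()

    def store(self, payload):
        offset = self.cursor
        free = self._allocated() - offset
        head, tail = payload[:free], payload[free:]
        self.data.seek(offset)
        self.data.write(head)            # fill the free space of the current block
        if tail:
            self._grow(-(-len(tail) // self.block_size))  # ceiling division
            self.data.seek(offset + len(head))
            self.data.write(tail)        # remainder goes into the new block(s)
        self.cursor = offset + len(payload)
        self.offsets.append(offset)
        with open(self.index_path, "ab") as idx:
            idx.write(struct.pack("<Q", offset))  # record the offset in the index file
        return len(self.offsets) - 1     # position of the object in the index

    def read(self, object_id):
        start = self.offsets[object_id]
        # Assumption: an object ends where the next one starts (or at the cursor).
        if object_id + 1 < len(self.offsets):
            end = self.offsets[object_id + 1]
        else:
            end = self.cursor
        self.data.seek(start)            # point the cursor at the recorded offset
        return self.data.read(end - start)

    def close(self):
        self.data.close()
```

A typical use would create a `BlockStore` over two paths, call `store` with serialized objects, and later pass the returned identifiers to `read`. Because objects are packed back to back, the file only grows by whole blocks when an object overflows the current one.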

Description

Technical field

[0001] The invention relates to the field of data processing, and in particular to a method and device for processing massive data.

Background

[0002] In the data processing industry, more and more data needs to be processed, and data files keep growing. At present, approaches to accessing massive data generally come down to the following two methods:

[0003] 1) Put the massive data into a database for management;

[0004] 2) Create the data objects, serialize them, and store them in ordinary files.

[0005] In the process of realizing the present invention, the inventors found that the prior art has at least the following problems:

[0006] Method 1) is a popular way to manage mass data. Although a database has certain advantages in managing large files and mass data, its biggest bottleneck with mass data is efficiency. For systems with high efficie...

Claims


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 石清华, 刘盛理, 徐晋晖
Owner: NAVINFO