Mass data processing method based on files

A technology of data processing and processing methods, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of large data volume, difficulty, and low fault tolerance, so as to ensure data security and stability , to ensure the effect of efficiency

Active Publication Date: 2012-12-19
WUHAN TIANYU INFORMATION IND
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] 1. The amount of data is too large, tens of millions or even hundreds of millions of data. In these data, there may be random data format errors, which will lead to great difficulties in system design;
[0004] 2. The requirements for software and hardware are high. For the processing of massive data, the system resources occupied are high. If the system resources of software and hardware are allocated reasonably, it is also a major problem in the processing of massive data;
[0006] 4. Transaction management of massive data. During data processing, if the data involved is processing a transaction, it is necessary to ensure the transaction control of the database. As the amount of data increases, it is necessary to ensure that a large amount of data is processed in the same database transaction. a rather difficult question
[0007] 5. The processing program of massive data, once designed, cannot be reused, and often can only be applied to one industry or a certain project, wasting a lot of manpower and material resources
[0008] In traditional mass data processing, after using high-configuration servers and enhancing CPU processing performance and memory capacity, there are still some problems that cannot be solved, such as low fault tolerance, unreasonable resource allocation, inconsistent transaction management, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mass data processing method based on files
  • Mass data processing method based on files

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to better understand the present invention, the invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0026] In the file-based mass data processing method of the present invention, the mass data received by the data processing system is first grouped into files, and then concurrently processed by multi-threads (processes). Such as figure 1 As shown, the data processing system of this embodiment includes a hardware environment consisting of two data processing computers and a disk cabinet. The disks are shared by the data processing computers, and both computers can access the disk cabinets; in addition, the data processing computers are connected to the database server connected.

[0027] Such as figure 2 As shown, the specific processing process of this implementation is as follows:

[0028] (1) Execute mutual exclusion control between multiple data processing servers, the specific steps are a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a mass data processing method based on files. The specific processing process comprises the following steps: (1) carrying out exclusive control on multiple data processing servers and keeping only one data processing system to process data and other data processing servers to act as backups; and (2) grouping the files with the mass data according to the total quantity of concurrent threads and ensuring the grouped files to correspond to different threads to be processed. The mass data processing process provided by the invention aims to ensure the correctness and the integrity of the data under the condition that various accidents take place in the computer system and ensure the processability of the mass data while ensuring the correctness and the integrity of the data.

Description

【Technical field】 [0001] The invention relates to a massive data processing method, in particular to a file-based massive data processing method. 【Background technique】 [0002] Massive data is too large, the data format is complex, and there are many random situations in the data, which is not easy to classify and process, and its processing is a difficult and complicated task. There are mainly the following reasons [0003] 1. The amount of data is too large, tens of millions or even hundreds of millions of data. In these data, there may be random data format errors, which will lead to great difficulties in system design; [0004] 2. The requirements for software and hardware are high. For the processing of massive data, the system resources occupied are high. If the system resources of software and hardware are allocated reasonably, it is also a major problem in the processing of massive data; [0005] 3. High system fault tolerance is required. When an error occurs in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 袁洁
Owner WUHAN TIANYU INFORMATION IND
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products