Distributed data processing method based on mapping reduction

A distributed data and processing method technology, applied in the field of data processing, can solve problems such as data processing problems that cannot meet the needs of different medical institutions, achieve efficient and accurate parallel processing, simplify the processing process, and reduce the workload.

Active Publication Date: 2020-12-04
HEFEI UNIV OF TECH
View PDF11 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, the real electronic medical record data is still stored in the local database of the medical institution. The current medical record system is only a data query system, which cannot meet the data processing problems between different medical institutions.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed data processing method based on mapping reduction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In this embodiment, a distributed data processing method based on map reduction is applied to M hospital databases and x disks N={N 1 ,N 2 ,...,N i ,...,N x} composed of data processing environment, where, N i Indicates the i-th disk that saves data, 1≤i≤x, such as figure 1 As shown, the distributed data processing method is performed in the following steps:

[0035] Step 1. Perform fragmentation processing on the data in the disk, and improve the data processing efficiency and reduce the system load by processing the disk fragmentation;

[0036] Step 1.1, define the i-th disk N i The standard size of the saved data fragmentation is S. By setting the fragmentation size, the fragmentation is processed in parallel to improve the processing efficiency, and the p-th patient ID number is defined as k p , define the medical record information of the pth patient as v p ; From the pth patient's ID number k p and its corresponding pth medical record information v p Comb...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed data processing method based on mapping reduction, which comprises the following steps of: 1, fragmenting data in a magnetic disk; 2, performing data processing on a processing result in the step 1 again in a set buffer area; and 3, stipulating the data from different magnetic disks through circular processing. According to the method, the disk is fragmented,and the data in the disk is sorted and merged twice, so that the data in the disk can be orderly arranged and efficiently queried through the mapping protocol, the workload during large-scale data processing can be reduced, and the operation efficiency is improved.

Description

technical field [0001] The invention belongs to the technical field of data processing, in particular to a distributed data processing method based on mapping and reduction. Background technique [0002] In the traditional data processing mode using centralized data processing, the calculations from all terminals are completed by the host, and the processing speed of this type of network may be somewhat slow. In addition, if users have various needs, it may be difficult to meet these needs on a centralized computer network, because each user's applications and resources must be set up separately, and these applications and resources are all on the same server. It is operated on a centralized computer, which makes the system inefficient. Also, because all users must connect to a central computer, centralized connections can be a big problem with centralized networks. Centralized data processing is based on a large central computer, and all data, calculation, and processing ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16H10/60G06F3/06
CPCG16H10/60G06F3/061G06F3/0644G06F3/0656G06F3/067
Inventor 李磊张人杰卜晨阳吴信东
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products