Adaptive parallel processing method aiming at variable length characteristic extraction for big data

A feature extraction and parallel processing technology, applied in the field of big data processing, can solve the problems of high cost and limited number of CPU cores in distributed cluster systems

Inactive Publication Date: 2014-01-29
ZHENJIANG ZHONGAN COMM TECH
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the limited number of CPU cores, the high cost of establishing a distributed cluster system, and the use of GPU processing is also restricted by hardware capabilities, limited to only processing feature data of fixed length, so the processing of feature extraction in the big data environment Methods and capabilities still need to be further innovated and improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Adaptive parallel processing method aiming at variable length characteristic extraction for big data
  • Adaptive parallel processing method aiming at variable length characteristic extraction for big data
  • Adaptive parallel processing method aiming at variable length characteristic extraction for big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The content of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0026] 1. The overall process of a large data-oriented adaptive parallel processing method for variable-length feature extraction involved in the present invention is: according to the length of its own hardware characteristics and feature data, adaptively adopt parallelizable matrix arrays The processing method divides the feature data to be extracted into N parts (see attached figure 1 ), process the data in batches, extract features of a certain length each time, construct a matrix array with good parallelism in parallel according to each part of the feature data and the task data from the big data to be processed, and perform multi-threading on the data Execute the processing concurrently and record the matching results; after the entire feature extraction is completed, process all the matching results according to the error tolerance rate ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an adaptive parallel processing method aiming at variable length characteristic extraction for big data. The method aims at variable-length characteristic data, and big data are processed through a graphic processing unit (GPU) parallel computing power based on compute unified device architecture (CUDA). During processing of the big data, an adaptive parallel matrix array processing mode is used for performing multithreading concurrent execution processing on the data according to own hardware characteristics and lengths of the characteristic data, so that the characteristic extraction speed is accelerated. According to the method, the data are processed in batches according to the hardware own processing capacity and the lengths of the characteristic data, and characteristics of certain length are extracted every time, and matching results are recorded; after the whole characteristics are extracted, all matching results are processed according to the allowed error-tolerant rate of data sampling to obtain requested characteristic extraction results. Good parallelism of matrix arrays is used, and the method aims at extracting variable-length characteristics, so that data can be parallelized effectively and sufficiently, and the method is particularly applicable to big-data rapid characteristic extraction with certain fault tolerance.

Description

technical field [0001] The invention belongs to the technical field of big data processing, relates to a feature extraction method, and more particularly relates to an adaptive parallel processing method for variable-length feature extraction oriented to big data. technical background [0002] With the advent of the big data era, how to quickly process big data and extract effective information has become a cutting-edge research hotspot in the IT industry. At present, feature extraction technology is more and more widely used in image processing, pattern recognition and network intrusion detection, especially in the big data environment, the flexibility and efficiency of variable length feature extraction has become a constraint on the ability to quickly process data bottleneck. [0003] According to the retrieval of existing patent materials, there are two main methods for feature extraction of big data: one is to increase the number of CPU cores or establish a distributed...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/38G06F9/50G06T1/20
Inventor 刘镇焦弘杰吕超邢红兵
Owner ZHENJIANG ZHONGAN COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products