Check patentability & draft patents in minutes with Patsnap Eureka AI!

SGD (Stochastic Gradient Descent) algorithm optimization system and method

An algorithm and parallel computing technology, which is applied in the field of big data processing, can solve the problems of big data calculation, increase the network overhead of computing clusters, etc., and achieve the effect of improving overall performance, reducing network overhead, and improving computing performance

Active Publication Date: 2017-01-18
INSPUR BEIJING ELECTRONICS INFORMATION IND
View PDF3 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, under the Spark big data platform, the data set sampling and gradient calculation of the SGD algorithm adopt parallel operations, so that each computing node server needs to undertake a large amount of data calculation, and in a distributed computing environment, each computing node needs to perform Data exchange (shuffle operation) to update the random gradient value and weight value increases the network overhead of the computing cluster

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • SGD (Stochastic Gradient Descent) algorithm optimization system and method
  • SGD (Stochastic Gradient Descent) algorithm optimization system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The core of the present invention is to provide an SGD algorithm optimization system and method, which can optimize the SGD algorithm under the Spark framework and improve the overall performance of the SGD algorithm for processing massive data.

[0037] In order to make the above objects, features and advantages of the present invention more comprehensible, the specific implementation manners of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0038] In the following description, specific details are set forth in order to provide a thorough understanding of the present invention. However, the present invention can be implemented in many other ways than those described here, and those skilled in the art can make similar extensions without departing from the connotation of the present invention. Accordingly, the invention is not limited to the specific implementations disclosed below.

[0039] Please refer to figu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention disclsoes an SGD (Stochastic Gradient Descent) algorithm optimization system and method. The system comprises: a plurality of distributed calculation nodes based on an FPGA (Field Programmable Gate Array) and a center calculation node based on a CPU (Central Processing Unit), wherein each distributed calculation node is used for carrying out parallel calculation on data to be processed in an SGD algorithm through a corresponding parallel data path in the FPGA; the center calculation node is used for distributing and dispatching a data processing task. The center calculation node can be used for distributing different calculation tasks according to structural characteristics of each distributed calculation node, and a calculation performance, an energy efficiency ratio and a computation real-time performance of each distributed calculation node are improved; all the distributed calculation nodes do not need to be subjected to data exchange and network overhead of a calculation cluster is reduced. The distributed calculation nodes are arranged based on the FPGA and the center calculation node is arranged based on the CPU, so that a heterogeneous calculation platform is formed; the SGD algorithm is subjected to parallel designing and the SGD algorithm under a Spark framework is extremely optimized; the whole performance of processing massive data by the SGD algorithm is improved.

Description

technical field [0001] The invention relates to the field of big data processing, in particular to an SGD algorithm optimization system and method. Background technique [0002] With the development of information technology, we have entered the era of big data. Many machine learning algorithms can be transformed into convex function optimization problems, that is, the task of finding the minimum value of a regression function, the simplest method of which is gradient descent. The stochastic gradient descent (SGD) algorithm is a typical algorithm for convex function optimization problems in machine learning algorithms. [0003] In the Spark big data processing framework, the processing idea of ​​the SGD algorithm is: first randomly initialize the gradient value and weight value of the function, and use parameters to specify a random sampling subset of the full set of data, and then calculate the average value of the gradient of the data points in the subset , that is, get ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/11G06F9/50
CPCG06F9/5027G06F9/5061G06F17/11Y02D10/00
Inventor 王丽陈继承王洪伟
Owner INSPUR BEIJING ELECTRONICS INFORMATION IND
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More