Spark-based support vector machine parameter optimization parallel particle swarm optimization method

A technology of support vector machine and particle swarm optimization, which is applied to computer components, instruments, character and pattern recognition, etc., can solve the problems of complex calculation, unacceptable efficiency of PSO stand-alone algorithm, long stand-alone implementation time, etc. The effect of high efficiency, high reliability and fast operation speed

Inactive Publication Date: 2017-10-13
CHINA UNIV OF MINING & TECH
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

PSO involves multiple iterations, the calculation is more complex, and the single-machine implementation takes longer
Especially in the case of a large amount of data and a large number of iterations, the efficiency of the PSO stand-alone algorithm is often unacceptable.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Spark-based support vector machine parameter optimization parallel particle swarm optimization method
  • Spark-based support vector machine parameter optimization parallel particle swarm optimization method
  • Spark-based support vector machine parameter optimization parallel particle swarm optimization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0051] The flow chart of parameter optimization with particle swarm algorithm is shown in the appendix figure 1 . The optimization process under this method includes: first, initialize 40 particles; then cross-validate; then select the global extremum; and finally update the particles. From the start of particle swarm algorithm parameter optimization to the end of optimization, the calculation of the global extremum of 40 particles may be performed more than once during this period, so the parallel speed and position update of 40 particles may also be performed more than once. So there will be multiple iterative calculations. attached figure 1 The global extremum (before) in refers to the global extremum before each particle update iteration operation, and the global extremum (back) refers to the global extremum after each particle update iteration operation. According to the size relationship between them, the update principle of the two is: if the global extremum (post) o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a spark-based support vector machine parameter optimization parallel particle swarm optimization method which is applied to machine learning model training. The initialization information of all nodes is converted into information of an RDD form and is stored. The cross validation of node particles of different address coordinate parameters is carried out, and the node particles are mapped as an accurate rate of cross validation and the individual extremums of particles. The individual extremums of all particles are used to find a global extremum. A node particle is updated in each RDD according to the global extremum. Whether the global extremum reaches a target accurate rate or a number of iterations reaches a limit or not is judged, an optimization process is ended if so, otherwise the parallel cross validation of the node particles is repeated. The running speed of the optimization process is fast, the search range is large, and the searching of global optimum by the particle swarm algorithm is accurate and rapid.

Description

technical field [0001] The invention relates to a parallel particle swarm algorithm, and is especially suitable for a parallel particle swarm optimization method for parameter optimization of a spark-based support vector machine used in machine learning model training. Background technique [0002] A support vector machine is a machine learning method. Firstly, the optimal model parameters are obtained by cross-validating the training data set, then the optimal model parameters found in the previous step are used to train the prediction model, and finally the unknown data is predicted using the trained prediction model. [0003] The grid algorithm has already realized the optimization of parameters, and the grid algorithm has the limitation of local optimum, while the global optimization ability of the particle swarm algorithm solves the local optimum problem very well. Particle swarm optimization algorithm is a biological evolutionary algorithm that imitates the foraging o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2111G06F18/2411G06F18/214
Inventor 刘鹏仰彦妍赵慧含叶帅尹良飞王学奎孟磊
Owner CHINA UNIV OF MINING & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products