Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Flow Shop Scheduling Method Based on Deep Reinforcement Learning

A technology of reinforcement learning and workshop scheduling, applied in the direction of control/adjustment system, program control, instrument, etc., can solve the problems of large network input changes, etc., and achieve the effect of generality improvement, strong popularization, and novel methods

Active Publication Date: 2022-03-01
NORTHEASTERN UNIV LIAONING
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Using a multi-layer perceptron to fit the policy network in reinforcement learning requires manual design of state features and reward functions, and directly using indicators such as the number of machines and jobs as state features will cause large changes in the input of the network, and it is difficult to expand to different numbers of machines. In terms of the number of tasks and the number of jobs, the model may need to be retrained on different problem scales and problem data, and there are still limitations in versatility

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Flow Shop Scheduling Method Based on Deep Reinforcement Learning
  • A Flow Shop Scheduling Method Based on Deep Reinforcement Learning
  • A Flow Shop Scheduling Method Based on Deep Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038]The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0039] like figure 1 As shown, the flow shop scheduling method based on deep reinforcement learning in this embodiment is as follows:

[0040] Step 1: Generate the flow shop problem data set for training, and divide it into training set, verification set and test set in proportion;

[0041] In this embodiment, the flow shop problem data set includes 1,000,000 pieces of training data and 10,000 pieces of test data, which are divided into training set, verification set and test set in proportion.

[0042] The flow shop problem data set is a problem matrix of b*j*m size, where b is the batchsize of training or verification samples, j is the number of workpieces, and m is the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a flow shop scheduling method based on deep reinforcement learning, which takes each workpiece as a node and aggregates the processing time of its process as node information to obtain its embedded representation. Use the pointer network to fit the policy network, use the maximum completion time as a reward, train the policy network and save the parameters. In practical problems, the embedded representation of each artifact is used as the input of the policy network, and the artifact with the highest probability is selected in turn until all artifacts are selected to obtain a complete scheduling sequence. The flow shop scheduling method described in the present invention can obtain near-optimal solutions on small-scale problems, and can obtain better solutions than heuristic algorithms and genetic algorithms on large-scale problems, and can be extended to problems with different numbers of machines and jobs On the one hand, it breaks the limitation that the model needs to be retrained on different problem scales and problem data, and has wider versatility.

Description

technical field [0001] The invention relates to the technical field of flow shop scheduling, in particular to a flow shop scheduling method based on deep reinforcement learning. Background technique [0002] The flow shop scheduling problem is one of the classic combinatorial optimization problems, which is difficult to find the optimal solution and has high computational complexity. In order to increase the productivity of the system, effective and efficient manufacturing and scheduling technology solutions must be used. Many researchers have proposed various algorithms for this problem. The exact algorithm can get the optimal solution, but the solution time is often too long, so it is not suitable for large-scale problems; the NEH heuristic algorithm is a kind of flow shop scheduling algorithm that is widely used at present, and can be used to solve the problem with the minimum processing time (makespan) optimization problem. The heuristic algorithm is difficult to guar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05B19/418
CPCG05B19/41865G05B2219/32252
Inventor 戚放任涛王心悦董卓然张皓东
Owner NORTHEASTERN UNIV LIAONING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products