Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Container stacking optimization method based on deep reinforcement learning

A technology of reinforcement learning and optimization method, which is applied in the field of cargo box stacking optimization based on deep reinforcement learning, can solve the problems that cannot be directly applied, and achieve the effect of shortened solution time, good effect and fast solution.

Pending Publication Date: 2022-05-27
SHANDONG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, all aspects of the intelligent optimization framework of the deep reinforcement learning method must be redesigned according to the characteristics of the problem, and the existing successfully applied methods cannot be directly applied.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Container stacking optimization method based on deep reinforcement learning
  • Container stacking optimization method based on deep reinforcement learning
  • Container stacking optimization method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.

[0054] The present invention provides a cargo box stacking optimization method based on deep reinforcement learning. The objective of minimizing the number of disordered stacking of adjacent cargo boxes is designed to optimize the stacking process, and the cargo boxes with low priority (large value) are stacked first. On top of the high-level (smaller value) containers are disorderly stacking. exist figure 1 , the number of unordered stacks generated by storing 10 containers in the storage space of 3 stacks and 5 layers is 3 (the underlined below the number is the unordered stack).

[0055] The method of the invention can adapt to the situation that the number of the cargo boxes and the highest stacking layer number of the stack change, and improve the stacking effic...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a container stacking optimization method based on deep reinforcement learning, and the method comprises the following steps: designing m environment state variables to represent the stacking state of each stack according to a container stacking sequence and a container lifting priority, and calculating the stacking state of each stack according to the states of n stacks used in the current step and the states of containers to be stacked, calculating to obtain a state matrix S at the current moment; further extracting features in the state matrix S by designing a feature extraction network to obtain a feature matrix T; taking the feature matrix T as input data of a stacking decision network, outputting probability distribution of each stack, and further selecting one stack to stack containers; and training the feature extraction network and the stacking decision network by using a deep reinforcement learning algorithm, evaluating output by using a decision evaluation network during training, optimizing a stacking decision and updating parameters. The method disclosed by the invention can adapt to the condition that the number of the containers and the highest stacking layer number of the stack are changed, so that the purpose of improving the stacking and extracting efficiency of the containers is achieved.

Description

technical field [0001] The invention relates to the technical field of cargo box stacking, in particular to a cargo box stacking optimization method based on deep reinforcement learning. Background technique [0002] The container stacking problem is a basic decision-making problem that is widely used in logistics, and is often seen in storage systems such as port yards and warehouses. Generally speaking, the storage order of the containers is not completely related to the order of picking up the containers. It is best not to stack the containers that are picked up later on top of the containers that are picked up first. Therefore, if the random stacking method is adopted, it will cause a large number of disorderly stacking, which will cause trouble for the luggage. If the boxes that are picked up at the same time are stacked together, even if a dedicated stack is used, storage space will be wasted. Therefore, there is a need for an optimized method to achieve efficient co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06Q10/04G06Q10/08G06N3/04G06N3/08
CPCG06Q10/043G06Q10/08G06N3/08G06N3/045Y02P90/30
Inventor 李歧强段振堂宋文
Owner SHANDONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products