Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Crowd evacuation simulation method and system based on deep reinforcement learning

A technology of reinforcement learning and simulation method, which is applied in the field of crowd evacuation simulation and system based on deep reinforcement learning, which can solve the problems of immutable state space, random experience playback, and huge state space, so as to solve the problem of dimension disaster and improve learning efficiency , the effect of improving the effectiveness

Active Publication Date: 2021-01-15
SHANDONG NORMAL UNIV
View PDF3 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The multi-agent deep deterministic policy gradient (Multi-Agent Deep Deterministic Policy Gradient, MADDPG) algorithm proposed by Lowe et al. is a new multi-agent deep reinforcement learning algorithm, but the algorithm also has immutable state space and experience playback. Random and other problems seriously affect the learning efficiency of the algorithm
At the same time, with the increase in the number of agents guiding evacuation and the increase in the complexity of the environment, a huge state space is inevitably brought about. These problems seriously affect the application effect of the algorithm in the field of crowd evacuation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Crowd evacuation simulation method and system based on deep reinforcement learning
  • Crowd evacuation simulation method and system based on deep reinforcement learning
  • Crowd evacuation simulation method and system based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0037] This embodiment discloses a crowd evacuation simulation method based on deep reinforcement learning, including:

[0038] According to the scene information and crowd parameter information, initialize the evacuation scene simulation model;

[0039] Divide the crowd into groups and identify the leaders and followers of each group;

[0040] The evacuation path of the crowd is obtained by using the hierarchical path planning method, in which the leader in the upper group performs global path planning through the E-MADDPG algorithm to obtain the optimal evacuation path, and the followers in the lower group avoid obstacles and follow the leader along the optimal path. Evacuation route for evacuation.

[0041] Further, the real scene database of the shopping mall is received, and the pedestrian motion stop point is obtained from the pedestrian video by using the YOLO V3 method, which is used as the state space of the E-MADDPG algorithm.

[0042] Further, change parameters ar...

Embodiment 2

[0130] In this embodiment, a crowd evacuation simulation system based on deep reinforcement learning optimized by experience pool is disclosed, including:

[0131] The initialization setting module performs initialization setting of parameters in the evacuation scene simulation model according to the scene information and crowd parameter information;

[0132] The leader selection module in the group realizes the grouping of people; selects the leader in the group;

[0133] The evacuation simulation module uses the hierarchical path planning method to obtain the evacuation path of the crowd. Among them, the leader in the upper group performs global path planning through the E-MADDPG algorithm to obtain the optimal evacuation path, and the followers in the lower group avoid obstacles and follow the leader evacuate along the optimal evacuation path.

Embodiment 3

[0135] An electronic device is disclosed in this embodiment, including a memory, a processor, and computer instructions stored in the memory and run on the processor. When the computer instructions are run by the processor, the deep reinforcement learning-based Steps of crowd evacuation simulation method.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a crowd evacuation simulation method and system based on deep reinforcement learning. The method comprises the steps of initializing a constructed evacuation scene simulation model according to scene information and crowd parameter information; grouping crowds, and dividing leaders and followers of each group; adopting a hierarchical path planning method to obtain evacuation paths of crowds, wherein a leader in an upper-layer group performs global path planning through an EMADDPG algorithm to obtain an optimal evacuation path, and followers in a lower-layer group avoidobstacles and follow the leader to evacuate along the optimal evacuation path. A learning curve and a high-priority experience playback strategy are introduced on the basis of a traditional MADDPG algorithm, an EMADDPG algorithm is formed, the learning efficiency of the algorithm is improved, a hierarchical path planning method is provided on the basis of the EMADDPG algorithm and used for planning evacuation paths of crowds, the path planning time is effectively shortened, and the crowd evacuation efficiency is improved. People can be better guided to evacuate, and the crowd evacuation efficiency is improved.

Description

technical field [0001] The present disclosure relates to a crowd evacuation simulation method and system based on deep reinforcement learning. Background technique [0002] The statements in this section merely provide background information related to the present disclosure and do not necessarily constitute prior art. [0003] With the increasing frequency of public safety issues, large-scale crowd evacuation has become an important link that cannot be ignored in emergency response. In crowded places, once a dangerous accident occurs, the crowd will rush to escape the scene in order to avoid the danger, which will cause crowding during the evacuation process. Failure to evacuate in time may even cause collisions and stampede accidents, causing secondary damage to the evacuated crowd. At the same time, large-scale crowd evacuation is a complex process, and large-scale crowd evacuation experiments are difficult to carry out due to problems such as organizational difficultie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F30/27G06Q10/04G06Q50/26
CPCG06F30/27G06Q10/047G06Q50/265Y02A10/40
Inventor 刘弘李信金孟祥栋赵缘
Owner SHANDONG NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products