Crowd evacuation simulation method and system based on deep reinforcement learning

A reinforcement learning and simulation technology, applied to crowd evacuation simulation methods and systems based on deep reinforcement learning. It addresses the problems of an immutable state space, random experience replay, and a very large state space, with the aim of mitigating the curse of dimensionality and improving learning efficiency and effectiveness.

Active Publication Date: 2021-01-15

SHANDONG NORMAL UNIV

3 Cites, 8 Cited by

Problems solved by technology

The multi-agent deep deterministic policy gradient (MADDPG) algorithm proposed by Lowe et al. is a recent multi-agent deep reinforcement learning algorithm. However, it has problems such as an immutable state space and random experience replay, which seriously reduce its learning efficiency.

At the same time, as the number of agents guiding the evacuation grows and the environment becomes more complex, the state space inevitably becomes very large. These problems seriously limit the algorithm's effectiveness in the field of crowd evacuation.

Examples

Embodiment 1

[0037] This embodiment discloses a crowd evacuation simulation method based on deep reinforcement learning, including:

[0038] According to the scene information and crowd parameter information, initialize the evacuation scene simulation model;

[0039] Divide the crowd into groups and identify the leaders and followers of each group;

[0040] The evacuation path of the crowd is obtained by using a hierarchical path planning method, in which the leader in the upper layer of each group performs global path planning through the E-MADDPG algorithm to obtain the optimal evacuation path, and the followers in the lower layer avoid obstacles and follow the leader along the optimal evacuation path.
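The hierarchy in [0040] can be illustrated with a minimal sketch. The grouping rule (nearest seed point), the leader rule (member closest to the exit), the grid layout, and all function names are assumptions for illustration and are not taken from the patent. A breadth-first search stands in for the leader's E-MADDPG policy, which is not reproduced here.

```python
from collections import deque
from math import dist

def group_by_nearest_seed(positions, seeds):
    """Assign each pedestrian index to its nearest seed point (assumed grouping rule)."""
    groups = {i: [] for i in range(len(seeds))}
    for idx, p in enumerate(positions):
        nearest = min(range(len(seeds)), key=lambda s: dist(p, seeds[s]))
        groups[nearest].append(idx)
    return groups

def pick_leader(member_ids, positions, exit_cell):
    """Assumed rule: the member closest to the exit leads the group."""
    return min(member_ids, key=lambda i: dist(positions[i], exit_cell))

def bfs_path(grid, start, goal):
    """Shortest 4-connected path on a grid (1 = obstacle).
    Stand-in for the learned leader policy, not E-MADDPG itself."""
    rows, cols = len(grid), len(grid[0])
    parent = {start: None}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            path = []
            while cell is not None:  # walk back to the start
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            free = 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0
            if free and (nr, nc) not in parent:
                parent[(nr, nc)] = cell
                queue.append((nr, nc))
    return None  # goal unreachable

# Hypothetical 3x4 scene: two obstacle cells, exit at the bottom-right corner.
grid = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
positions = [(0, 0), (0, 1), (2, 0), (2, 1)]  # (row, col) per pedestrian
seeds = [(0, 0), (2, 0)]
exit_cell = (2, 3)

groups = group_by_nearest_seed(positions, seeds)
leaders = {g: pick_leader(m, positions, exit_cell) for g, m in groups.items() if m}
leader_paths = {g: bfs_path(grid, positions[l], exit_cell) for g, l in leaders.items()}
```

Only the leaders plan globally in this sketch, so the planning cost grows with the number of groups rather than the number of pedestrians, which is the motivation the text gives for the hierarchy.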

[0041] Further, a real-scene database of the shopping mall is received, and pedestrian motion stop points are obtained from pedestrian video by using the YOLO V3 method; these stop points are used as the state space of the E-MADDPG algorithm.
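As a rough illustration of [0041], the sketch below turns per-frame bounding boxes of one tracked pedestrian into stop points and then into coarse grid cells that could serve as discrete states. The detector itself is not called: the boxes are assumed to have already been produced by YOLO V3. The definition of a stop point (low displacement over several consecutive frames), the thresholds, the cell size, and all names are assumptions, since the excerpt does not define them.

```python
from math import dist

def box_center(box):
    """Center of an (x, y, w, h) bounding box in pixels."""
    x, y, w, h = box
    return (x + w / 2.0, y + h / 2.0)

def stop_points(track, speed_eps=2.0, min_frames=3):
    """Return centers where the pedestrian moved less than speed_eps pixels
    per frame for min_frames consecutive frame steps (assumed definition)."""
    centers = [box_center(b) for b in track]
    stops, run = [], 0
    for prev, cur in zip(centers, centers[1:]):
        if dist(prev, cur) < speed_eps:
            run += 1
            if run == min_frames:  # record each stationary run once
                stops.append(cur)
        else:
            run = 0
    return stops

def to_cell(point, cell_size=50.0):
    """Snap a pixel coordinate to a coarse grid cell (column, row)."""
    return (int(point[0] // cell_size), int(point[1] // cell_size))

# Hypothetical track: the pedestrian walks, pauses, then walks again.
track = [
    (100, 200, 20, 40), (110, 200, 20, 40), (120, 200, 20, 40),
    (121, 200, 20, 40), (121, 201, 20, 40), (122, 201, 20, 40),
    (140, 201, 20, 40),
]
state_cells = sorted({to_cell(p) for p in stop_points(track)})
```

Restricting the state space to observed stop points, rather than every pixel or cell in the scene, is one way such a method could keep the state space small.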

[0042] Further, change parameters ar...

Embodiment 2

[0130] This embodiment discloses a crowd evacuation simulation system based on deep reinforcement learning optimized by an experience pool, including:

[0131] An initialization setting module, which initializes the parameters of the evacuation scene simulation model according to the scene information and crowd parameter information;

[0132] An in-group leader selection module, which divides the crowd into groups and selects the leader of each group;

[0133] An evacuation simulation module, which uses the hierarchical path planning method to obtain the evacuation path of the crowd, in which the leader in the upper layer of each group performs global path planning through the E-MADDPG algorithm to obtain the optimal evacuation path, and the followers in the lower layer avoid obstacles and follow the leader to evacuate along the optimal evacuation path.
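The lower layer of the evacuation simulation module can be sketched as a single follower update. This excerpt does not specify the follower motion model, so a greedy grid step is assumed here: the follower moves to the free neighboring cell closest to the leader and stays put if no move helps. The function name, grid, and positions are hypothetical.

```python
from math import dist

def follower_step(pos, leader_pos, grid, occupied):
    """Move one cell toward the leader, skipping obstacles (1) and occupied cells."""
    rows, cols = len(grid), len(grid[0])
    r, c = pos
    candidates = [pos]  # staying put is always allowed
    for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
        inside = 0 <= nr < rows and 0 <= nc < cols
        if inside and grid[nr][nc] == 0 and (nr, nc) not in occupied:
            candidates.append((nr, nc))
    # Greedy choice: the candidate with the smallest distance to the leader.
    return min(candidates, key=lambda cell: dist(cell, leader_pos))

# Hypothetical 3x3 scene with one obstacle in the middle.
grid = [
    [0, 0, 0],
    [0, 1, 0],
    [0, 0, 0],
]
leader = (2, 2)
follower = (0, 1)
trace = [follower]
for _ in range(3):
    follower = follower_step(follower, leader, grid, occupied={leader})
    trace.append(follower)
```

A greedy step like this can stall behind concave obstacles, so a practical follower model would need a stronger local avoidance rule. The sketch only shows the division of labor: followers react locally while the leader supplies the global route.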

Embodiment 3

[0135] This embodiment discloses an electronic device, including a memory, a processor, and computer instructions stored in the memory and run on the processor. When the computer instructions are run by the processor, the steps of the deep reinforcement learning-based crowd evacuation simulation method are performed.

Abstract

The invention discloses a crowd evacuation simulation method and system based on deep reinforcement learning. The method comprises: initializing a constructed evacuation scene simulation model according to scene information and crowd parameter information; dividing the crowd into groups and designating the leader and followers of each group; and using a hierarchical path planning method to obtain the evacuation paths of the crowd, wherein the leader in the upper layer of each group performs global path planning through the E-MADDPG algorithm to obtain an optimal evacuation path, and the followers in the lower layer avoid obstacles and follow the leader to evacuate along the optimal evacuation path. A learning curve and a high-priority experience replay strategy are introduced on top of the traditional MADDPG algorithm to form the E-MADDPG algorithm, which improves the learning efficiency of the algorithm. A hierarchical path planning method built on the E-MADDPG algorithm is then used to plan the evacuation paths of the crowd, which effectively shortens the path planning time, guides people to evacuate more effectively, and improves crowd evacuation efficiency.
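The abstract does not give the formula behind the high-priority experience replay strategy. As a hedged illustration, the sketch below uses the common proportional scheme, in which a transition with temporal-difference error δ gets priority (|δ| + ε)^α and is sampled with probability proportional to that priority. The class name, parameters, and default values are assumptions, not the patent's definitions.

```python
import random

class PrioritizedReplayBuffer:
    """Ring buffer that samples transitions in proportion to their priority."""

    def __init__(self, capacity, alpha=0.6, eps=1e-3):
        self.capacity = capacity
        self.alpha = alpha      # 0 gives uniform sampling, 1 fully proportional
        self.eps = eps          # keeps zero-error transitions sampleable
        self.items = []
        self.priorities = []
        self.next_slot = 0

    def add(self, transition, td_error):
        priority = (abs(td_error) + self.eps) ** self.alpha
        if len(self.items) < self.capacity:
            self.items.append(transition)
            self.priorities.append(priority)
        else:  # buffer full: overwrite the oldest entry
            self.items[self.next_slot] = transition
            self.priorities[self.next_slot] = priority
        self.next_slot = (self.next_slot + 1) % self.capacity

    def sample(self, batch_size, rng=random):
        idx = rng.choices(range(len(self.items)),
                          weights=self.priorities, k=batch_size)
        return [self.items[i] for i in idx]

# Hypothetical usage: the transition with a large TD error dominates the batch.
buffer = PrioritizedReplayBuffer(capacity=3)
buffer.add("a", td_error=0.0)
buffer.add("b", td_error=0.0)
buffer.add("c", td_error=5.0)
batch = buffer.sample(1000, rng=random.Random(0))
```

Compared with the uniform random replay of plain MADDPG, this kind of sampling revisits informative transitions more often, which is the efficiency gain the abstract describes. Published prioritized-replay schemes typically also apply importance-sampling weights to correct the resulting bias; those are omitted here for brevity.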

Description

Technical field

[0001] The present disclosure relates to a crowd evacuation simulation method and system based on deep reinforcement learning.

Background

[0002] The statements in this section merely provide background information related to the present disclosure and do not necessarily constitute prior art.

[0003] As public safety incidents become more frequent, large-scale crowd evacuation has become an important part of emergency response that cannot be ignored. In crowded places, once a dangerous accident occurs, people rush to escape the scene to avoid the danger, which causes crowding during the evacuation. Failure to evacuate in time may even lead to collisions and stampede accidents, causing secondary harm to the evacuating crowd. At the same time, large-scale crowd evacuation is a complex process, and large-scale crowd evacuation experiments are difficult to carry out due to problems such as organizational difficultie...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G06F30/27, G06Q10/04, G06Q50/26

CPC: G06F30/27, G06Q10/047, G06Q50/265, Y02A10/40

Inventor: 刘弘, 李信金, 孟祥栋, 赵缘

Owner: SHANDONG NORMAL UNIV
