Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-agent cluster obstacle avoidance method based on reinforcement learning

A technology of enhanced learning and multi-agents, which is applied in two-dimensional position/channel control, non-electric variable control, instruments, etc., can solve the problems of obstacle avoidance and obstacle avoidance algorithm agents that cannot be performed quickly, and achieve high avoidance The effect of failure efficiency

Active Publication Date: 2021-07-23
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF10 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003]Swarm obstacle avoidance has always been an important module of agent group control. Most obstacle avoidance algorithms tend to fall into local optimal values ​​when encountering complex obstacle environments, so that agents cannot fast obstacle avoidance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-agent cluster obstacle avoidance method based on reinforcement learning
  • Multi-agent cluster obstacle avoidance method based on reinforcement learning
  • Multi-agent cluster obstacle avoidance method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0108] The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings, but the protection scope of the present invention is not limited to the following description.

[0109] The invention combines the Flocking collaborative control algorithm and the Q-learning algorithm according to the obstacle avoidance requirements of the agent cluster in the obstacle environment task execution process, and proposes an autonomous collaborative obstacle avoidance method for multiple agents for complex obstacle environments. In the learning process, there is no need to learn from the historical experience of its neighbors, which helps to speed up the training efficiency of multi-agent clusters, specifically:

[0110] Such as figure 1 As shown, a multi-agent group obstacle avoidance method based on reinforcement learning includes the following steps:

[0111] S1. Establish the motion model of the swarm system:

[0112]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-agent cluster obstacle avoidance method based on reinforcement learning. The method comprises the following steps: S1, establishing a motion model of a cluster system; S2, defining an obstacle avoidance factor xi and an obstacle avoidance evaluation criterion; S3, designing a state space, a behavior space and a reward function trained by a cluster formation transformation obstacle avoidance model Q-learning when ximin is less than ximin; S4, enhancing a state space, a behavior space and a reward function of learning training by the cluster autonomous collaborative obstacle avoidance model during design; S5, designing an agent behavior selection method; and S6, obtaining a Q value table obtained by training, and carrying out cluster autonomous collaborative obstacle avoidance based on the motion model defined in the S1. According to the invention, parameters such as an obstacle avoidance factor and an obstacle avoidance evaluation criterion are used for selection and judgment of an intelligent agent cluster obstacle avoidance model, and a Q-learning algorithm is combined to train a cluster autonomous collaborative obstacle avoidance model, so that an optimal cluster individual obstacle avoidance strategy and high obstacle avoidance efficiency are obtained.

Description

technical field [0001] The invention relates to multi-agent obstacle avoidance, in particular to a multi-agent group obstacle avoidance method based on reinforcement learning. Background technique [0002] In recent years, intelligent bodies such as unmanned aerial vehicles and unmanned cars have developed rapidly due to their high stability, strong adaptability, and low risk; the clustering of intelligent bodies solves the problem of limited functionality of a single intelligent body. At the same time, intelligent individuals are effectively integrated. [0003] Swarm obstacle avoidance has always been an important module of agent group control. Most obstacle avoidance algorithms tend to fall into local optimal values ​​when encountering complex obstacle environments, making agents unable to quickly avoid obstacles. Contents of the invention [0004] The purpose of the present invention is to overcome the deficiencies of the prior art, to provide a multi-agent cluster ob...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/0217G05D1/0221G05D1/0289G05D1/104
Inventor 张瑛黄治宇薛玉玺肖剑吴磊高天奇张钱江
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products