Multi-agent cooperation framework based on deep reinforcement learning

A reinforcement learning and deep technology, applied in the field of virtual reality and artificial intelligence, can solve problems such as complexity, dimensionality disaster, and low learning efficiency, and achieve the effects of strong versatility, stability, and scalability

Active Publication Date: 2019-08-02
SOUTHEAST UNIV
View PDF3 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Multi-agent (that is, intelligent body) system can solve complex and distributed problems, and has advantages in speed, reliability, flexibility and maintainability. It has always been the focus of research in the field of artificial intelligence. Since reinforcement learning does not require environmental modeling , has become the main research method of multi-agent collaboration, but it still faces many challenges when solving multi-agent collaboration with continuous state and action space, such as the problem of "curse of dimensionality" and low learning efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-agent cooperation framework based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0031] Such as figure 1 As shown, the multi-agent collaboration framework based on deep reinforcement learning of the present invention consists of four parts, including: agent, billboard, Actor-Critic-based deep reinforcement learning module and next-moment state calculation module.

[0032] An agent is defined by its current state, velocity, and desired goal, where the current state is defined by the agent's current position p c and towards θ c Common representation; speed defines the moving rate and direction of the agent; the expected target provides the moving target point of the agent, and the expected moving direction can be calculated according to the current position and the target position.

[0033] The billboard is mainly responsible for the storage, update and transmission of information. The information stored in the bi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-agent collaboration framework based on deep reinforcement learning, which comprises an agent, a billboard, and a an actor-critic-based deep reinforcement learning module, a next moment state calculation module. The agent is defined by a current state, a speed and an expected target; the billboard is responsible for storing, updating and transmitting information; anactor in the Actor-Critic-based deep reinforcement learning module selects a proper action according to the current environment state and the state of an agent, and training learning is continuously carried out through Critic in combination with evaluation given by the state sequence of each agent, so that an optimal control strategy is obtained; and the next moment state calculation module respectively calculates the state of each agent at the next moment according to the current state of each agent and the taken action, and interacts with the billboard. The multi-agent cooperation frameworkbased on deep reinforcement learning provided by the invention has relatively good expandability and relatively high universality, and can provide a technical scheme for realizing diverse multi-agentcooperation.

Description

[0001] Technical field: [0002] The invention relates to the fields of virtual reality and artificial intelligence, in particular to a multi-agent collaboration framework based on deep reinforcement learning. [0003] Background technique: [0004] Multi-agent (that is, intelligent body) system can solve complex and distributed problems, and has advantages in speed, reliability, flexibility and maintainability. It has always been the focus of research in the field of artificial intelligence. Since reinforcement learning does not require environmental modeling , has become the main research method of multi-agent cooperation, but it still faces many challenges when solving multi-agent cooperation with continuous state and action space, such as the problem of "curse of dimensionality" and low learning efficiency. In recent years, with the rapid development of artificial intelligence technology, deep reinforcement learning has attracted more and more attention, because it has broa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00G06N3/08
CPCG06N20/00G06N3/08G06N3/088
Inventor 孙立博秦文虎翟金凤
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products