Multi-agent cooperation framework based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A reinforcement learning and deep technology, applied in the field of virtual reality and artificial intelligence, can solve problems such as complexity, dimensionality disaster, and low learning efficiency, and achieve the effects of strong versatility, stability, and scalability

Active Publication Date: 2019-08-02

SOUTHEAST UNIV

View PDF3 Cites 19 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Multi-agent (that is, intelligent body) system can solve complex and distributed problems, and has advantages in speed, reliability, flexibility and maintainability. It has always been the focus of research in the field of artificial intelligence. Since reinforcement learning does not require environmental modeling , has become the main research method of multi-agent collaboration, but it still faces many challenges when solving multi-agent collaboration with continuous state and action space, such as the problem of "curse of dimensionality" and low learning efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0031] Such as figure 1 As shown, the multi-agent collaboration framework based on deep reinforcement learning of the present invention consists of four parts, including: agent, billboard, Actor-Critic-based deep reinforcement learning module and next-moment state calculation module.

[0032] An agent is defined by its current state, velocity, and desired goal, where the current state is defined by the agent's current position p c and towards θ c Common representation; speed defines the moving rate and direction of the agent; the expected target provides the moving target point of the agent, and the expected moving direction can be calculated according to the current position and the target position.

[0033] The billboard is mainly responsible for the storage, update and transmission of information. The information stored in the bi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-agent collaboration framework based on deep reinforcement learning, which comprises an agent, a billboard, and a an actor-critic-based deep reinforcement learning module, a next moment state calculation module. The agent is defined by a current state, a speed and an expected target; the billboard is responsible for storing, updating and transmitting information; anactor in the Actor-Critic-based deep reinforcement learning module selects a proper action according to the current environment state and the state of an agent, and training learning is continuously carried out through Critic in combination with evaluation given by the state sequence of each agent, so that an optimal control strategy is obtained; and the next moment state calculation module respectively calculates the state of each agent at the next moment according to the current state of each agent and the taken action, and interacts with the billboard. The multi-agent cooperation frameworkbased on deep reinforcement learning provided by the invention has relatively good expandability and relatively high universality, and can provide a technical scheme for realizing diverse multi-agentcooperation.

Description

[0001] Technical field: [0002] The invention relates to the fields of virtual reality and artificial intelligence, in particular to a multi-agent collaboration framework based on deep reinforcement learning. [0003] Background technique: [0004] Multi-agent (that is, intelligent body) system can solve complex and distributed problems, and has advantages in speed, reliability, flexibility and maintainability. It has always been the focus of research in the field of artificial intelligence. Since reinforcement learning does not require environmental modeling , has become the main research method of multi-agent cooperation, but it still faces many challenges when solving multi-agent cooperation with continuous state and action space, such as the problem of "curse of dimensionality" and low learning efficiency. In recent years, with the rapid development of artificial intelligence technology, deep reinforcement learning has attracted more and more attention, because it has broa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06N20/00G06N3/08

CPCG06N20/00G06N3/08G06N3/088

Inventor 孙立博秦文虎翟金凤

Owner SOUTHEAST UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Multi-agent cooperation framework based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology