Heterogeneous multi-agent collaborative decision-making method based on depth deterministic policy gradient

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A multi-agent, decision-making technology, applied in digital transmission systems, electrical components, transmission systems, etc., can solve problems such as discrete actions and states

Inactive Publication Date: 2018-09-28

INST OF SOFTWARE - CHINESE ACAD OF SCI

View PDF3 Cites 71 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Its advantage is that it has strong expressive ability and good decision-making ability. The disadvantage is that actions and states are discrete.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0060] In the following, specific embodiments of the present invention will be described in detail in conjunction with the examples and accompanying drawings. The embodiments depicted here are only used to illustrate and explain the present invention, but not to limit the present invention.

[0061] The heterogeneous multi-agent collaborative decision-making method based on deep deterministic policy gradients proposed by the present invention mainly includes the following steps: First, define the characteristic attributes and reward and punishment rules of heterogeneous multi-agents, clarify the state space and action space of the agents, Construct a sports environment for multi-intelligent collaborative decision-making; then, use the deep deterministic policy gradient algorithm to define the actor module for decision-making actions and the critic module for evaluation and feedback, and train the parameters of the learning model, according to the environment in which the agent ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a heterogeneous multi-agent collaborative decision-making method based on a depth deterministic policy gradient, belonging to the collaborative decision-making field of a heterogeneous intelligent unmanned system, comprising the following steps of: firstly, defining heterogeneous multi-agent characteristic attributes and reward and punishment rules, defining multi-agent state space and action space, and constructing multi-agent motion environment for collaboratively making decision; then, establishing an actor module for decision-making action and a critic module for evaluating feedback based on the depth-deterministic strategy gradient algorithm, and training the parameters of the learning model; using the trained model to obtain the multi-agent state sequence; and evaluating the situation of the multi-agent motion state sequence according to the reward and punishment rules set in the environment. The invention may construct reasonable sports environment according to actual needs, achieve the purpose of intelligent sensing and strategy optimization through the synergy between multiple agents in the system, and has a positive effect on the development of the unmanned system field in China.

Description

technical field [0001] The invention belongs to the field of collaborative decision-making of heterogeneous intelligent unmanned systems, and in particular relates to a method for collaborative decision-making of heterogeneous multi-agents based on deep deterministic strategy gradients. Background technique [0002] In recent years, the rapid development of information technology and intelligent perception technology has laid an important foundation for advanced intelligent behaviors such as the perception of complex environments, precise intelligent decision-making, and multi-machine task coordination. The research on intelligent unmanned systems has now become a landmark achievement in the development of artificial intelligence. The complexity of its tasks and the uncertainty of the dynamic environment determine that the system must have strong adaptive and autonomous capabilities. [0003] Traditional intelligent ant colony (Swarm Intelligence) [1] Beginning in 1959, Fre...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): H04L29/08H04L12/24

CPCH04L41/142H04L41/145H04L67/10

Inventor 李瑞英王瑞胡晓惠张慧

Owner INST OF SOFTWARE - CHINESE ACAD OF SCI

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Heterogeneous multi-agent collaborative decision-making method based on depth deterministic policy gradient

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology