Value decomposition multi-agent reinforcement learning training method using attention network
A reinforcement learning and multi-agent technology, applied in the field of reinforcement learning training, can solve problems such as dimension explosion, credit allocation, and unstable environment, and achieve the effect of improving performance and performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 2
[0113] The value decomposition multi-agent reinforcement learning training method using the attention network of the present invention can be applied to many fields such as robot control, traffic coordination, and manufacturing control.
[0114] The multi-agent reinforcement learning training method using the value decomposition of the attention network described in the present invention is used for the control of the robot and cooperates with the work of each module, wherein the agent represents each module and part of the robot, including the power module, the arm module, leg module, head module. The strategy of the agent is the current action, in which the power module controls the magnitude of the output, and other actionable modules control the direction and magnitude of the action. All these actions are controlled by the agent value function network, and the reward represents the distance of the entire robot action. Each module can only see its own working status, but ca...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


