Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Robot confrontation method based on evolutionary reinforcement learning

A reinforcement learning and robotics technology, applied in neural learning methods, instruments, artificial life, etc., can solve problems such as poor exploration ability, difficult continuous control of robots, complex modeling process, etc., to achieve the effect of improving exploration ability and stability

Pending Publication Date: 2021-07-09
NANKAI UNIV
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] For the problem of robot confrontation, the existing methods are mainly divided into two categories: differential game and artificial intelligence. The method of differential game can obtain the analytical solution of the robot confrontation strategy by establishing and solving the differential equation. However, their modeling process is very complicated, and Without generalization ability, it can only be applied to specific simple environments; artificial intelligence methods are often difficult to directly deal with the continuous control problems of robots, and need to rely on prior knowledge or other underlying control methods, with poor exploration ability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robot confrontation method based on evolutionary reinforcement learning
  • Robot confrontation method based on evolutionary reinforcement learning
  • Robot confrontation method based on evolutionary reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0070] The present invention uses the evolutionary reinforcement learning method, which is a combination of deep reinforcement learning and evolutionary thinking. The deep reinforcement learning method does not require modeling, and can realize end-to-end control of the robot, and has a certain generalization ability, which can effectively solve the current problems. There are problems with the method, and evolutionary thinking uses...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a robot confrontation method based on evolutionary reinforcement learning. The robot confrontation method comprises the following steps: constructing a robot confrontation strategy based on deep reinforcement learning, wherein the confrontation strategy is a depth deterministic strategy gradient algorithm DDPG; forming an evolutionary depth deterministic strategy gradient algorithm EDDPG in combination with an evolutionary algorithm and the confrontation strategy; and the strategy network model trained by the algorithm DDPG and the algorithm EDDPG being used as a controller to control the robot to confront with the robot controlled by the confronting strategy based on the threat index. According to the method, an evolutionary reinforcement learning method is considered and used, the evolutionary reinforcement learning method is a combination of deep reinforcement learning and an evolutionary thought, the deep reinforcement learning method does not need modeling, end-to-end control over the robot can be achieved, certain generalization ability is achieved, and the problems existing in an existing method can be effectively solved; the evolutionary idea is to improve the exploration ability and stability of reinforcement learning by using the population.

Description

technical field [0001] The invention relates to the technical field of robot confrontation, in particular to a robot confrontation method based on evolutionary reinforcement learning. Background technique [0002] Robot confrontation is a kind of zero-sum game problem with the robot as the carrier and the goal of defeating the opponent under specific rules. Considering the kinematics and dynamics characteristics of the robot itself, robot confrontation is essentially a differential game problem, which is extremely difficult to solve. The current theoretical methods can only solve the differential game problems of some simple models, and the theoretical methods and technologies of robot confrontation still need new breakthroughs. [0003] For the problem of robot confrontation, the existing methods are mainly divided into two categories: differential game and artificial intelligence. The method of differential game can obtain the analytical solution of the robot confrontatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/00G06N3/02G06N3/08
CPCG06N3/02G06N3/08G06N3/006
Inventor 张雪波古明阳赵铭慧姜帆
Owner NANKAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products