Soccer robot cooperation method based on reinforcement learning

A football robot and reinforcement learning technology, applied in the field of football robot collaboration based on reinforcement learning, can solve problems such as low collaboration efficiency

Active Publication Date: 2019-05-24
NANJING UNIV OF POSTS & TELECOMM
View PDF13 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problem of low cooperation efficiency between soccer robots in the soccer robot competition in the above-mentioned prior art, the present invention proposes a soccer robot collaboration method based on reinforcement learning, which is based on adding communication-bas

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Soccer robot cooperation method based on reinforcement learning
  • Soccer robot cooperation method based on reinforcement learning
  • Soccer robot cooperation method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0023] refer to figure 1 , in an embodiment of the present invention, a football robot collaboration method based on reinforcement learning is provided, specifically including the method:

[0024] S1. Construct the basic reinforcement learning model of the soccer robot based on the Sarsa(λ) algorithm with communication, and set the reward and punishment mechanism r of the basic reinforcement learning model.

[0025] refer to figure 2 , it can be seen that the principle of the basic model of reinforcement learning is: the football robot chooses an action in the state of perceiving the current environment, and at this time the environment state migrates to a new state, and correspondingly, the new state generates a reinforcement signal to feed back to the football robot, football The robot determines the next action according to the current environment information and the reinforcement signal; wherein, the key of the reinforcement learning of the football robot in the present ...

Embodiment 2

[0090]The method in Embodiment 1 is verified by using the HFO experimental platform, specifically, it consists of m offensive players and n defensive players. Among them, defensive players include goalkeepers, and n≥m. Half-court offensive missions are played on one half of the football field and begin near the half-court line with the ball in the possession of the offensive players; see Figure 4 , the figure shows a classic 4v5 HFO platform, where the white solid circle is the ball, four offensive players, and five defensive players including the goalkeeper; To score a goal successfully, it is necessary to learn the three operations of the offensive player's passing, dribbling and shooting through the basic model of reinforcement learning, and simulate the actions of the defensive player trying to stop the offensive player.

[0091] Preferably, the present invention first carries out 30 groups of experiments to analyze errors between football robots having communication and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a soccer robot cooperation method based on reinforcement learning. The soccer robot cooperation method comprises the steps that S1, a reinforcement learning basic model of soccer robots is constructed based on a Sarsa (lambda) algorithm with communication added, and a reward and punishment mechanism r of the reinforcement learning basic model is set; S2, a specified numberof state variables are defined based on the distance and angle between the soccer robots; and S3, operational action sets of the soccer robots are set, and the soccer robots select the next action based on the reward and punishment mechanism r, the state variables and the communication between the soccer robots. According to the soccer robot cooperation method based on the reinforcement learning,the reward and punishment mechanism in set up in the established reinforcement learning basic model, the soccer robots choose the next action according to the current environment and the reward and punishment mechanism, learning and updating are conducted by the soccer robots communicating with one another, and the cooperation efficiency of the soccer robots is effectively improved.

Description

technical field [0001] The invention belongs to the field of soccer robots, and in particular relates to a soccer robot collaboration method based on reinforcement learning. Background technique [0002] As a typical multi-soccer robot system, the football robot competition provides a good experimental platform for the study of intelligence theory and the integrated application of various technologies. The capability requirements are getting stronger and stronger, which involves a series of research topics such as robot positioning, path planning, coordinated control, target tracking and decision-making. [0003] In recent years, many scholars and experts have made a lot of research results. For example, the Chinese patent application number 201120008202.2 discloses an intelligent robot competition device, including mechanical parts and circuit control parts. The mechanical parts include ball tables, consoles and robots. , the circuit control part includes a control module ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): B25J9/16
Inventor 胡丽娟梁志伟李汉辉
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products