Football robot ball-carrying strategy selection method based on reinforcement learning

A football robot and reinforcement learning technology, applied in the field of football robot dribbling strategy selection based on reinforcement learning, to achieve the effects of improving running speed, fast dribbling process, and reducing memory and computing units

Active Publication Date: 2020-12-29
TONGJI UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The purpose of the present invention is exactly to provide a kind of soccer robot dribbling strategy selection method based on reinforcement learning in order to overcome the defective that above-mentioned prior art exists, and this method is passed through reinforcement Learning is introduced into the selection of football robot dribbling strategy, and the three-dimensional movement is decomposed into three independent learners, and then the frequency adjustment learning method is used to make the three agents converge synchronously, which can not only dynamically adjust the speed, but also avoid reinforcement learning The problem of too high dimensionality is difficult to converge

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Football robot ball-carrying strategy selection method based on reinforcement learning
  • Football robot ball-carrying strategy selection method based on reinforcement learning
  • Football robot ball-carrying strategy selection method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0057] This embodiment takes the omnidirectional walking Nao robot used by the RoboCup standard platform group as an example to illustrate the reinforcement learning-based football robot dribbling strategy selection method proposed by the present invention. The method specifically includes the following steps:

[0058] The first step is to establish the robot-ball-target position model

[0059] At present, most of the relevant research is to establish the state space by dividing the stadium. Such a huge state space requires a large amount of memory and computing units. The present invention models local behavior using angle and distance values. Such as figure 1 As shown, taking robot-ball angle α, robot-ball distance ρ, and robot-ball-target angle supplementary angle β as state parameters, the velocity vector [V x ,V y ,V z ] is the action parameter, V x , V y , V z Represent the speed of the robot's movement in three dimensions, with the player taking the ball to the e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a football robot ball-carrying strategy selection method based on reinforcement learning, a football field scene environment of a football robot is established on the basis ofa RoboCup simulation platform, and the method comprises the following steps: constructing a football robot ball target position model; decomposing the soccer robot ball target position model into a plurality of independent agents, obtaining a plurality of independent learners sharing the same state space and having different speed spaces, and setting a reward function for each independent learner; for each independent learner, constructing a reinforcement learning model based on SARSA (lambda), and carrying out approximate processing on the action value by adopting an RBF network; and training each independent learner, synchronously converging the learners by adopting a frequency adjustment learning method, obtaining a complete model, and completing ball-carrying strategy selection. Compared with the prior art, the invention has the advantages that the ball carrying process of the robot is quicker, the ball carrying is more controllable, the convergence is improved and the like.

Description

technical field [0001] The invention relates to the technical field of soccer robot sports, in particular to a method for selecting a football robot dribbling strategy based on reinforcement learning. Background technique [0002] In the RoboCup standard platform group competition, dribbling is a complex behavior during which the robot player tries to maneuver the ball in a very controlled manner while moving towards the desired goal. For a biped robot, the interaction between the ball, the robot, and the ground needs to be considered to obtain the velocities in the forward, lateral, and rotational directions, which makes this task highly dynamic and non-linear. Therefore, in the current technical solution, the dribbling behavior mostly controls the ball through a fixed speed and a fixed angle. What is the relationship between them? The robot always adjusts to the three-point line with the given angular speed and lateral movement speed, and then takes the ball to the target...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F30/27G06F111/06
CPCG06F30/27G06F2111/06Y02T10/40
Inventor 刘成菊张浩陈启军
Owner TONGJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products