Football robot ball-carrying strategy selection method based on reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A football robot and reinforcement learning technology, applied in the field of football robot dribbling strategy selection based on reinforcement learning, to achieve the effects of improving running speed, fast dribbling process, and reducing memory and computing units

Active Publication Date: 2020-12-29

TONGJI UNIV

View PDF4 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The purpose of the present invention is exactly to provide a kind of soccer robot dribbling strategy selection method based on reinforcement learning in order to overcome the defective that above-mentioned prior art exists, and this method is passed through reinforcement Learning is introduced into the selection of football robot dribbling strategy, and the three-dimensional movement is decomposed into three independent learners, and then the frequency adjustment learning method is used to make the three agents converge synchronously, which can not only dynamically adjust the speed, but also avoid reinforcement learning The problem of too high dimensionality is difficult to converge

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0057] This embodiment takes the omnidirectional walking Nao robot used by the RoboCup standard platform group as an example to illustrate the reinforcement learning-based football robot dribbling strategy selection method proposed by the present invention. The method specifically includes the following steps:

[0058] The first step is to establish the robot-ball-target position model

[0059] At present, most of the relevant research is to establish the state space by dividing the stadium. Such a huge state space requires a large amount of memory and computing units. The present invention models local behavior using angle and distance values. Such as figure 1 As shown, taking robot-ball angle α, robot-ball distance ρ, and robot-ball-target angle supplementary angle β as state parameters, the velocity vector [V x ,V y ,V z ] is the action parameter, V x , V y , V z Represent the speed of the robot's movement in three dimensions, with the player taking the ball to the e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a football robot ball-carrying strategy selection method based on reinforcement learning, a football field scene environment of a football robot is established on the basis ofa RoboCup simulation platform, and the method comprises the following steps: constructing a football robot ball target position model; decomposing the soccer robot ball target position model into a plurality of independent agents, obtaining a plurality of independent learners sharing the same state space and having different speed spaces, and setting a reward function for each independent learner; for each independent learner, constructing a reinforcement learning model based on SARSA (lambda), and carrying out approximate processing on the action value by adopting an RBF network; and training each independent learner, synchronously converging the learners by adopting a frequency adjustment learning method, obtaining a complete model, and completing ball-carrying strategy selection. Compared with the prior art, the invention has the advantages that the ball carrying process of the robot is quicker, the ball carrying is more controllable, the convergence is improved and the like.

Description

technical field [0001] The invention relates to the technical field of soccer robot sports, in particular to a method for selecting a football robot dribbling strategy based on reinforcement learning. Background technique [0002] In the RoboCup standard platform group competition, dribbling is a complex behavior during which the robot player tries to maneuver the ball in a very controlled manner while moving towards the desired goal. For a biped robot, the interaction between the ball, the robot, and the ground needs to be considered to obtain the velocities in the forward, lateral, and rotational directions, which makes this task highly dynamic and non-linear. Therefore, in the current technical solution, the dribbling behavior mostly controls the ball through a fixed speed and a fixed angle. What is the relationship between them? The robot always adjusts to the three-point line with the given angular speed and lateral movement speed, and then takes the ball to the target...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F30/27G06F111/06

CPCG06F30/27G06F2111/06Y02T10/40

Inventor 刘成菊张浩陈启军

Owner TONGJI UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Football robot ball-carrying strategy selection method based on reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology