A Control Method of Inverted Pendulum Based on Neural Network and Reinforcement Learning
A control method combining reinforcement learning and neural networks, applied in the field of artificial intelligence and control, which accelerates generation of the control quantity, improves efficiency, and updates quickly
Embodiment Construction
[0031] An implementation of the inverted pendulum control method of the present invention, based on neural networks and reinforcement learning, proceeds as follows:
[0032] The overall control framework of the present invention is a reinforcement learning controller. At each time step t = 1, 2, ..., the agent observes the state s_t of the Markov decision process, chooses an action a_t, and receives an immediate reward r_t, and the system transitions to the next state s_{t+1} with transition probability p(s_t, a_t, s_{t+1}). The evolution of the system over the first n steps is therefore as follows:
[0033]
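The interaction described in [0032] can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the two-state toy MDP, the `TRANSITIONS` table, the reward rule, and the names `step`, `rollout`, and `random_policy`. It only shows how a sequence of (s_t, a_t, r_t, s_{t+1}) tuples arises from sampling the transition probability p(s_t, a_t, s_{t+1}).

```python
import random

# Hypothetical toy MDP (not from the patent): two states, two actions.
# TRANSITIONS[(s, a)] lists (next_state, probability) pairs, i.e. p(s, a, s').
TRANSITIONS = {
    (0, 0): [(0, 0.9), (1, 0.1)],
    (0, 1): [(0, 0.2), (1, 0.8)],
    (1, 0): [(0, 0.7), (1, 0.3)],
    (1, 1): [(0, 0.1), (1, 0.9)],
}

def reward(s, a, s_next):
    """Illustrative reward (an assumption): +1 for landing in state 1, else 0."""
    return 1.0 if s_next == 1 else 0.0

def step(s, a, rng):
    """Sample s_{t+1} from p(s_t, a_t, .) and return (r_t, s_{t+1})."""
    next_states, probs = zip(*TRANSITIONS[(s, a)])
    s_next = rng.choices(next_states, weights=probs, k=1)[0]
    return reward(s, a, s_next), s_next

def rollout(policy, s_start, n_steps, seed=0):
    """Record the first n steps as (s_t, a_t, r_t, s_{t+1}) tuples."""
    rng = random.Random(seed)
    trajectory, s = [], s_start
    for _ in range(n_steps):
        a = policy(s, rng)
        r, s_next = step(s, a, rng)
        trajectory.append((s, a, r, s_next))
        s = s_next
    return trajectory

def random_policy(s, rng):
    """Placeholder policy: pick either action with equal probability."""
    return rng.choice([0, 1])

trajectory = rollout(random_policy, s_start=0, n_steps=5)
for s, a, r, s_next in trajectory:
    print(f"s={s} a={a} r={r} s'={s_next}")
```

In an actual inverted pendulum controller the state would be continuous (for example cart position and pole angle) and the policy would be learned rather than random; the discrete table here is used only to keep the sketch short.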
[0034] The goal of a reinforcement learning system is to learn a policy π such that the cumulative discounted reward obtained in future time steps
[0035] is maximized (0≤γ≤1 is the discount factor); such a policy is the optimal policy. In many real situations, however, the state transition probability function P and the reward function R of the environment ...
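The formula for the cumulative discounted reward did not survive in this text. As an assumption, the sketch below uses the standard textbook form, the sum over k of γ^k · r_{t+k}, which may differ in indexing from the patent's own expression. The function name `discounted_return` and the sample rewards are hypothetical.

```python
def discounted_return(rewards, gamma):
    """Return sum_k gamma**k * rewards[k], accumulated from the last reward backwards."""
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma must lie in [0, 1]")
    g = 0.0
    for r in reversed(rewards):
        # Recurrence G_t = r_t + gamma * G_{t+1}
        g = r + gamma * g
    return g

rewards = [1.0, 0.0, 2.0]  # made-up rewards r_t, r_{t+1}, r_{t+2}
print(discounted_return(rewards, gamma=0.5))  # 1 + 0.5*0 + 0.25*2 = 1.5
```

With γ close to 0 the agent weighs only the immediate reward; with γ = 1 all future rewards count equally, which for a finite reward list reduces to a plain sum.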