A method of interactive reinforcement learning from demonstration and human assessment feedback
A reinforcement learning, human technique, applied in the field of artificial intelligence
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0011] The present invention will be described in detail below in combination with specific embodiments.
[0012] The present invention IRL-TAMER combines Inverse Reinforcement Learning (IRL), a typical agent learning from demonstration, and the TAMER framework, a typical agent learning method from human rewards. We hypothesized that agents learning via IRL-TAMER require less feedback than agents learning from human rewards alone, especially negative feedback, tested the algorithm on the Grid World task domain and compared it to human rewards via the TAMER framework. Agent learning using different discount factors was compared, and the results show that although learning an agent through IRL with a demonstration cannot obtain an effective control policy, it can still learn a useful value function through demonstration, which represents which states compare it is good. More importantly, learning from demonstrations reduces the amount of feedback, especially negative feedback, ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com