Generative confrontation interactive imitation learning method and system, storage medium and application
A learning method and reinforcement learning technology, applied in the field of generative adversarial interactive imitation learning, which can solve the problems of algorithm performance degradation, long running time, and ill-posed recovery of reward function weights.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0067] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
[0068] Aiming at the problems existing in the prior art, the present invention provides a method, system, storage medium and application of generative confrontation interactive imitation learning. The present invention will be described in detail below with reference to the accompanying drawings.
[0069] Such as figure 1 As shown, the generation confrontation interactive imitation learning method provided by the present invention includes the following steps:
[0070] S101: A GAIL-like stage based on maximum entropy inverse reinforcement learning, used to learn reward functions from expert demonstrations and train human eval...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com