A method and system for generating state data for reinforcement learning
A state data and reinforcement learning technology, applied in the field of deep reinforcement learning, can solve problems such as instability in the training process, and achieve the effect of shortening the time required for exploration, reducing instability, and increasing the number of rewards
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0041] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.
[0042] According to an embodiment of the present application, a method for generating state data for reinforcement learning is proposed, such as figure 1 shown, including:
[0043] S101. Obtain all first state data of the agent in the first learning stage, and obtain second state data in all first state data that is within a preset step range from the learning goal;
[0044] S102, using all the first state data to train a variatio...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com