Preprocess method of partially observable Markov decision process based on points
A preprocessing and decision-making technology, applied in the field of acceleration of approximate algorithms, to achieve accelerated convergence, fast optimal strategy, and overcome the effects of high computational complexity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0023] The present invention will be described in detail below in conjunction with the accompanying drawings.
[0024] Step 0 is the initial state of the present invention, it imports experimental data from data file;
[0025] Step 1 samples point sets through random interactions with the environment, where actions and observations are randomly obtained according to the probability of the experimental data. When the agent performs action a and obtains observation z, the belief state will be updated from b to b' according to the following formula:
[0026] b ′ ( s ′ ) = b a z ( s ′ ) = Pr ( s ′ | b , a , z ) = ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com