Parallel learning automaton optimization method based on partial mean value fusion
An automaton and fusion algorithm technology, which is applied in the field of information processing and can solve problems such as difficulty in expansion and poor robustness.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0027] This embodiment adopts two groups of environments with 10 actions, and the reward probabilities of the first group of environments for actions are E A :{0.70, 0.50, 0.30, 0.20, 0.40, 0.50, 0.40, 0.30, 0.50, 0.20}, the reward probabilities of the second group of environments to actions are E B :{0.10,0.45,0.84,0.76,0.20,0.40,0.60,0.70,0.50,0.30}. The learning rules of the learning automata adopt the most typical DP RI Algorithm and DGPA algorithm. In the following examples, two kinds of learning rules are used to implement the present invention in two groups of environments, totally 4 sets of systems.
[0028] A) When the learning algorithm is DP RI Algorithm, implementing the present invention specifically includes steps as follows:
[0029]Initialization: set the input parameters of the algorithm and initialize the learning automaton, specifically: set the parallel scale N and the convergence threshold of the learning automaton, and set in turn: n is the resolution...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com