Training method for a reinforcement learning model, node, system and storage medium
A training method based on reinforcement learning technology, applied in the field of machine learning, that addresses the risk of training data leakage and achieves the effect of simplifying the training process and improving the training speed.
Examples
Example 1
[0026] As shown in Figure 2, the first embodiment of the training method for a reinforcement learning model of the present application includes:
[0027] S11: The training node acquires local data and inputs the local data as samples into the first neural network for training, so as to obtain the first optimal sub-objective function.
[0028] Here, the local data is the training data that the training node itself can obtain. The training data may include a training state of the environment, a training action from the set of actions performed by the training node in response to receiving the training state, the training reward received by the training node for performing the training action, and the next training state of the environment.
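The four components of the training data described above can be illustrated as a single record held locally by the training node. The following is a minimal sketch, not the implementation of the present application: the names `Transition` and `LocalBuffer`, the use of integer action indices, and uniform random sampling are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Transition:
    """One training sample (hypothetical name): the four items listed in [0028]."""
    state: Sequence[float]       # training state of the environment
    action: int                  # training action chosen from the action set (assumed to be an index)
    reward: float                # training reward received for performing the action
    next_state: Sequence[float]  # next training state of the environment


class LocalBuffer:
    """Holds samples that never leave the training node (hypothetical helper)."""

    def __init__(self) -> None:
        self._items: List[Transition] = []

    def add(self, transition: Transition) -> None:
        self._items.append(transition)

    def sample(self, batch_size: int, rng: random.Random) -> List[Transition]:
        # Uniform sampling without replacement is an assumption; the source
        # does not say how samples are drawn for training.
        k = min(batch_size, len(self._items))
        return rng.sample(self._items, k)

    def __len__(self) -> int:
        return len(self._items)


# Example usage with made-up values.
buffer = LocalBuffer()
buffer.add(Transition(state=(0.0, 1.0), action=1, reward=0.5, next_state=(0.1, 0.9)))
buffer.add(Transition(state=(0.1, 0.9), action=0, reward=-1.0, next_state=(0.2, 0.8)))
batch = buffer.sample(batch_size=8, rng=random.Random(0))
```

A batch drawn this way would be what step S11 feeds into the first neural network as samples; keeping the buffer local to the node is consistent with the stated goal of avoiding training data leakage.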
[0029] Specifically, in an application example, the first neural network is a deep neural network, the deep neural network has a first sub-objective function determined by parameters, and the first neural ...