Text generation method based on meta-reinforcement learning
A technology of reinforcement learning and text, applied in neural learning methods, machine learning, biological neural network models, etc., can solve problems such as poor collection of learning samples, achieve rapid adaptation to new environments, solve data shortages, and design reasonable effects
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0024] The present invention will be described in further detail below in conjunction with the accompanying drawings.
[0025] Meta Reinforcement Learning (Meta RL for short) is a research direction that applies meta-learning to reinforcement learning. Its core idea is to hope that the agent can acquire sufficient prior knowledge in learning a large number of reinforcement learning tasks, and then When faced with new intensive learning tasks, it can learn faster and better, and can quickly adapt to the new learning environment.
[0026] Such as figure 1 As shown, first from the database D train Choose any task τ from i The text data of is used as the current generation environment: initialize the text generation model M, and compare the generated text with the real text as the training error Loss n , using the policy gradient method to perform a few internal gradient updates to the generative model M as M' n . Then use the updated model M' n Continue to sample the text t...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
