Target task processing method, device and equipment based on reinforcement learning migration
A target task and reinforcement learning technology, applied in machine learning, instrumentation, computing, etc., can solve problems such as slow convergence speed and learning speed that cannot meet the growing needs, and achieve the effect of accelerating convergence speed and improving task learning speed
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0028] The technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
[0029] In addition, the technical features involved in the different embodiments of the present invention described below may be combined with each other as long as there is no conflict with each other.
[0030] The embodiments of the present invention improve the learning speed of the target task by transferring the reinforcement learning process of the learned task to the target task. In each of the following embodiments, the semi-Markov decision process is used to introduce the concept of option (option). I...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


