Studying method and system based on increment Q-Learning
A learning method and learning system technology, applied in the fields of instruments, computing, electrical digital data processing, etc., can solve the problems of lack of online incremental learning, low crawling harvest rate, and inability to update.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] Below in conjunction with accompanying drawing and embodiment the present invention will be further described:
[0031] Reinforcement learning is an important branch of machine learning. From the perspective of intelligent Agent (agent program: in some query systems, users can put forward query requirements in their favorite format, and then the agent program Agent converts them into strictly defined query parameters suitable for database use), it is to study how to use Autonomous Agent perceives the environment and learns the optimal control strategy in the interaction with the environment, so as to achieve the goal state under the guidance of the strategy. The process for the agent to find the target state is a Markov decision process (Markov decision process, MDP), which can be defined by the reward (Reward) equation, that is, the interaction result between the agent and the environment is expressed in the form of reward. If the action taken by the environment is be...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com