Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Reinforcement learning deep searching method based on bootstrap DAQN (deep Q network)

A deep search and enhanced learning technology, applied in the field of deep learning, can solve problems such as high computing cost, inability to collect, and low learning efficiency

Inactive Publication Date: 2017-05-31
SHENZHEN WEITESHI TECH
View PDF0 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Effective deep search is a major challenge for reinforcement learning (RL). Commonly used algorithms such as dithering algorithms require a large amount of data. However, it is difficult to obtain such a large amount of data in reality because the correct corresponding learning cannot be collected. data, making learning inefficient and computationally expensive

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reinforcement learning deep searching method based on bootstrap DAQN (deep Q network)
  • Reinforcement learning deep searching method based on bootstrap DAQN (deep Q network)
  • Reinforcement learning deep searching method based on bootstrap DAQN (deep Q network)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be further described in detail below in conjunction with the drawings and specific embodiments.

[0032] figure 1 It is a system framework diagram of a bootstrap DQN-based enhanced learning deep search method of the present invention. It mainly includes bootstrap Deep Q Network (DQN), deep search and environmental background; the bootstrap Deep Q Network includes bootstrap samples and bootstrap DQN, deep search includes deep search test and bootstrap DQN-driven deep search, and environmental background includes generation Online Bootstrap DQN and Bootstrap DQN Driver.

[0033] Among them, the bootstrap sample, the bootstrap principle is the most common form of sample distribution that approximates the population distribution, and the bootstrap is used as the input data set D and t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a reinforcement learning deep searching method based on bootstrap DAQN (deep Q network), mainly comprising: bootstrap DQN, deep searching and environmental background, wherein the bootstrap DQN includes a bootstrap sample and a bootstrap DQN unit; the deep searching includes deep search test and bootstrap DQN drive deep searching; the environmental background includes generating online bootstrap DQN and bootstrap DQN drive. The bootstrap DQN is a practical reinforcement learning algorithm combining deep learning and deep searching, it is proved that bootstrapping may create effective uncertain estimation on a deep neural network and may also be extended to a large-scale parallel system, information is ranked in multiple time steps, and sample diversity is guaranteed; the bootstrap DQN acts as an effective reinforcement learning algorithm in a complex environment to process mass data in parallel, calculation cost is low, learning efficiency is high, and the method has excellent performance.

Description

technical field [0001] The invention relates to the field of deep learning, in particular to a bootstrap DQN-based enhanced learning deep search method. Background technique [0002] Reinforcement learning is one of the machine learning methods. It completes the mapping learning from the environment state to the action, selects the optimal strategy according to the maximum feedback value, searches the strategy to select the optimal action, causes the state change to obtain the delayed feedback value, and evaluates the function , the iterative loop, until the learning condition is satisfied, the learning is terminated. Effective deep search is a major challenge for reinforcement learning (RL). Commonly used algorithms such as dithering algorithms require a large amount of data. However, it is difficult to obtain such a large amount of data in reality because the correct corresponding learning cannot be collected. data, making learning inefficient and computationally expensiv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/08
CPCG06N3/08
Inventor 夏春秋
Owner SHENZHEN WEITESHI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products