Reinforcement learning deep searching method based on bootstrap DAQN (deep Q network)

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A deep search and enhanced learning technology, applied in the field of deep learning, can solve problems such as high computing cost, inability to collect, and low learning efficiency

Inactive Publication Date: 2017-05-31

SHENZHEN WEITESHI TECH

View PDF0 Cites 23 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Effective deep search is a major challenge for reinforcement learning (RL). Commonly used algorithms such as dithering algorithms require a large amount of data. However, it is difficult to obtain such a large amount of data in reality because the correct corresponding learning cannot be collected. data, making learning inefficient and computationally expensive

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0031] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be further described in detail below in conjunction with the drawings and specific embodiments.

[0032] figure 1 It is a system framework diagram of a bootstrap DQN-based enhanced learning deep search method of the present invention. It mainly includes bootstrap Deep Q Network (DQN), deep search and environmental background; the bootstrap Deep Q Network includes bootstrap samples and bootstrap DQN, deep search includes deep search test and bootstrap DQN-driven deep search, and environmental background includes generation Online Bootstrap DQN and Bootstrap DQN Driver.

[0033] Among them, the bootstrap sample, the bootstrap principle is the most common form of sample distribution that approximates the population distribution, and the bootstrap is used as the input data set D and t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a reinforcement learning deep searching method based on bootstrap DAQN (deep Q network), mainly comprising: bootstrap DQN, deep searching and environmental background, wherein the bootstrap DQN includes a bootstrap sample and a bootstrap DQN unit; the deep searching includes deep search test and bootstrap DQN drive deep searching; the environmental background includes generating online bootstrap DQN and bootstrap DQN drive. The bootstrap DQN is a practical reinforcement learning algorithm combining deep learning and deep searching, it is proved that bootstrapping may create effective uncertain estimation on a deep neural network and may also be extended to a large-scale parallel system, information is ranked in multiple time steps, and sample diversity is guaranteed; the bootstrap DQN acts as an effective reinforcement learning algorithm in a complex environment to process mass data in parallel, calculation cost is low, learning efficiency is high, and the method has excellent performance.

Description

technical field [0001] The invention relates to the field of deep learning, in particular to a bootstrap DQN-based enhanced learning deep search method. Background technique [0002] Reinforcement learning is one of the machine learning methods. It completes the mapping learning from the environment state to the action, selects the optimal strategy according to the maximum feedback value, searches the strategy to select the optimal action, causes the state change to obtain the delayed feedback value, and evaluates the function , the iterative loop, until the learning condition is satisfied, the learning is terminated. Effective deep search is a major challenge for reinforcement learning (RL). Commonly used algorithms such as dithering algorithms require a large amount of data. However, it is difficult to obtain such a large amount of data in reality because the correct corresponding learning cannot be collected. data, making learning inefficient and computationally expensiv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/08

CPCG06N3/08

Inventor 夏春秋

Owner SHENZHEN WEITESHI TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Reinforcement learning deep searching method based on bootstrap DAQN (deep Q network)

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology