Systems and Methods for Providing Reinforcement Learning in a Deep Learning System

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
a deep learning system and reinforcement learning technology, applied in the field of deep learning networks, can solve the problems of inapplicability of approaches, reliance on statistically inefficient exploration strategies, and no exploration,

Inactive Publication Date: 2017-02-02

THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIV

View PDF4 Cites 64 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Benefits of technology

This patent describes systems and methods for using reinforcement learning in deep learning networks. This involves maintaining a deep neural network and applying a reinforcement learning process to it. The process involves receiving observed data and artificial data and generating training data based on a set of rules. The system then selects an action based on the training data, receives the result data, and updates the observed data with the result data. The process can also use a training mask to determine which data to use for training. Overall, this approach allows for improved learning and optimization in deep learning networks.

Problems solved by technology

These approaches are not practical in complex environments that require the system to generalize in order to operate properly.

Thus, these reinforcement learning approaches in large-scale application have relied upon either statistically inefficient exploration strategies or include no exploration at all.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0026]Turning now to the drawings, systems and methods for providing reinforcement learning to a deep learning network in accordance with various embodiment of the invention are disclosed. For purposes of this discussion, deep learning networks are machine learning systems that use a dataset of observed data to learn how to solve a problem in a system where all of the states of the system, actions based upon states, and / or the resulting transitions are not fully known. Examples of deep learning networks include, but are not limited to, deep neural networks.

[0027]System and methods in accordance with some embodiments of this invention that provide reinforcement learning do so by providing an exploration process for a deep learning network to solve a problem in an environment. In reinforcement learning, actions taken by a system may impose delayed consequences. Thus, the design of exploration strategies is more difficult than systems that are action-response systems where there are no...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Systems and methods for providing reinforcement learning for a deep learning network are disclosed. A reinforcement learning process that provides deep exploration is provided by a bootstrap that applied to a sample of observed and artificial data to facilitate deep exploration via a Thompson sampling approach.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS[0001]The current application is a Continuation-In-Part Application of U.S. patent application Ser. No. 15 / 201,284 filed Jul. 1, 2016 that in turn claims priority to U.S. Provisional Application No. 62 / 187,681, filed Jul. 1, 2015, the disclosures of which are incorporated herein by reference as if set forth herewith.FIELD OF THE INVENTION[0002]This invention relates to deep learning networks including, but not limited to, artificial neural networks. More particularly, this invention relates to systems and methods for training deep learning networks from a set of training data using reinforcement learning.BACKGROUND OF THE INVENTION[0003]Deep learning networks including, but not limited to, artificial neural networks are machine learning systems that receive data, extract statistics and classify results. These systems use a training set of data to generate a model in order to make data driven decisions to provide a desired output.[0004]Deep lear...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/08G06N99/00

CPCG06N99/005G06N3/08G06N3/044G06N3/045

InventorOSBAND, IAN DAVID MOFFATVAN ROY, BENJAMIN

OwnerTHE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIV

Systems and Methods for Providing Reinforcement Learning in a Deep Learning System

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Benefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology