Systems and Methods for Providing Reinforcement Learning in a Deep Learning System

A deep learning system and reinforcement learning technology, applied in the field of deep learning networks, that addresses the problems of existing approaches being inapplicable, relying on statistically inefficient exploration strategies, or performing no exploration.

Inactive Publication Date: 2017-02-02
THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIV

AI Technical Summary

Benefits of technology

[0012]In accordance with some embodiments, one or more reinforcement learning processes are applied to the deep neural network. In accordance with many of these embodiments, each of the reinforcement learning processes independently maintains a set of observed data. In accordance with some other

Problems solved by technology

These approaches are not practical in complex environments that require the system to generalize in order to operate properly.
Thus, these reinforcement

Examples

Embodiment Construction

[0026] Turning now to the drawings, systems and methods for providing reinforcement learning to a deep learning network in accordance with various embodiments of the invention are disclosed. For purposes of this discussion, deep learning networks are machine learning systems that use a dataset of observed data to learn how to solve a problem in a system where all of the states of the system, actions based upon states, and/or the resulting transitions are not fully known. Examples of deep learning networks include, but are not limited to, deep neural networks.

[0027] Systems and methods in accordance with some embodiments of this invention provide reinforcement learning by supplying an exploration process for a deep learning network to solve a problem in an environment. In reinforcement learning, actions taken by a system may impose delayed consequences. Thus, the design of exploration strategies is more difficult than for action-response systems where there are no...
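The effect of delayed consequences on exploration can be illustrated with a toy example. The sketch below is not taken from the patent: it assumes a hypothetical chain of states in which the only reward sits at the far end, and it computes the exact probability that a policy choosing "right" with probability `p_right` at every step reaches that end within a fixed horizon. The function name and parameters are illustrative assumptions.

```python
def prob_reach_goal(n_states, horizon, p_right=0.5):
    """Exact probability that a random policy reaches the last state of a
    chain within `horizon` steps (hypothetical toy environment).

    Assumes n_states >= 2. The agent starts in state 0, moves right with
    probability p_right and left otherwise, and cannot move left of state 0.
    """
    goal = n_states - 1
    # dist[s] = probability of being in state s without having reached the goal
    dist = [0.0] * n_states
    dist[0] = 1.0
    reached = 0.0
    for _ in range(horizon):
        nxt = [0.0] * n_states
        for s, p in enumerate(dist):
            if p == 0.0:
                continue
            right = s + 1
            left = max(s - 1, 0)
            if right == goal:
                # The delayed reward is only seen once the goal is reached.
                reached += p * p_right
            else:
                nxt[right] += p * p_right
            nxt[left] += p * (1.0 - p_right)
        dist = nxt
    return reached


if __name__ == "__main__":
    for n in (5, 11, 21):
        print(n, prob_reach_goal(n, horizon=n - 1))
```

With a horizon of `n_states - 1` steps, only the path that moves right at every step reaches the goal, so under uniform random action selection the probability is `0.5 ** (n_states - 1)`, which shrinks exponentially with the length of the chain. This is one way to see why undirected exploration can be statistically inefficient when rewards are delayed.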

Abstract

Systems and methods for providing reinforcement learning for a deep learning network are disclosed. A reinforcement learning process that provides deep exploration is implemented by a bootstrap that is applied to a sample of observed and artificial data, facilitating deep exploration via a Thompson sampling approach.
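The idea of bootstrapping over observed and artificial data as a stand-in for posterior sampling can be sketched on a much simpler problem than the one the application addresses. The following is a minimal illustration, assuming a two-armed Bernoulli bandit rather than a deep learning network; it is not the claimed method. The function names, the choice of one pessimistic and one optimistic artificial reward per arm, and the bandit setting are all assumptions made for this example.

```python
import random


def bootstrap_mean(observed, artificial, rng):
    """Mean reward of a bootstrap resample drawn from observed plus artificial data."""
    pool = observed + artificial
    resample = [rng.choice(pool) for _ in pool]  # sample with replacement
    return sum(resample) / len(resample)


def run_bandit(true_means, steps, seed=0):
    """Thompson-style arm selection using bootstrap samples instead of a posterior.

    true_means: success probability of each arm (hypothetical environment).
    Returns the number of times each arm was pulled.
    """
    rng = random.Random(seed)
    # Artificial data (assumption): one pessimistic and one optimistic
    # pseudo-reward per arm, so every resample retains some uncertainty.
    artificial = [0.0, 1.0]
    observed = [[] for _ in true_means]
    pulls = [0] * len(true_means)
    for _ in range(steps):
        # One bootstrap estimate per arm plays the role of a posterior sample.
        estimates = [bootstrap_mean(obs, artificial, rng) for obs in observed]
        arm = max(range(len(true_means)), key=lambda a: estimates[a])
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        observed[arm].append(reward)
        pulls[arm] += 1
    return pulls


if __name__ == "__main__":
    print(run_bandit([0.1, 0.9], steps=500))
```

Acting greedily with respect to a randomly resampled estimate, rather than the plain empirical mean, is what makes the selection resemble Thompson sampling: arms with little data produce widely varying estimates and are therefore still tried occasionally.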

Description

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The current application is a Continuation-In-Part Application of U.S. patent application Ser. No. 15/201,284 filed Jul. 1, 2016 that in turn claims priority to U.S. Provisional Application No. 62/187,681, filed Jul. 1, 2015, the disclosures of which are incorporated herein by reference as if set forth herewith.

FIELD OF THE INVENTION

[0002] This invention relates to deep learning networks including, but not limited to, artificial neural networks. More particularly, this invention relates to systems and methods for training deep learning networks from a set of training data using reinforcement learning.

BACKGROUND OF THE INVENTION

[0003] Deep learning networks including, but not limited to, artificial neural networks are machine learning systems that receive data, extract statistics and classify results. These systems use a training set of data to generate a model in order to make data driven decisions to provide a desired output.

[0004] Deep lear...

Application Information

IPC(8): G06N3/08, G06N99/00
CPC: G06N99/005, G06N3/08, G06N3/044, G06N3/045
Inventor: OSBAND, IAN DAVID MOFFAT; VAN ROY, BENJAMIN
Owner: THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIV