Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Decision-making method based on deep reinforcement learning

A technology of reinforcement learning and decision-making methods, applied in the field of decision-making based on deep reinforcement learning, can solve problems such as slow training process, poor performance, and difficult to explain results, so as to avoid catastrophic decision-making and improve training effect

Pending Publication Date: 2021-02-02
SHENZHEN UNIV
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, deep reinforcement learning also faces some problems. First, its training process can be very slow and resource-intensive. The final system is usually fragile, the results are difficult to interpret, and it performs poorly for a long time at the beginning of training.
Furthermore, for applications in robotics and critical decision support systems, using deep reinforcement learning can even make catastrophic decisions with costly consequences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Decision-making method based on deep reinforcement learning
  • Decision-making method based on deep reinforcement learning
  • Decision-making method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] Various exemplary embodiments of the present invention will now be described in detail with reference to the accompanying drawings. It should be noted that the relative arrangements of components and steps, numerical expressions and numerical values ​​set forth in these embodiments do not limit the scope of the present invention unless specifically stated otherwise.

[0041] The following description of at least one exemplary embodiment is merely illustrative in nature and in no way taken as limiting the invention, its application or uses.

[0042] Techniques, methods and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods and devices should be considered part of the description.

[0043] In all examples shown and discussed herein, any specific values ​​should be construed as exemplary only, and not as limitations. Therefore, other instances of the exemplary embodiment may have dif...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a decision-making method based on deep reinforcement learning. The method comprises the steps that an intelligent agent makes a decision according to environment information and selects a decided action; the intelligent agent compares the decided action with a knowledge base, and decides to execute the decided action or replace the decided action based on a preset rule setin the knowledge base; the intelligent agent executes the decided action or the replaced action in the environment, obtains rewards and new environment information from the environment, combines the old environment information, the action, the rewards and the new environment information into experience information, and stores the experience information into an experience playback pool; and a preset amount of experience information is randomly selected from the experience playback pool to update the deep reinforcement learning model so as to guide the next iteration. By utilizing the method, the training time can be shortened, disastrous decisions can be avoided, and the method can be widely applied to the field of dynamic decisions.

Description

technical field [0001] The present invention relates to the field of artificial intelligence, and more specifically, to a decision-making method based on deep reinforcement learning. Background technique [0002] Reinforcement learning is a field in machine learning that describes and solves problems in which agents learn strategies to maximize rewards or achieve specific goals during interactions with the environment. [0003] Currently, deep reinforcement learning has been successfully applied to a variety of dynamic decision-making domains, especially those with large state spaces. However, deep reinforcement learning also faces some problems. First, its training process can be very slow and resource-intensive, and the final system is usually fragile, the results are difficult to interpret, and it performs poorly for a long time at the beginning of training. Moreover, for applications in robotics and critical decision support systems, it is even possible to make catastro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): A63F13/822A63F13/837G06N3/04G06N20/00
CPCA63F13/822A63F13/837G06N20/00G06N3/045
Inventor 张昊迪伍楷舜陈振浩高子航李启凡
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products