Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Game environment automatic decomposition method adaptive to hierarchical reinforcement learning

A technology of reinforcement learning and automatic decomposition, applied in neural learning methods, indoor games, video games, etc., can solve problems such as labor waste, energy consumption, and decomposition errors, and achieve the effects of reducing learning difficulty, improving productivity, and improving applicability

Pending Publication Date: 2021-06-15
NANJING UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But in this case, the first is that people need manual labor. The more complex the environment, the more energy they spend on manual design. It is so that every environment requires repeated human labor. Even similar environments require repeated decomposition and design by humans, resulting in unnecessary waste of labor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Game environment automatic decomposition method adaptive to hierarchical reinforcement learning
  • Game environment automatic decomposition method adaptive to hierarchical reinforcement learning
  • Game environment automatic decomposition method adaptive to hierarchical reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0052] A method for automatically decomposing game environments adapted to hierarchical reinforcement learning, in which we use convolutional neural network visualization techniques to localize and cluster rewards in game environments to corresponding tasks, and then use hierarchical reinforcement learning with Backsight experience cache training sub-strategy to train stronger game AI.

[0053] figure 1 It is a schematic diagram of overall training described in the present invention. As shown in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a game environment automatic decomposition method adaptive to hierarchical reinforcement learning, and relates to two aspects, one is a weak supervision semantic segmentation technology in the aspect of computer vision, is a task decomposition module, and the other is the field of hierarchical reinforcement learning in reinforcement learning, and is a strategy training module. According to the method, the learning difficulty of reinforcement learning can be greatly reduced, so that a stronger game AI can be trained in a more complex game environment. Therefore, on one hand, the productivity of a game company in the aspect of designing the game AI can be improved, and on the other hand, the applicability of the reinforcement learning technology can be improved, so that the reinforcement learning technology can be further applied to more general fields.

Description

technical field [0001] The invention relates to a method for automatically decomposing a game environment adapted to layered reinforcement learning, which involves computer vision in machine learning and relevant field knowledge of reinforcement learning. Background technique [0002] With the increasing development of reinforcement learning, the demand for reinforcement learning to be implemented in real scenarios is daily strong. At present, reinforcement learning has a good application in training game AI, such as Go, StarCraft, Glory of Kings, QQ Speed, etc. These mainly adopt existing reinforcement learning algorithms, such as the DQN algorithm. However, reinforcement learning algorithms have great learning difficulties in complex environments, which are manifested in the slowness and instability of learning. For this reason, on the one hand, everyone is trying to invent more powerful reinforcement learning algorithms, and on the other hand, everyone is also trying to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): A63F13/67G06K9/62G06N3/04G06N3/08
CPCA63F13/67G06N3/08A63F2300/6027G06N3/045G06F18/241
Inventor 俞扬詹德川周志华徐寅
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products