Game environment automatic decomposition method adaptive to hierarchical reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and automatic decomposition, applied in neural learning methods, indoor games, video games, etc., can solve problems such as labor waste, energy consumption, and decomposition errors, and achieve the effects of reducing learning difficulty, improving productivity, and improving applicability

Pending Publication Date: 2021-06-15

NANJING UNIV

View PDF7 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

But in this case, the first is that people need manual labor. The more complex the environment, the more energy they spend on manual design. It is so that every environment requires repeated human labor. Even similar environments require repeated decomposition and design by humans, resulting in unnecessary waste of labor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0051] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0052] A method for automatically decomposing game environments adapted to hierarchical reinforcement learning, in which we use convolutional neural network visualization techniques to localize and cluster rewards in game environments to corresponding tasks, and then use hierarchical reinforcement learning with Backsight experience cache training sub-strategy to train stronger game AI.

[0053] figure 1 It is a schematic diagram of overall training described in the present invention. As shown in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a game environment automatic decomposition method adaptive to hierarchical reinforcement learning, and relates to two aspects, one is a weak supervision semantic segmentation technology in the aspect of computer vision, is a task decomposition module, and the other is the field of hierarchical reinforcement learning in reinforcement learning, and is a strategy training module. According to the method, the learning difficulty of reinforcement learning can be greatly reduced, so that a stronger game AI can be trained in a more complex game environment. Therefore, on one hand, the productivity of a game company in the aspect of designing the game AI can be improved, and on the other hand, the applicability of the reinforcement learning technology can be improved, so that the reinforcement learning technology can be further applied to more general fields.

Description

technical field [0001] The invention relates to a method for automatically decomposing a game environment adapted to layered reinforcement learning, which involves computer vision in machine learning and relevant field knowledge of reinforcement learning. Background technique [0002] With the increasing development of reinforcement learning, the demand for reinforcement learning to be implemented in real scenarios is daily strong. At present, reinforcement learning has a good application in training game AI, such as Go, StarCraft, Glory of Kings, QQ Speed, etc. These mainly adopt existing reinforcement learning algorithms, such as the DQN algorithm. However, reinforcement learning algorithms have great learning difficulties in complex environments, which are manifested in the slowness and instability of learning. For this reason, on the one hand, everyone is trying to invent more powerful reinforcement learning algorithms, and on the other hand, everyone is also trying to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): A63F13/67G06K9/62G06N3/04G06N3/08

CPCA63F13/67G06N3/08A63F2300/6027G06N3/045G06F18/241

Inventor 俞扬詹德川周志华徐寅

Owner NANJING UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Game environment automatic decomposition method adaptive to hierarchical reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology