Game action processing method and device

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A processing method and game technology, applied in the field of data processing, can solve problems such as inability to learn game rules and decision-making mistakes

Active Publication Date: 2019-04-16

NETEASE (HANGZHOU) NETWORK CO LTD

View PDF6 Cites 22 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, relying solely on reinforcement learning cannot learn the game rules in the game, which will lead to decision-making errors

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0045] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0046] Reinforcement learning essentially selects and executes the game action with the highest expected return value in the current game state, but usually cannot learn well from the sample data whether a certain game action is allowed to be executed in the current game state and similar rule information . The main reason is that some game actions can achieve high expected reward values in a few game states (s0), but will not achieve high expected reward values in most other states (s1), due to neural The network has certain abstraction and generalization capabilities, so it is impossible to strictly distinguish the difference between the two game states of s0 and s1, which results in trying to execute the game action in some...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention provides a game action processing method and device. The method comprises the steps of acquiring the current game state and action space of a non-player character; inputting the current game state into a pre-trained reinforcement learning network model to obtain a reinforcement strategy, wherein the reinforcement strategy comprises a first selection probability of each game action; inputting the action space into a pre-trained auxiliary rule network model to obtain an auxiliary strategy, wherein the auxiliary strategy comprises a second selection probability ofeach game action; determining a target strategy according to the reinforcement strategy and the auxiliary strategy, wherein the target strategy comprises a target probability generated by each game action on the basis of the corresponding first selection probability and the corresponding second selection probability; screening out a target game action from the action space according to the targetprobability of each game action and controlling the non-player character to execute the target game action. The game action processing method and device can improve the game experience of players.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a game action processing method and a game action processing device. Background technique [0002] Reinforcement learning, also known as reinforcement learning and evaluation learning, is an important machine learning method that has many applications in the fields of intelligent control robots and analysis and prediction. The policy network (Policy Network) trained through reinforcement learning, the so-called policy network, is to establish a neural network model, which can directly predict the most current policy (policy) by observing the state of the environment. Execution of this policy can obtain the current and the maximum expected return value (reward) in the future. [0003] Reinforcement learning has a wide range of application scenarios in games. The policy network trained through reinforcement learning has higher intelligence in games than the strategies manu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): A63F13/822G06N3/08

CPCA63F13/822G06N3/08

Inventor陈赢峰林磊范长杰

OwnerNETEASE (HANGZHOU) NETWORK CO LTD

Game action processing method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology