Game action processing method and device

A processing method and game technology, applied in the field of data processing, can solve problems such as inability to learn game rules and decision-making mistakes

Active Publication Date: 2019-04-16
NETEASE (HANGZHOU) NETWORK CO LTD
View PDF6 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, relying solely on reinforcement learning cannot learn the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Game action processing method and device
  • Game action processing method and device
  • Game action processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0046] Reinforcement learning essentially selects and executes the game action with the highest expected return value in the current game state, but usually cannot learn well from the sample data whether a certain game action is allowed to be executed in the current game state and similar rule information . The main reason is that some game actions can achieve high expected reward values ​​in a few game states (s0), but will not achieve high expected reward values ​​in most other states (s1), due to neural The network has certain abstraction and generalization capabilities, so it is impossible to strictly distinguish the difference between the two game states of s0 and s1, which results in trying to execute the game action in some...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a game action processing method and device. The method comprises the steps of acquiring the current game state and action space of a non-player character; inputting the current game state into a pre-trained reinforcement learning network model to obtain a reinforcement strategy, wherein the reinforcement strategy comprises a first selection probability of each game action; inputting the action space into a pre-trained auxiliary rule network model to obtain an auxiliary strategy, wherein the auxiliary strategy comprises a second selection probability ofeach game action; determining a target strategy according to the reinforcement strategy and the auxiliary strategy, wherein the target strategy comprises a target probability generated by each game action on the basis of the corresponding first selection probability and the corresponding second selection probability; screening out a target game action from the action space according to the targetprobability of each game action and controlling the non-player character to execute the target game action. The game action processing method and device can improve the game experience of players.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a game action processing method and a game action processing device. Background technique [0002] Reinforcement learning, also known as reinforcement learning and evaluation learning, is an important machine learning method that has many applications in the fields of intelligent control robots and analysis and prediction. The policy network (Policy Network) trained through reinforcement learning, the so-called policy network, is to establish a neural network model, which can directly predict the most current policy (policy) by observing the state of the environment. Execution of this policy can obtain the current and the maximum expected return value (reward) in the future. [0003] Reinforcement learning has a wide range of application scenarios in games. The policy network trained through reinforcement learning has higher intelligence in games than the strategies manu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): A63F13/822G06N3/08
CPCA63F13/822G06N3/08
Inventor 陈赢峰林磊范长杰
Owner NETEASE (HANGZHOU) NETWORK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products