Agent learning method based on knowledge guidance-tactical perception

A learning method and technology of agents, applied in the field of agent learning based on knowledge guidance-tactical perception, can solve problems such as strange strategies and lack of game confrontation information, achieving high winning rate, good application value, and shortening training time.

Active Publication Date: 2018-10-09
ZHEJIANG UNIV
View PDF8 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, this reinforcement learning will always learn in the direction of the greatest return, which will also lead to a strange strategy in the end. Researchers cannot analyze what this strategy will be like in advance.
Moreover, since the previous methods are purely data-driven, during the training process of this method, the agent will not be aware of the existence of the opponent, which lacks the confrontation information of the game itself.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Agent learning method based on knowledge guidance-tactical perception
  • Agent learning method based on knowledge guidance-tactical perception
  • Agent learning method based on knowledge guidance-tactical perception

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0066] This embodiment is implemented in different scenarios of StarCraft micro-operations. The specific implementation process of the learning method is as described above, and the specific steps will not be described in detail. The following only shows the effect of the case data. The scenario of this embodiment is as follows:

[0067] m5v5 scenario: In this scenario, our team controls 5 soldiers, and the enemy, the intelligent body built into the computer, controls 5 soldiers.

[0068] Scenario w15v17: In this scenario, we train to control 15 aircraft, and the enemy, the intelligent body built into the computer, also controls 17 aircraft.

[0069] Scenario w18v20: In this scenario, we train to control 18 aircraft, and the enemy, the computer built-in intelligent body, also controls 20 aircraft.

[0070] In this example, the training of this learning method is carried out in each scene, and the specific combat strategy adopted in the example is to attack the weakest and nea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an agent learning method based on knowledge guidance-tactical perception. In the environment of a two-party fight game, human knowledge is used to train an agent which fights with a specific tactical strategy through two stages of training. The method comprises the steps that 1) a game screen and the state of the agent are encoded to form a state feature; 2) a script with aspecific fight strategy is constructed by manually writing a script; 3) the agent fights with the script to collect the state and motion data of an opponent to carry out training in the first stage;4) the trained agent fights with a computer built-in AI, and training in the second stage is carried out through reinforcement learning; and 5) the learning framework is used to train the agent to participate in fight with specific tactics. The method provided by the invention is applicable to agent training in a two-party fight mode in a micro-operation environment, and acquires a good winning rate in the face of various micro-operation fight scenes.

Description

technical field [0001] The invention belongs to the application of deep reinforcement learning in the game field, and in particular relates to an agent learning method based on knowledge guidance-tactical perception. Background technique [0002] Deep reinforcement learning is applied in many places, and the more classic ones are in Go and Atari mini-games. Due to the complexity and real-time nature of the two-player game, after conquering the field of Go, researchers in reinforcement learning have turned to the study of two-player game agents such as StarCraft. Such research can not only strengthen the research on artificial intelligence, but also has high application significance. Because after the intelligent body is constructed, it can be applied to the game industry, put the intelligent body in the game, or apply it to the competitive sports industry, for training beginners or providing the best combat strategy, etc. [0003] Typically, the problems of agent research ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N99/00A63F13/55A63F13/822
CPCA63F13/55A63F13/822A63F2300/65A63F2300/6027A63F2300/60A63F2300/807
Inventor 李玺胡玥李钧涛
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products