Agent learning method based on knowledge guidance-tactical perception

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A learning method and technology of agents, applied in the field of agent learning based on knowledge guidance-tactical perception, can solve problems such as strange strategies and lack of game confrontation information, achieving high winning rate, good application value, and shortening training time.

Active Publication Date: 2018-10-09

ZHEJIANG UNIV

View PDF8 Cites 19 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Therefore, this reinforcement learning will always learn in the direction of the greatest return, which will also lead to a strange strategy in the end. Researchers cannot analyze what this strategy will be like in advance.

Moreover, since the previous methods are purely data-driven, during the training process of this method, the agent will not be aware of the existence of the opponent, which lacks the confrontation information of the game itself.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0066] This embodiment is implemented in different scenarios of StarCraft micro-operations. The specific implementation process of the learning method is as described above, and the specific steps will not be described in detail. The following only shows the effect of the case data. The scenario of this embodiment is as follows:

[0067] m5v5 scenario: In this scenario, our team controls 5 soldiers, and the enemy, the intelligent body built into the computer, controls 5 soldiers.

[0068] Scenario w15v17: In this scenario, we train to control 15 aircraft, and the enemy, the intelligent body built into the computer, also controls 17 aircraft.

[0069] Scenario w18v20: In this scenario, we train to control 18 aircraft, and the enemy, the computer built-in intelligent body, also controls 20 aircraft.

[0070] In this example, the training of this learning method is carried out in each scene, and the specific combat strategy adopted in the example is to attack the weakest and nea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an agent learning method based on knowledge guidance-tactical perception. In the environment of a two-party fight game, human knowledge is used to train an agent which fights with a specific tactical strategy through two stages of training. The method comprises the steps that 1) a game screen and the state of the agent are encoded to form a state feature; 2) a script with aspecific fight strategy is constructed by manually writing a script; 3) the agent fights with the script to collect the state and motion data of an opponent to carry out training in the first stage;4) the trained agent fights with a computer built-in AI, and training in the second stage is carried out through reinforcement learning; and 5) the learning framework is used to train the agent to participate in fight with specific tactics. The method provided by the invention is applicable to agent training in a two-party fight mode in a micro-operation environment, and acquires a good winning rate in the face of various micro-operation fight scenes.

Description

technical field [0001] The invention belongs to the application of deep reinforcement learning in the game field, and in particular relates to an agent learning method based on knowledge guidance-tactical perception. Background technique [0002] Deep reinforcement learning is applied in many places, and the more classic ones are in Go and Atari mini-games. Due to the complexity and real-time nature of the two-player game, after conquering the field of Go, researchers in reinforcement learning have turned to the study of two-player game agents such as StarCraft. Such research can not only strengthen the research on artificial intelligence, but also has high application significance. Because after the intelligent body is constructed, it can be applied to the game industry, put the intelligent body in the game, or apply it to the competitive sports industry, for training beginners or providing the best combat strategy, etc. [0003] Typically, the problems of agent research ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06N99/00A63F13/55A63F13/822

CPCA63F13/55A63F13/822A63F2300/65A63F2300/6027A63F2300/60A63F2300/807

Inventor 李玺胡玥李钧涛

Owner ZHEJIANG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Agent learning method based on knowledge guidance-tactical perception

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology