Artificial intelligence AI model training method and device, equipment and medium

A technology of artificial intelligence and training methods, applied in computing models, indoor games, machine learning, etc., can solve problems such as the inaccurate state value of the value network and the complexity of the game environment

Pending Publication Date: 2021-01-15
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, due to the complexity of the game environment of MOBA games and the change of game parameters due to the execution of actions, the changes of different game parameters have different influences on strategic decisions. Th

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Artificial intelligence AI model training method and device, equipment and medium
  • Artificial intelligence AI model training method and device, equipment and medium
  • Artificial intelligence AI model training method and device, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the purpose, technical solution and advantages of the present application clearer, the implementation manners of the present application will be further described in detail below in conjunction with the accompanying drawings.

[0035] Firstly, several nouns involved in the embodiments of the present application are briefly introduced.

[0036] Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technique of computer science that attempts to understand the nature of intelligence and produce a new kind of intelligent machine that can respond in a similar way to human intelligence. Artificial intelligence is to study the design principles and implementation ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an artificial intelligence AI model training method and device, equipment and a medium, and relates to the field of artificial intelligence machine learning. The method comprises the steps that an artificial intelligence AI model is called to conduct game match in a game program to obtain training data, wherein the training data comprises a reference game state in the gamematch, a target game action output by a decision network according to the reference game state and a state value output by a value network according to the reference game state, the state value comprises k state sub-values on k value classifications, and k is an integer greater than 1; according to the training data and k value calculation formulas corresponding to the k value classifications, theaction value of the target game action adopted by the artificial intelligence AI model in the reference game state is calculated, wherein the action value comprises k action sub-values on the k valueclassifications; and the artificial intelligence AI is trained model according to the difference between the state value and the action value. The method can improve the accuracy of estimating the state value of the value network.

Description

technical field [0001] The embodiments of the present application relate to the field of machine learning of artificial intelligence, and in particular to a training method, device, equipment and medium of an artificial intelligence AI model. Background technique [0002] Reinforcement learning is one of the paradigms and methodologies of machine learning, which is used to describe and solve the problem that an agent (agent, also known as "agent") learns strategies to maximize rewards or achieve specific goals in the process of interacting with the environment. The AI ​​(Artificial Intelligence, artificial intelligence) model designed based on reinforcement learning can make game decisions to win the game. [0003] The AI ​​model includes a decision network and a value network. The decision network is used to determine action instructions based on the game state, and the value network is used to evaluate the value of the game state. In the training phase of the AI ​​model, i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): A63F13/67G06N20/00
CPCA63F13/67G06N20/00
Inventor 韩国安邱福浩王亮付强
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products