Multi-Concurrent Real-time Adversarial System for Reinforcement Learning Training and Evaluation

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A reinforcement learning and engine technology, applied in the field of artificial intelligence, can solve problems such as the failure of the anti-decision-making effect to meet expectations, the unused memory training mode, and the inapplicability of reinforcement learning methods for training and evaluation, etc., to achieve fast message transmission and improved results , the effect of fast training speed

Active Publication Date: 2021-07-20

INST OF AUTOMATION CHINESE ACAD OF SCI

View PDF9 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In order to solve the above-mentioned problems in the prior art, that is, the existing confrontation system does not use the memory training mode, so the system is not suitable for the training and evaluation of the reinforcement learning method, so that the confrontation decision-making effect cannot reach the expected problem, the present invention provides a A multi-concurrent real-time confrontation system oriented to reinforcement learning training and evaluation, the real-time confrontation system includes an engine kernel module, a confrontation scheduling management module, a deduction client and a confrontation observation terminal;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0077] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, not to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0078] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0079] War game deduction is a game that effectively simulates real wars. It is known as the "magician of war" and is inseparable from real wars. Wargame deduction abstracts and extracts typical decision-making factors of military confrontation, and well simulates the ubiquitous inc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the technical field of artificial intelligence, and specifically relates to a multi-concurrent real-time confrontation system oriented to reinforcement learning training and evaluation, aiming at solving the problem that the existing confrontation system does not use the memory training mode, so the system is not suitable for the training and training of reinforcement learning methods. Evaluation, so as to combat the problem that the effect of decision-making is not as expected. The invention includes: a confrontation scheduling management module, which creates confrontation sites, confrontation processes, and confrontation scenario parameters according to confrontation requirements; an engine kernel module, combined with deduction personnel or AI action sets, updates deduction status and situation, and generates real-time deduction situation data; deduction users On the terminal, the situational data of real-time deduction is parsed into graphics presented in map grids and displayed, and the operation instructions of the deduction personnel or AI are obtained and action sets are generated; on the counter-observation side, the situational data of real-time deduction is parsed into 3D models and graphics and displayed And switch the display at the set viewing angle. The countermeasure system of the invention has good countermeasure decision-making effect and wide application.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and in particular relates to a multi-concurrent real-time confrontation system oriented to reinforcement learning training and evaluation. Background technique [0002] With the development of artificial intelligence technology represented by deep learning, human beings have made great progress in tasks of "perceived intelligence" such as image processing, speech recognition, and text processing. However, "perceptual intelligence" is the ability of machines to obtain information through various sensors. The main defect is that each algorithm is only applicable to specific problems and does not have the complete cognitive ability of humans. In contrast, "cognitive intelligence" refers to the ability of machines to actively think, understand, and reason, and to achieve self-learning, purposeful reasoning, and interaction with the environment without prior programming by humans. Alt...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06F30/20G06Q50/26

CPCG06Q50/26G06F30/20

Inventor 倪晚成邢思远胡健王士贤徐泽培

Owner INST OF AUTOMATION CHINESE ACAD OF SCI

Multi-Concurrent Real-time Adversarial System for Reinforcement Learning Training and Evaluation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology