Enshinstein chess game algorithm based on reinforcement learning

A technology of Einstein chess and reinforcement learning, applied in the field of Einstein chess game algorithm, can solve problems such as human level limitation, achieve the effects of improving accuracy, saving simulation game time, and improving execution efficiency

A technology of Einstein chess and reinforcement learning, applied in the field of Einstein chess game algorithm, can solve problems such as human level limitation, achieve the effects of improving accuracy, saving simulation game time, and improving execution efficiency

CN110119804AInactive Publication Date: 2019-08-13ANHUI UNIVERSITY

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Enshinstein chess game algorithm based on reinforcement learning
  • Enshinstein chess game algorithm based on reinforcement learning
  • Enshinstein chess game algorithm based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to better understand the technical solution of the present invention, the present invention will be described in more detail below in conjunction with specific examples and accompanying drawings.

[0045] 1. Checkerboard features

[0046] Features are the variables required to describe the characteristics of the chessboard, and are some useful information extracted from the Einstein chessboard according to the Einstein chess rules. The present invention firstly extracts the chess piece features of each chess piece, and combines them linearly into a chessboard feature vector. The characteristics of the chess piece include the probability of moving (Probability), the coordinates of the chess piece (Coordinate) and the threat (Threat), which are combined below figure 1 The chessboard in specifically describes the chess piece characteristics.

[0047] 1) Move probability

[0048] Before the two sides move, the chess pieces that can be moved are determined accord...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an Enshinstein chess game algorithm based on reinforcement learning. BP neural network is applied to a value evaluation method of a chessboard and an action selection strategyof a Monte Carlo tree search algorithm. The characteristics of the chessboard are learned and the network parameters are gradually adjusted by means of the self-chess-playing learning rule of the reinforcement learning method, so that the value evaluation of the BP neural network on the chessboard and the strategy calculation of the chess playing action are gradually accurate, and the performanceof the whole game algorithm is gradually improved. According to the invention, the two BP neural networks are respectively used as a value estimation function and a behavior strategy function of the Einstein chess. The reinforcement learning algorithm serves as an evolution mechanism for adjusting BP neural network parameters, the defect that the level of an existing Enshinstein chess training setis limited by the human level is overcome, and the upper limit of the game level of the Enshinstein chess is improved.

Description

technical field [0001] The invention belongs to the research field of chess game machine games, in particular to an Einstein chess game algorithm based on reinforcement learning. Background technique [0002] Artificial Neural Network (ANN) is a research hotspot in the field of artificial intelligence since the 1980s. A neural network is an operational model consisting of a large number of nodes (or called neurons) that interact with each other. Each node represents a special processing function called an activation function. Each connection between two nodes represents a weighted value for the signal passing through the connection, called weight, which is equivalent to the learning experience of the artificial neural network. The output of the network varies according to the way the network is connected, the weight value and the activation function. The network itself is usually an approximation to a certain algorithm or function in nature, or it may be an expression of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
13 Aug 2019
Publication
CN110119804A
IPC
G06N3/04; G06N3/08; A63F11/00
CPC
G06N3/08; A63F11/0074; A63F2011/0093; G06N3/047; G06N3/045
Inventors
袁仪驰; 吴蕾