A dual-tree Monte Carlo search algorithm for sequential synchronous games

A search algorithm and sequential technology, applied in the field of Monte Carlo search algorithm, can solve problems such as low complexity, combinatorial explosion, and degradation of solution quality, and achieve the effects of increasing search depth, reducing selection branches, and reducing scale

Inactive Publication Date: 2018-12-11
NORTHEASTERN UNIV LIAONING
View PDF0 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] For sequential simultaneous game problems, the existing search methods can only deal with low-complexity game problems. This is because the number of nodes in the game tree is within an acceptable range, so it can effectively solve
However, in the case of a large scale, if the game tree search method is directly used to solve the problem, due to the huge number of action combinations and the exponential increase of the nodes in the tree with the increase of depth, it will inevitably lead to the problem of combination explosion.
Therefore, the existing search methods usually reduce it to a sequential asynchronous problem first, and then use the max-min search algorithm to solve it, but the quality of the solution is degraded due to the serious simplification of the problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A dual-tree Monte Carlo search algorithm for sequential synchronous games
  • A dual-tree Monte Carlo search algorithm for sequential synchronous games
  • A dual-tree Monte Carlo search algorithm for sequential synchronous games

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] The specific embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0065] as attached figure 1 As shown in -6, a double-tree Monte Carlo search algorithm for sequential synchronous games, the algorithm is applicable to a search system, the system includes a search server, a search entry and a search device, and the algorithm includes the following steps:

[0066] Step 1-1: Establish a sequential and synchronized double game tree, abstract the synchronization and sequence respectively, and model the global optimization, use A and B to represent two players respectively, and obtain the actions of both parties in one environment sequence, the opponent's decision-making node can only execute actions, and the opponent's information completes the interaction through the environment;

[0067] Step 1-2: Distinguish the properties of game tree nodes and search algorithms for perfect information and imperfect inform...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a dual-tree Monte Carlo search algorithm for solving a large-scale sequential synchronous game problem. Compared with a sequential synchronous Monte Carlo search based on single tree structure, the invention has the advantages that two trees are established to represent the state transition of both sides of the game, which greatly reduces the selection branches of the gametree and reduces the size of the game tree while maintaining the characteristics of synchronous operation, the explosion problem caused by synchronous action is eliminated and the searching depth is increased, the quality of the solution is ensured, and also the efficiency of the solution is improved. Specific technical means include: through the construction of a Nash equilibrium support library,the problem that synchronous Nash equilibrium online calculation time is too long is solved; a deep strategy net and a deep valuation net of a sequential synchronous game are designed, realizing knowledge guidance of sequential synchronous search; research on environment-oriented reinforcement learning solves the decision-making problem under the condition of state transition or lack of benefits.

Description

technical field [0001] The invention relates to the field of machine game search, in particular to a Monte Carlo search algorithm using a double tree structure. Background technique [0002] The Monte Carlo method, also known as the statistical simulation method, is a kind of very important numerical calculation guided by the theory of probability and statistics, which was proposed in the mid-1940s due to the development of science and technology and the invention of electronic computers. method. Refers to the use of random numbers (or more commonly pseudorandom numbers) to solve many computational problems. In the 1970s, the theoretical research on the Monte Carlo method reached its peak. So far, for the research of Monte Carlo theory and method, the theoretical and practical research in the United States is still in a leading position. Many other countries are now involved in Monte Carlo studies. Monte Carlo research has strongly promoted the application and developmen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N5/00
CPCG06N5/01
Inventor 王骄潘家鑫黄湛钧
Owner NORTHEASTERN UNIV LIAONING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products