Game control method and device and storage medium
A game control and game technology, applied in indoor games, video games, sports accessories, etc., can solve the problems of deep reinforcement learning instability, reduce the effect of event control, amplify the overestimation of deep reinforcement learning algorithms, etc., to improve the control effect, Reduce the effect of overestimation and coupling reduction
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0074] An embodiment of the present invention provides a game control method, such as figure 1 As shown, the method includes:
[0075] S101. Acquire the current video frame when it is detected that the target video game starts;
[0076] When the device detects that the target video game starts, the first frame image when the target video game starts is used as the current video frame, and the current video frame represents the current state.
[0077] In some embodiments, when the device detects that the target video game starts, it also sets the playback memory unit, the total capacity of the playback memory unit, the total number of preset training rounds, the preset training total step size of each round of training, the preset online A value network and a preset target value network; wherein, the playback memory unit is used to store data generated when controlling the target video game; the preset training total step size is greater than the preset number of samples.
[...
Embodiment 2
[0135] Based on the same inventive concept of the first embodiment, further description will be made.
[0136] An embodiment of the present invention provides a game control device, such as Figure 5 As shown, the game control device 3 includes:
[0137] The control unit 31 is used to obtain the current video frame when detecting that the target video game starts; and based on the current video frame and the preset online value network, obtain the current grayscale image and the current action, and control the target video game to execute the current action, Get the current reward value and the next video frame;
[0138] The data construction unit 32 is used to obtain the current five-tuple based on the current grayscale image, the current action, the current reward value, the next video frame and the preset online value network and save it to the preset database;
[0139] The parameter updating unit 33 is used for when the number of obtained current quintuples is greater th...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com