Self-adaptive dynamic programming optimal control method and system based on co-state assistance

A dynamic programming and optimal control technology, applied in adaptive control, general control system, control/regulation system, etc., can solve problems such as poor control effect, cost function cannot guarantee accurate evaluation, etc., to reduce computational cost, reduce The number of times, the effect of speeding up

Active Publication Date: 2021-09-07
HUNAN UNIV
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the patent "An Approximate Dynamic Programming Optimization Control Method for Underwater Thermal Glider Attitude Adjustment" owned by Shanghai Maritime University discloses a two-input and two-output decoupling control that is not based on models. Update the control law online, but the cost function cannot guarantee accurate evaluation during the online training process, so it may get worse control effect during the training process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-adaptive dynamic programming optimal control method and system based on co-state assistance
  • Self-adaptive dynamic programming optimal control method and system based on co-state assistance
  • Self-adaptive dynamic programming optimal control method and system based on co-state assistance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034]The present invention will be described in further detail below in conjunction with the embodiments given in the accompanying drawings.

[0035] The specific embodiment of the combination of the present invention is an autonomous underwater robot depth control system, which has the characteristics of strong nonlinearity, strong coupling, and uncertainty in the hydrodynamic coefficient. At the same time, the working environment of the autonomous underwater robot also has random and time Due to the changing dynamic characteristics, we cannot accurately predict external disturbances.

[0036] The flow chart of the adaptive dynamic programming optimal control method based on co-state assistance of the present invention is as follows figure 1 shown.

[0037] Step 1. Construct the state variable, control variable, transfer environment, utility function and cost function in the depth control of autonomous underwater vehicle.

[0038] First define two reference coordinate syst...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a self-adaptive dynamic programming optimal control method based on co-state assistance. The method comprises the following steps: step 1, constructing a state variable, a control variable, a transfer environment, a utility function and a cost function of a system; step 2, constructing an execution network, a co-state network and an evaluation network of self-adaptive dynamic programming based on co-state assistance; step 3, carrying out a strategy evaluation process of self-adaptive dynamic planning based on co-state assistance; step 4, carrying out a strategy improvement process of self-adaptive dynamic planning based on co-state assistance; and step 5, repeating the steps 3-4 until the loop iteration frequency i is greater than the specified frequency or the cost function difference value abs(Ji+1(xt, ut, [Lambda]t)-Ji(xt, ut, [Lambda]t)) after two adjacent external loops is less than a set value. According to the self-adaptive dynamic programming optimal control method based on co-state assistance, the derivative of the cost function to the state quantity serves as an auxiliary variable when the cost function is estimated, meanwhile, the process of dynamic modeling on a complex system is avoided, and the solving precision of the complex dynamic system is improved.

Description

technical field [0001] The invention belongs to the technical field of intelligent control, and in particular relates to an adaptive dynamic programming optimal control method and system based on co-state assistance. Background technique [0002] Complex dynamic systems are ubiquitous in many research fields, such as vehicle systems, underwater robot systems, chemical process systems, energy management systems, etc. With the development of science and technology, people have put forward higher and higher requirements on the performance of these control systems. requirements. Optimal control is to seek a control quantity on the basis of ensuring that the control system can achieve the goal stably, so that the given system cost function reaches the maximum value (or minimum value). However, in practical applications, complex systems usually have characteristics such as high nonlinearity, unknown dynamic characteristics, and model uncertainty, which will make it difficult to e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05B13/04
CPCG05B13/042
Inventor 胡满江叶俊边有钢徐彪秦晓辉谢国涛秦兆博王晓伟秦洪懋丁荣军
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products