The invention belongs to the field of intelligent robots, and specifically relates to a robot motion decision-making method, system, and device that introduce an emotion regulation mechanism, aiming to solve the problem of balancing robot decision-making speed against learning efficiency. The
method includes: using an environmental perception model to generate the predicted state value at the next moment from the current action variable and state value; updating the state-action value function network based on the action variable, state value, and immediate reward; obtaining a predicted trajectory from the environmental perception model, calculating the local optimal solution of the predicted trajectory, and performing differential dynamic programming to obtain the model-based optimal decision; minimizing the state-action value function according to the current state and strategy to obtain a model-free decision; and generating an emotional response signal through the computational model of emotional processing, based on the state prediction error, the reward prediction error, and the average reward value, and selecting the decision path (model-based or model-free) according to the threshold of the signal. The present invention gradually improves decision-making speed while preserving learning efficiency.
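
The final selection step can be illustrated with a minimal Python sketch. The abstract does not give the form of the computational model of emotional processing, so everything here is an assumption made for illustration: the function names `emotional_response` and `select_decision`, the weights, the sigmoid squashing, and the rule that a signal at or above the threshold selects the model-based decision are all hypothetical.

```python
import math


def emotional_response(state_pred_error, reward_pred_error, avg_reward,
                       w_state=1.0, w_reward=1.0, w_avg=1.0):
    """Hypothetical stand-in for the computational model of emotional processing.

    Combines the state prediction error, the reward prediction error, and the
    average reward value into one scalar signal in (0, 1). The weighted sum
    and the sigmoid are assumptions; the source does not specify them.
    """
    arousal = (w_state * abs(state_pred_error)
               + w_reward * abs(reward_pred_error)
               - w_avg * avg_reward)
    return 1.0 / (1.0 + math.exp(-arousal))


def select_decision(signal, threshold, model_based_action, model_free_action):
    """Pick a decision path by comparing the signal with a threshold.

    Assumed rule: a signal at or above the threshold selects the slower
    model-based decision; a signal below it selects the faster model-free one.
    """
    if signal >= threshold:
        return "model_based", model_based_action
    return "model_free", model_free_action


# Large prediction errors and no accumulated reward (for example, early in learning).
early = emotional_response(state_pred_error=2.0, reward_pred_error=1.5, avg_reward=0.0)
early_choice = select_decision(early, 0.5, "planned_action", "reactive_action")

# Small prediction errors and a positive average reward (for example, after learning).
late = emotional_response(state_pred_error=0.05, reward_pred_error=0.02, avg_reward=1.0)
late_choice = select_decision(late, 0.5, "planned_action", "reactive_action")
```

Under these assumed inputs the first call selects the model-based path and the second the model-free path, which is one way the decision-making speed could rise gradually as the prediction errors shrink.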