Robot degenerated environment obstacle avoidance method based on impulsive neural network internal plasticity
A spiking neural network and plasticity technology, applied in the field of brain-like robots, to achieve the effect of improving the success rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0038] The specific implementation manners of the present invention will be further described below in conjunction with the drawings and technical solutions.
[0039] This method uses LIF and SRM neuron models as the main neuron structure of the network, and uses DDPG as the framework of deep reinforcement learning. The state includes lidar data, the distance from the target point and the speed at the last moment; the action is composed of brain-like robots. Composed of linear velocity and angular velocity; the reward function includes the state of the distance from the target at each moment (the closer is the positive reward, and vice versa), it is -20 if there is a collision, and 30 if it reaches the target point, encouraging the robot to take every step The range of action taken should not be too large, that is, it should not exceed 1.7 times the angular velocity at the previous moment.
[0040] The reinforcement learning algorithm is implemented in Pytorch. Stochastic gra...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


