Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Bionic robotic fish cluster navigation simulation method based on deep reinforcement learning technology

A technology of reinforcement learning and simulation methods, applied in neural learning methods, biological models, biological neural network models, etc., can solve problems such as slow convergence speed, small action space and state space, and inability to truly reflect the behavior of fish groups

Pending Publication Date: 2021-09-21
NORTHEASTERN UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Artificial fish swarm algorithm (AFSA) has problems such as low convergence accuracy, easy to fall into local optimum, and slow convergence speed in the later stage.
The algorithm is very sensitive to various hyperparameters and is easily affected by step size, population size and crowding factor, so it has great limitations
[0006] In order to simplify the training, the current method of using deep reinforcement learning to simulate the flocking behavior of fish schools is mostly a simple 2D environment, with small action space and state space, which cannot truly reflect the flocking behavior of fish flocks in nature.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bionic robotic fish cluster navigation simulation method based on deep reinforcement learning technology
  • Bionic robotic fish cluster navigation simulation method based on deep reinforcement learning technology
  • Bionic robotic fish cluster navigation simulation method based on deep reinforcement learning technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0062] In this embodiment, the bionic robot fish cluster navigation simulation method based on deep reinforcement learning technology, such as figure 1 shown, including the following steps:

[0063] Step 1, constructing a 3D fish cluster environment model;

[0064] Step 1.1, build a fish cluster environment;

[0065] In order to simulate the real-world fish cluster environment, a 3D scene is constructed in the Unity3D engine system with the length of a bionic robot fish as 1 unit; the surrounding and top of the 3D scene are set as transparent air walls, and the bottom of the 3D scene simulates a real ocean The terrain is composed of uneven ground and aquatic plants; the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a bionic robotic fish cluster navigation simulation method based on the deep reinforcement learning technology, and relates to the technical field of multi-agent path navigation planning. The method comprises the following steps: firstly, constructing a 3D fish swarm environment model, and then constructing an agent model of a bionic robotic fish cluster in the 3D fish swarm environment; the intelligent agent model comprises a perception model, a movement model and a decision-making model; constructing a reward function of the fish swarm cluster, and introducing a curiosity mechanism into the reward function; constructing a distributed training framework of an agent model based on a curiosity mechanism and a PPO2 algorithm, and enabling an agent to obtain a behavior strategy in a learning mode; and finally, training an agent model based on the constructed distributed training framework to realize navigation simulation of the bionic robotic fish cluster. According to the method, a virtual fish school can learn reasonable fish school behaviors in a 3D environment, and the method is applied to bionic robot fish school navigation behaviors in the real world.

Description

technical field [0001] The invention relates to the technical field of multi-agent path navigation planning, in particular to a bionic robot fish swarm navigation simulation method based on deep reinforcement learning technology. Background technique [0002] The flocking behavior of fish is a typical self-organization phenomenon. In the process of swimming, fish will naturally gather in groups to ensure their own survival, and display complex group behaviors. The swimming of each fish only needs to follow two basic rules: follow the fish next to it; keep moving. If only based on these two simple rules to simulate the behavior of natural fish swarms and realize bionic robot fish swarm navigation, it is difficult to realize most artificial swarm systems at present. [0003] A common method used to simulate fish swarm behavior to realize bionic robot fish swarm navigation is Artificial Fish Swarm Algorithm (AFSA). Artificial fish swarm algorithm is an optimization algorithm...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T19/00G06N3/00G06N3/08
CPCG06T19/003G06N3/006G06N3/084Y02A40/81
Inventor 高天寒张岩
Owner NORTHEASTERN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products