Ship control method based on competitive hybrid network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A hybrid network and control method technology, applied in neural learning methods, biological neural network models, instruments, etc., can solve problems such as low ship control accuracy

Active Publication Date: 2021-02-26

HARBIN ENG UNIV

View PDF4 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0010]The purpose of this invention is to solve the problem of low control accuracy of existing ships in complex environments, and propose a ship control method based on competitive hybrid network

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

specific Embodiment approach 1

[0039] Specific implementation mode one: the specific process of the ship control method based on the competitive hybrid network in this implementation mode is as follows:

[0040] Step 1. Establishing an individual agent network model;

[0041] Step 2. Establish a dominant hybrid network model;

[0042] Step 3. Establish a state-value hybrid network model;

[0043] Step 4: Input the individual observation history into the individual agent network model to obtain the individual advantage value function A i (τ i ,a i ) and the individual state value function V i (τ i );

[0044] The individual advantage value function A i (τ i ,a i ) is passed to the dominant mixed network model, and the dominant mixed network model outputs the joint advantage function value A tot (τ,a);

[0045] The individual state value function V i (τ i ) is passed to the state value mixed network model, and the state value mixed network model outputs the joint state value mixed value V tot (τ...

specific Embodiment approach 2

[0052] Specific embodiment two: the difference between this embodiment and specific embodiment one is: the individual agent network model is established in the step one; the specific process is:

[0053] The individual agent network model includes n agent network structures (the number of n depends on the specific application scenario);

[0054] The network structure of each agent is the same, consisting of Group1 and Group2;

[0055] Group1 includes an input layer, a ReLU activation layer, a GRU unit and the first linear network layer;

[0056] The output end of the input layer is connected to the input end of the ReLU activation layer, the output end of the ReLU activation layer is connected to the input end of the GRU unit, and the output end of the GRU unit is connected to the first linear network layer;

[0057] These structures are connected in series.

[0058] The input layer accepts the observation history of the agent, and the observation history consists of the loc...

specific Embodiment approach 3

[0064] Specific implementation mode three: the difference between this implementation mode and specific implementation mode one or two is that: in the step two, a dominant hybrid network model is established; specifically:

[0065] The advantage hybrid network model consists of a parameter network structure and a super network structure;

[0066] The parameter network structure includes a fourth linear network layer and a fifth linear network layer, the fourth linear network layer and the fifth linear network layer are connected in parallel, and the inputs of the fourth linear network layer and the fifth linear network layer are global observations;

[0067] The fourth linear network layer processes the input global observation value to obtain an output value, which is used as the parameter matrix of the first layer network in the hypernetwork structure;

[0068] The fifth linear network layer processes the input global observation value to obtain the output value as the param...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a ship control method, in particular to a ship control method based on a competitive hybrid network. The objective of the invention is to solve the problem of low control precision of an existing ship in a complex environment. The method comprises the following steps: 1, establishing an individual agent network model; 2, establishing a dominant hybrid network model; 3, establishing a state value hybrid network model; 4, inputting the individual observation history into the individual intelligent agent network model to obtain an individual dominant value function and anindividual state value function, transmitting the individual dominant value function to a dominant hybrid network model, and outputting a joint dominant function value by the dominant hybrid networkmodel; transmitting the individual state value function to a state value hybrid network model, and outputting a joint state value hybrid value by the state value hybrid network model; and adding the joint dominance function value and the joint state value mixed value to obtain a joint action value function. The method is applied to the field of ship control.

Description

technical field [0001] The invention relates to a ship control method. Background technique [0002] Many problems in real life can be attributed to cooperative multi-agent problems, such as traffic planning, robot control and automatic driving. However, there are many problems in directly applying the multi-intelligence algorithm to this real situation. For example, in reality, the field of view of the agent should only be local, and as the number of agents increases, the action space of the model will grow explosively, and How to judge the contribution of each agent in each task. These problems make the performance of multi-agent algorithms unsatisfactory in practical tasks. [0003] The initial solution to these problems is that each agent has a network to learn its own strategy alone, that is, independent learners (IQL). But this approach leads to no communication between agents, and the agent will regard other agents as part of the environment. This instability will ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F30/27G06N3/04G06N3/08G06N20/00

CPCG06F30/27G06N3/08G06N20/00G06N3/045

Inventor 王红滨谢晓东何鸣王念滨周连科崔琎

Owner HARBIN ENG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Ship control method based on competitive hybrid network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

specific Embodiment approach 1

specific Embodiment approach 2

specific Embodiment approach 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology