Mechanical arm anti-interference motion planning method based on multi-agent reinforcement learning
A multi-agent and reinforcement learning technology, applied in manipulators, program-controlled manipulators, claw arms, etc., can solve problems such as weak anti-interference ability, achieve strong anti-interference ability, improve robustness, and improve anti-interference ability.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment approach 1
[0053] Specific embodiment one: a kind of anti-jamming motion planning algorithm of the manipulator based on multi-agent reinforcement learning, comprises the following steps:
[0054] Step 1: Discretize the manipulator into a multi-intelligence form based on the joint graph;
[0055] Step 11: Take the joints of the n-degree-of-freedom manipulator as nodes V={1,2,...,n} in the graph, and the links between the joints as edges ε, then the joint graph of the manipulator can be expressed as It is an undirected graph G=(V,ε);
[0056] Step 1 and 2: Each agent can select joint nodes from the joint graph to build its own sub-joint graph, and each agent can only control the joint nodes in its own sub-joint graph;
[0057] Step 13: Design the observation space for each agent. The observation information of each agent consists of three parts. The first part is the state information of each joint node in the agent's own sub-joint graph (such as joint angle, angular velocity, torque inf...
Embodiment
[0083] 1) Experimental tasks
[0084] The training scene for multi-agent manipulator motion planning is a desktop scene built based on the MuJoCo physical simulation engine, such as figure 1 As shown, the task of the manipulator is to move to the target position without colliding with the environment (the target position point is represented by q goal and the forward kinematics model of the manipulator), and the planning is considered successful when the distance between the origin of the coordinate system at the end of the manipulator and the target position is less than 1 cm. The initial configuration q of the training process start and the target configuration q goal are randomly selected within a certain range.
[0085] 2) Multi-agent decomposition of the robotic arm
[0086] The present invention divides the manipulator into three situations: single-agent, double-agent and three-agent based on the joint graph. figure 2 shown. Single agent refers to all the joint no...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



