Robot control method, device and equipment and computer storage medium

A control method and robot technology, applied in the Internet field, can solve problems such as poor generalization and slow robot task training process, and achieve the effect of improving utilization rate and accelerating training

Pending Publication Date: 2021-03-12
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the implementation methods in related technologies usually need to train a model to com

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robot control method, device and equipment and computer storage medium
  • Robot control method, device and equipment and computer storage medium
  • Robot control method, device and equipment and computer storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to make the purpose, technical solutions and advantages of the application clearer, the application will be further described in detail below in conjunction with the accompanying drawings. All other embodiments obtained under the premise of creative labor belong to the scope of protection of this application.

[0049] In the following description, references to "some embodiments" describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or a different subset of all possible embodiments, and Can be combined with each other without conflict. Unless otherwise defined, all technical and scientific terms used in the embodiments of the present application have the same meaning as commonly understood by those skilled in the technical field of the embodiments of the present application. The terms used in the embodiments of the present application are only for the purpose of describing the embodiments of the pres...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An embodiment of the invention provides robot control method, device and equipment and a computer storage medium, and relates to the technical field of artificial intelligence. The method comprises the steps that environment interaction data and an actual target value actually completed after an action corresponding to action data in the environment interaction data is executed are obtained; a reward value after the action is executed is determined according to the state parameter of the first moment in two adjacent moments, the action data and the actual target value; a reward value in the environment interaction data is updated by the aid of the reward value after the action is executed; an intelligent agent corresponding to a robot control network is trained by the aid of the updated environment interaction data; and the action of the target robot is controlled by the aid of the trained intelligent agent. Through the robot control method, device and equipment and the computer storage medium, the data utilization rate can be increased, training of the intelligent agent can be accelerated, a large number of targets can be trained at the same time, and all tasks in a certain targetspace can be completed through one model.

Description

technical field [0001] The embodiment of the present application relates to the technical field of the Internet, and relates to but not limited to a robot control method, device, equipment and computer storage medium. Background technique [0002] At present, when controlling the robot, one implementation method is the deep reinforcement learning control algorithm based on the priority experience playback mechanism, which uses the state information of the object operated by the robot to calculate the priority, and uses the deep reinforcement learning method to complete the end-to-end The robot control model allows deep reinforcement learning agents to learn independently in the environment and complete specified tasks; another implementation method is a kinematic self-grasping learning method based on simulated industrial robots, which belongs to the field of computer-aided manufacturing and is based on simulation environment and use reinforcement learning theory to carry ou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): B25J9/08B25J9/16
CPCB25J9/08B25J9/1602B25J9/1661B25J9/1664B25J9/1679B25J9/1612B25J9/163G05B2219/40499
Inventor 杨瑞李蓝青罗迪君
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products