Learning method, device, equipment and storage medium of behavior control strategy

A learning method for behavior control strategies, applied in the computer field. It addresses the problem that an object cannot acquire a motor skill without matching demonstration data, which makes learning a given skill highly complex, and it reduces that complexity.

Active Publication Date: 2021-04-13
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, in existing demonstration learning, for an object to acquire a given motor skill, the action demonstration data corresponding to that skill must be obtained in advance. Without such data the object cannot acquire the skill, which makes learning any particular skill highly complex.

Embodiment Construction

[0037] The scheme of the present application is suitable for demonstration learning, which involves a demonstration object and an object that is to learn behavioral skills. The demonstration object performs the demonstrated behavior, which yields the demonstration behavior data on which demonstration learning is based. The object that is to learn the behavioral skill is the one that finally learns the corresponding action skill from the demonstration behavior data. For example, this object may be a robot or a game object in a game.

[0038] Taking the game field as an example, the objects that are to learn skills may be game characters. In this case, the demonstration behavior data can be obtained from actions demonstrated by a real user (such as walking or jumping), and the game characters can be trained by reinforcement learning on that data, so that the game characters can have the demonstrated skill...

Abstract

The present application discloses a behavior control strategy learning method, device, computer equipment, and storage medium. The method includes: sampling, from a demonstration behavior data sequence, a demonstration behavior data segment that includes at least two demonstration behavior data; based on the segment, setting the initial state information of each joint of the target object simulated in the physical simulator, and using the neural network model to be trained to determine the force data of each joint of the target object; controlling the movement of each joint of the target object simulated in the physical simulator, so that the physical simulator produces a simulation behavior data sequence of the target object subject to the set action behavior limitation characteristics; determining the action behavior difference from the demonstration behavior data and the simulation behavior data; and optimizing the neural network model based on that difference until the optimization goal is reached. The solution helps the object of demonstration learning generate extended action behaviors based on the demonstrated actions.
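The loop described in the abstract can be illustrated with a toy sketch. Everything concrete here is an assumption made for illustration and does not come from the patent: a two-gain PD controller stands in for the neural network, a unit-mass point per joint stands in for the physical simulator, a joint-angle clamp (`ANGLE_LIMIT`) stands in for the action behavior limitation characteristics, mean squared joint-angle error stands in for the action behavior difference, and random-search hill climbing stands in for the optimizer.

```python
import math
import random

# Hypothetical joint-angle limit standing in for the "action behavior
# limitation characteristics"; the source does not say what they are.
ANGLE_LIMIT = 1.0


def sample_segment(demo, length, rng):
    """Sample a contiguous segment of at least two demonstration frames."""
    if length < 2 or length > len(demo):
        raise ValueError("segment length must be in [2, len(demo)]")
    start = rng.randrange(len(demo) - length + 1)
    return demo[start:start + length]


def simulate(segment, params):
    """Roll out a toy simulator from the segment's first frame.

    Each joint is a unit-mass point. The stand-in policy outputs one force
    per joint from the next demonstrated frame and the current joint state.
    """
    kp, kd = params
    angles = list(segment[0])           # initial state taken from the segment
    velocities = [0.0] * len(angles)
    simulated = [list(angles)]
    for target in segment[1:]:
        for j in range(len(angles)):
            force = kp * (target[j] - angles[j]) - kd * velocities[j]
            velocities[j] += force      # unit mass, unit time step
            angles[j] += velocities[j]
            angles[j] = max(-ANGLE_LIMIT, min(ANGLE_LIMIT, angles[j]))
        simulated.append(list(angles))
    return simulated


def behavior_difference(segment, simulated):
    """Mean squared joint-angle error between demonstration and simulation."""
    total, count = 0.0, 0
    for demo_frame, sim_frame in zip(segment, simulated):
        for d, s in zip(demo_frame, sim_frame):
            total += (d - s) ** 2
            count += 1
    return total / count


def train(demo, segment_length=8, max_iters=500, goal=1e-4, seed=0):
    """Hill-climb the two gains until the difference on a segment is small."""
    rng = random.Random(seed)
    params = (0.0, 0.0)
    for _ in range(max_iters):
        segment = sample_segment(demo, segment_length, rng)
        loss = behavior_difference(segment, simulate(segment, params))
        if loss <= goal:                # optimization goal reached
            break
        candidate = tuple(p + rng.gauss(0.0, 0.1) for p in params)
        cand_loss = behavior_difference(segment, simulate(segment, candidate))
        if cand_loss < loss:            # keep the perturbation only if it helps
            params = candidate
    return params


# Synthetic two-joint demonstration sequence (made up for this sketch).
demo = [[0.5 * math.sin(0.2 * t), 0.5 * math.cos(0.2 * t)] for t in range(60)]
trained = train(demo)
```

In this toy setting, gains of `(1.0, 1.0)` reproduce the demonstrated segment almost exactly, while `(0.0, 0.0)` leaves every joint frozen at its initial state, so the difference measure separates a good policy from a bad one. A real system would replace each stand-in with the component named in the abstract.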

Description

Technical field

[0001] The present application relates to the field of computer technology, and in particular to a learning method, device, equipment and storage medium for a behavior control strategy.

Background

[0002] Demonstration learning is an autonomous learning technique that takes demonstrated behavior as its goal. In demonstration learning, the object that is to learn skills is required to imitate the demonstrated behavior, so that it acquires the motor skills corresponding to that behavior. The objects that are to learn skills differ across application fields. For example, in the field of games, they can be characters or animals in the game; in the field of robot control, they can be robots.

[0003] At present, in the demonstration learning process, behavior control strategies can be learned from several groups of demonstration...

Application Information

IPC(8): G06F30/20; G06N3/08
CPC: G06N3/08
Inventor: 孙明飞, 石贝, 付强
Owner: TENCENT TECH (SHENZHEN) CO LTD