Unlock instant, AI-driven research and patent intelligence for your innovation.

A Temporal Logic Task Planning Method Based on Reinforcement Learning

A sequential logic and task planning technology, applied in the direction of program control manipulators, manufacturing tools, manipulators, etc., can solve problems such as insufficient effectiveness, poor real-time performance, violation of task sequential logic, etc., to improve real-time performance, improve safety, Guaranteed effectiveness

Active Publication Date: 2020-11-10
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The above scheme 1 needs to be calculated during use, and the real-time performance is not good; scheme 2 cannot handle new tasks that are not in the training data, and the effectiveness is insufficient, and the planning results are not safe, which may violate the given task timing logic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Temporal Logic Task Planning Method Based on Reinforcement Learning
  • A Temporal Logic Task Planning Method Based on Reinforcement Learning
  • A Temporal Logic Task Planning Method Based on Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The present invention will be described in detail below with reference to the accompanying drawings and examples.

[0045]The present invention is completely different from the two solutions mentioned in the background art and the existing solutions not listed. Aiming at the temporal logic task planning problem, a new temporal logic task planning method based on reinforcement learning is proposed. First, use linear sequential logic language and non-deterministic Büchi automata to model the task, and use the finite state transition system to describe the environment; according to the needs of reinforcement learning, innovatively set the combination state, state transition relationship, and reward function ; Finally, use the Q-Learning method to train to get the planning result.

[0046] In order to realize the temporal logic task planning method, it is first necessary to describe the generative Büchi automaton created by the present invention. The generative Büchi auto...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a sequential logic task planning method based on reinforcement learning. The sequential logic task planning method comprises the following steps: modeling a task by utilizing alinear sequential logic language and a non-deterministic Buchi automaton at first, describing an environment by utilizing a finite state transfer system FTS, and then generating a generative Buchi automaton by utilizing the FTS and the Buchi automaton; carrying out task planning training on the generative Buchi automaton by utilizing a Q-Learning method; and when the combined state enters the acceptable state or the dead zone state during the iterative training process, terminating the current round of iteration and jumping into the next iteration process, wherein a reward function utilized for updating the state-action value comprises the setting of the reward value or the penalty value given when the combined state enters the acceptable state or the dead zone state. By the adoption of the sequential logic task planning method provided by the invention, the effective, safe and high-speed task planning of the sequential logic task can be achieved.

Description

technical field [0001] The invention belongs to the field of artificial intelligence, and in particular relates to a temporal logic task planning method based on reinforcement learning. Background technique [0002] In actual tasks, many tasks not only require simple parallel cooperation, but also need to perform more complex serial cooperation tasks, that is, robots need to perform different tasks according to different stages of the task. Such tasks that require environment, time, and execution order are called sequential logic tasks. The task planning problem is to find a sequence of operations that change the system from an initial state to a goal state, given a task. The resolution process is similar to human deliberation, selecting and organizing actions by anticipating their outcomes. Mission planning can be applied in many fields: artificial intelligence, robotic systems, military command, etc. Therefore, the research in the field of mission planning attracts more...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): B25J9/16
CPCB25J9/1661
Inventor 方浩宇文涛陈杰杨庆凯曾宪琳
Owner BEIJING INSTITUTE OF TECHNOLOGYGY