A Temporal Logic Task Planning Method Based on Reinforcement Learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A sequential logic and task planning technology, applied in the direction of program control manipulators, manufacturing tools, manipulators, etc., can solve problems such as insufficient effectiveness, poor real-time performance, violation of task sequential logic, etc., to improve real-time performance, improve safety, Guaranteed effectiveness

Active Publication Date: 2020-11-10

BEIJING INSTITUTE OF TECHNOLOGYGY

View PDF5 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The above scheme 1 needs to be calculated during use, and the real-time performance is not good; scheme 2 cannot handle new tasks that are not in the training data, and the effectiveness is insufficient, and the planning results are not safe, which may violate the given task timing logic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0044] The present invention will be described in detail below with reference to the accompanying drawings and examples.

[0045]The present invention is completely different from the two solutions mentioned in the background art and the existing solutions not listed. Aiming at the temporal logic task planning problem, a new temporal logic task planning method based on reinforcement learning is proposed. First, use linear sequential logic language and non-deterministic Büchi automata to model the task, and use the finite state transition system to describe the environment; according to the needs of reinforcement learning, innovatively set the combination state, state transition relationship, and reward function ; Finally, use the Q-Learning method to train to get the planning result.

[0046] In order to realize the temporal logic task planning method, it is first necessary to describe the generative Büchi automaton created by the present invention. The generative Büchi auto...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a sequential logic task planning method based on reinforcement learning. The sequential logic task planning method comprises the following steps: modeling a task by utilizing alinear sequential logic language and a non-deterministic Buchi automaton at first, describing an environment by utilizing a finite state transfer system FTS, and then generating a generative Buchi automaton by utilizing the FTS and the Buchi automaton; carrying out task planning training on the generative Buchi automaton by utilizing a Q-Learning method; and when the combined state enters the acceptable state or the dead zone state during the iterative training process, terminating the current round of iteration and jumping into the next iteration process, wherein a reward function utilized for updating the state-action value comprises the setting of the reward value or the penalty value given when the combined state enters the acceptable state or the dead zone state. By the adoption of the sequential logic task planning method provided by the invention, the effective, safe and high-speed task planning of the sequential logic task can be achieved.

Description

technical field [0001] The invention belongs to the field of artificial intelligence, and in particular relates to a temporal logic task planning method based on reinforcement learning. Background technique [0002] In actual tasks, many tasks not only require simple parallel cooperation, but also need to perform more complex serial cooperation tasks, that is, robots need to perform different tasks according to different stages of the task. Such tasks that require environment, time, and execution order are called sequential logic tasks. The task planning problem is to find a sequence of operations that change the system from an initial state to a goal state, given a task. The resolution process is similar to human deliberation, selecting and organizing actions by anticipating their outcomes. Mission planning can be applied in many fields: artificial intelligence, robotic systems, military command, etc. Therefore, the research in the field of mission planning attracts more...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): B25J9/16

CPCB25J9/1661

Inventor 方浩宇文涛陈杰杨庆凯曾宪琳

Owner BEIJING INSTITUTE OF TECHNOLOGYGY

A Temporal Logic Task Planning Method Based on Reinforcement Learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology