Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Framework for Generation and Management of Adaptive Rules Based on Reinforcement Learning

A reinforcement learning and self-adaptive technology, applied in data processing applications, electrical digital data processing, instruments, etc., can solve problems such as inability to guarantee the optimal realization of goals, inability to adapt to dynamic changes, etc.

Active Publication Date: 2020-12-22
PEKING UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The rule-based planning method has the advantages of being readable, easy to modify, and efficient in the planning process, but this type of planning method also has its shortcomings: 1. Since the rule set is usually defined manually, it cannot be guaranteed that it will achieve the goal optimally; 2. , Since the defined rule set usually does not change anymore, it cannot adapt to the goals that may change dynamically

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Framework for Generation and Management of Adaptive Rules Based on Reinforcement Learning
  • A Framework for Generation and Management of Adaptive Rules Based on Reinforcement Learning
  • A Framework for Generation and Management of Adaptive Rules Based on Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] figure 1 and figure 2 The frame diagram of the present invention is shown, and the important concepts are introduced as follows:

[0029] 1. State s: A state s consists of a set of environmental features and attribute features. For example, for a website system, "high load", "medium load" and "low load" are environmental characteristics, while "faster response speed" and "slower response speed" are attribute characteristics. A set {"high load", "faster response speed"} composed of environmental characteristics and attribute characteristics represents a state. All possible state sets are denoted as S;

[0030] 2. Operation a: An operation is to select a system configuration, that is, to select a set of system features. For example, at the change point of "authentication method", there may be three optional system features of "strict authentication", "normal authentication" and "simple authentication", select a set of system characteristics {"simple authentication", ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a frame for generating and managing an adaptive rule based on reinforcement learning. According to the frame, a mapping relationship is established between an adaptive scene and a reinforcement learning algorithm. The frame comprises a designing period and an operating period. In the designing period, the reinforcement studying algorithm is used and adaptive rules are respectively generated for aiming at a plurality of representative target settings. In the operating period, reasoning based on cases is utilized for finding and activating the adaptive rule which is most suitable for the current target setting is found and activated, and furthermore the adaptive rule is continuously updated by means of the reinforcement learning algorithm. The frame has a technical effect of improving a traditional rule-based adaptive method and is advantageous in that 1, an automatic generating algorithm for the adaptive rule which can ensure matching between the generated rule and the target is supplied; and 2, an adaptive rule evolution algorithm which can make the rule set perform evolution that matches a target change after occurrence of the target change is supplied.

Description

Technical field: [0001] The invention relates to a framework for generating and managing adaptive rules based on reinforcement learning, which is suitable for supporting automatic generation and evolution of rules in an adaptive system. It belongs to the field of software technology. Background technique: [0002] An adaptive system refers to a system that can monitor changes in the environment and the system itself, and can dynamically modify its behavior and structure to continuously meet the goals. After sensing and analyzing the change, the adaptive system needs to plan in real time, that is, decide how to adjust the components or parameters of the software to continuously meet the goal. Among the common planning methods for adaptive systems, rule-based methods use rules to predefine adaptive strategies. Such methods usually maintain a set of CA (Condition-Action) rules, and each rule specifies that when a specific condition is true, the system should perform a specifi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/00
CPCG06Q10/04
Inventor 金芝赵天琪张伟赵海燕
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products