Elevator system self-learning optimal control method and system based on deep reinforcement learning

An elevator system and reinforcement learning technology, applied in the direction of neural learning methods, constraint-based CAD, complex mathematical operations, etc., can solve problems such as the inability to achieve optimal control of elevator efficiency

Active Publication Date: 2021-09-07
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Researchers have tried to explore optimal solutions in different ways, including expert systems, fuzzy mathematics, genetic algorithms, and reinforcement learning, but none of them can achieve optimal control of elevator efficiency.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Elevator system self-learning optimal control method and system based on deep reinforcement learning
  • Elevator system self-learning optimal control method and system based on deep reinforcement learning
  • Elevator system self-learning optimal control method and system based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0074] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0075] The purpose of the present invention is to provide a self-learning optimal control method for an elevator system based on deep reinforcement learning. Based on constraints, operating models and probability distribution models, the data information of the elevator system is preprocessed to obtain current data information, and further The global iteration is performed according to the current data information, and in the global iteration process, local processing is performed through multiple asynchronous thread iterations to determine the weight of the action evaluation network, and the optimal elevator control strategy is obtained throug...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to a self-learning optimal control method and system of an elevator system based on deep reinforcement learning. The control method includes: establishing an operation model and a probability distribution model; preprocessing the data information of the elevator system to obtain current data information; Perform global iteration according to the current data information, and perform local processing through multiple asynchronous thread iterations: for each asynchronous thread, according to the current data information, use deep reinforcement learning to train the local action evaluation network, and correct the weight of the action evaluation network; until multiple At the end of the thread iteration and the end of the global iteration, the global action evaluation network is determined according to the weight of the action evaluation network; the optimal elevator control strategy is obtained according to the global action evaluation network to determine the average waiting time. In the global iterative process, the present invention performs local processing through a plurality of asynchronous thread iterations, determines the weight value of the action evaluation network, and obtains the optimal elevator control strategy through self-learning.

Description

technical field [0001] The invention relates to the technical field of intelligent optimization control, in particular to a self-learning optimal control method and system for an elevator system based on deep reinforcement learning. Background technique [0002] With the development and progress of society, a large number of working people flow to the city to work, and the population density of buildings in large and medium cities has reached an unprecedented height. Ensuring the efficient flow of people in a building is a prerequisite for maintaining the normal operation of the building, and the elevator system plays an extremely important role in the efficient flow of people. The number, capacity, running speed and scheduling algorithm of the elevator cars determine the efficiency of the elevator system. Since the number, capacity and running speed of the cars are more or less limited by the hardware conditions of the building, the elevator scheduling algorithm has become ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F30/27G06F17/18G06N3/04G06N3/08B66B1/06B66B1/34G06F111/04
CPCG06F30/27G06F17/18G06N3/08B66B1/06B66B1/3415G06F2111/04G06N3/044G06N3/045
Inventor 魏庆来王凌霄宋睿卓
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products