Elevator system self-learning optimal control method and system based on deep reinforcement learning

An elevator system and reinforcement learning technology, applied in neural learning methods, constraint-based CAD, complex mathematical operations, etc., can solve problems such as the inability to achieve optimal control of elevator efficiency

Active Publication Date: 2020-10-09
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Researchers have tried to explore optimal solutions in different ways, including expert systems, fuzzy mathemati

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Elevator system self-learning optimal control method and system based on deep reinforcement learning
  • Elevator system self-learning optimal control method and system based on deep reinforcement learning
  • Elevator system self-learning optimal control method and system based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0074] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0075] The purpose of the present invention is to provide a self-learning optimal control method for an elevator system based on deep reinforcement learning. Based on constraints, operating models and probability distribution models, the data information of the elevator system is preprocessed to obtain current data information, and further The global iteration is performed according to the current data information, and in the global iteration process, local processing is performed through multiple asynchronous thread iterations to determine the weight of the action evaluation network, and the optimal elevator control strategy is obtained throug...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an elevator system self-learning optimal control method and system based on deep reinforcement learning. The control method comprises the following steps: establishing an operation model and a probability distribution model; preprocessing the data information of the elevator system to obtain current data information; carrying out global iteration is according to the current data information, and carrying out local processing through iteration of a plurality of asynchronous threads: for each asynchronous thread, training a local action evaluation network through deep reinforcement learning according to the current data information, and correcting the weight of the action evaluation network; determining a global action evaluation network according to the weight of the action evaluation network until multi-thread iteration and global iteration are finished; and obtaining an optimal elevator control strategy according to the global action evaluation network so as to determine the average waiting time. In the global iteration process, local processing is carried out through iteration of the plurality of asynchronous threads, the weight of the action evaluation network is determined, and the optimal elevator control strategy is obtained through self-learning.

Description

technical field [0001] The invention relates to the technical field of intelligent optimization control, in particular to a self-learning optimal control method and system for an elevator system based on deep reinforcement learning. Background technique [0002] With the development and progress of society, a large number of working people flow to the city to work, and the population density of buildings in large and medium cities has reached an unprecedented height. Ensuring the efficient flow of people in a building is a prerequisite for maintaining the normal operation of the building, and the elevator system plays an extremely important role in the efficient flow of people. The number, capacity, running speed and scheduling algorithm of the elevator cars determine the efficiency of the elevator system. Since the number, capacity and running speed of the cars are more or less limited by the hardware conditions of the building, the elevator scheduling algorithm has become ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F30/27G06F17/18G06N3/04G06N3/08B66B1/06B66B1/34G06F111/04
CPCG06F30/27G06F17/18G06N3/08B66B1/06B66B1/3415G06F2111/04G06N3/044G06N3/045
Inventor 魏庆来王凌霄宋睿卓
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products