Unlock instant, AI-driven research and patent intelligence for your innovation.

Reinforcement learning systems and methods for inventory control and optimization

An inventory, randomization technique with applications in reinforcement learning. field, which can solve problems such as slow response to changes in system requirements, monopoly, and no performance

Pending Publication Date: 2021-06-29
AMADEUS S
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Furthermore, conventional RMS models treat markets as monopolies under the assumption that competitors' actions are implicitly considered in customer behavior
[0007] Another disadvantage of the conventional approach to RMS is that there are often interdependencies between the model and its inputs such that any change in the available input data requires the model to be modified or rebuilt to take advantage of the new or changed information
Furthermore, without human intervention, the modeled system is slow to respond to changes in requirements that are poorly or not represented in the historical data on which the model is based

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reinforcement learning systems and methods for inventory control and optimization
  • Reinforcement learning systems and methods for inventory control and optimization
  • Reinforcement learning systems and methods for inventory control and optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0082] figure 1 is a block diagram illustrating an exemplary networked system 100 including an inventory system 102 embodying the present invention. In particular, inventory system 102 includes a reinforcement learning (RL) system configured to perform revenue optimization according to embodiments of the present invention. In particular, embodiments of the invention are described with reference to an inventory and revenue optimization system for selling and reserving airline seats, where networked system 100 generally includes an airline reservation system and inventory system 102 includes an airline-specific inventory system. However, it should be appreciated that this is merely an example to illustrate the system and method, and that additional embodiments of the invention may be applied to systems other than those associated with the sale and reservation of airline seats Inventory and revenue management systems other than .

[0083] Airline inventory system 102 may includ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method of reinforcement learning for a resource management agent in a system for managing an inventory of perishable resources having a sales horizon, while seeking to optimize revenue generated therefrom. The inventory has an associated state. The method comprises generating a plurality of actions. Responsive to the actions, corresponding observations are received, each observation comprising a transition in the state associated with the inventory and an associated reward in the form of revenues generated from sales of the perishable resources. The received observations are stored in a replay memory store. A randomised batch of observations is periodically sampled from the replay memory store according to a prioritised replay sampling algorithm wherein, throughout a training epoch, a probability distribution for selection of observations within the randomised batch is progressively adapted. Each randomised batch of observations is used to update weight parameters of a neural network that comprises an action-value function approximator of the resource management agent, such that when provided with an input inventory state and an input action, an output of the neural network more closely approximates a true value of generating the input action while in the input inventory state. The neural network may thereby be used to select each of the plurality of actions generated depending upon a corresponding state associated with the inventory.

Description

technical field [0001] The present invention relates to technical methods and systems for improved inventory control and optimization. In particular, embodiments of the present invention employ machine learning techniques, particularly reinforcement learning, in the implementation of an improved revenue management system. Background technique [0002] Inventory systems are employed in many industries to control the availability of resources, for example through pricing and yield management and any associated calculations. Inventory systems enable customers to purchase or reserve available resources or goods from providers. Additionally, inventory systems allow providers to manage available resources and maximize revenue and profits by making these resources available to customers. [0003] In this context, the term "revenue management" refers to the application of data analytics to predict consumer behavior and optimize product offerings and pricing to maximize revenue gro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/02G06N3/08
CPCG06N3/08G06Q10/02G06N3/006G06Q10/087G06Q10/04G06Q10/06312G06Q10/0637G06Q10/067G06Q30/0206G06N3/092
Inventor R·A·阿库纳·阿格斯特T·菲戈N·邦杜A-Q·阮
Owner AMADEUS S