Reinforcement learning systems and methods for inventory control and optimization

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An inventory, randomization technique with applications in reinforcement learning. field, which can solve problems such as slow response to changes in system requirements, monopoly, and no performance

Pending Publication Date: 2021-06-29

AMADEUS S

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Furthermore, conventional RMS models treat markets as monopolies under the assumption that competitors' actions are implicitly considered in customer behavior

[0007] Another disadvantage of the conventional approach to RMS is that there are often interdependencies between the model and its inputs such that any change in the available input data requires the model to be modified or rebuilt to take advantage of the new or changed information

Furthermore, without human intervention, the modeled system is slow to respond to changes in requirements that are poorly or not represented in the historical data on which the model is based

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0082] figure 1 is a block diagram illustrating an exemplary networked system 100 including an inventory system 102 embodying the present invention. In particular, inventory system 102 includes a reinforcement learning (RL) system configured to perform revenue optimization according to embodiments of the present invention. In particular, embodiments of the invention are described with reference to an inventory and revenue optimization system for selling and reserving airline seats, where networked system 100 generally includes an airline reservation system and inventory system 102 includes an airline-specific inventory system. However, it should be appreciated that this is merely an example to illustrate the system and method, and that additional embodiments of the invention may be applied to systems other than those associated with the sale and reservation of airline seats Inventory and revenue management systems other than .

[0083] Airline inventory system 102 may includ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A method of reinforcement learning for a resource management agent in a system for managing an inventory of perishable resources having a sales horizon, while seeking to optimize revenue generated therefrom. The inventory has an associated state. The method comprises generating a plurality of actions. Responsive to the actions, corresponding observations are received, each observation comprising a transition in the state associated with the inventory and an associated reward in the form of revenues generated from sales of the perishable resources. The received observations are stored in a replay memory store. A randomised batch of observations is periodically sampled from the replay memory store according to a prioritised replay sampling algorithm wherein, throughout a training epoch, a probability distribution for selection of observations within the randomised batch is progressively adapted. Each randomised batch of observations is used to update weight parameters of a neural network that comprises an action-value function approximator of the resource management agent, such that when provided with an input inventory state and an input action, an output of the neural network more closely approximates a true value of generating the input action while in the input inventory state. The neural network may thereby be used to select each of the plurality of actions generated depending upon a corresponding state associated with the inventory.

Description

technical field [0001] The present invention relates to technical methods and systems for improved inventory control and optimization. In particular, embodiments of the present invention employ machine learning techniques, particularly reinforcement learning, in the implementation of an improved revenue management system. Background technique [0002] Inventory systems are employed in many industries to control the availability of resources, for example through pricing and yield management and any associated calculations. Inventory systems enable customers to purchase or reserve available resources or goods from providers. Additionally, inventory systems allow providers to manage available resources and maximize revenue and profits by making these resources available to customers. [0003] In this context, the term "revenue management" refers to the application of data analytics to predict consumer behavior and optimize product offerings and pricing to maximize revenue gro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06Q10/02G06N3/08

CPCG06N3/08G06Q10/02G06N3/006G06Q10/087G06Q10/04G06Q10/06312G06Q10/0637G06Q10/067G06Q30/0206G06N3/092

Inventor R·A·阿库纳·阿格斯特T·菲戈N·邦杜A-Q·阮

Owner AMADEUS S

Reinforcement learning systems and methods for inventory control and optimization

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology