Realtime numerical solution of stochastic models

By using a dedicated processor to perform arithmetic on distribution representations in a single path, the method addresses the computational inefficiencies of Monte Carlo simulations, achieving faster and more accurate financial instrument price predictions.

WO2026139490A1PCT designated stage Publication Date: 2026-07-02SIGNALOID LTD

Patent Information

Authority / Receiving Office
WO · WO
Patent Type
Applications
Current Assignee / Owner
SIGNALOID LTD
Filing Date
2025-12-22
Publication Date
2026-07-02

Smart Images

  • Figure EP2025088766_02072026_PF_FP_ABST
    Figure EP2025088766_02072026_PF_FP_ABST
Patent Text Reader

Abstract

The present disclosure relates to a computer-implemented method for price prediction, comprising: - providing a stochastic model configured to predict future price information of a financial instrument; and - executing the stochastic model in a single iteration having multiple steps by propagating a distribution of a price of the financial instrument through the multiple steps to predict the future price information of the financial instrument, wherein the distribution of the price indicates potential values of the price, and wherein each step of the multiple steps of the single iteration performs calculations on the distribution of the price.
Need to check novelty before this filing date? Find Prior Art

Description

REALTIME NUMERICAL SOLUTION OF STOCHASTIC MODELS FIELD

[0001] The present disclosure relates to a computer-implemented method for price prediction, a computer program comprising instructions for carrying out the computer-implemented method for price prediction, a computer-readable storage medium comprising instructions for carrying out the computer-implemented method for price prediction, and a data processing system for price prediction. More particularly, the present disclosure relates to real-time numerical solution of stochastic differential equations for quantitative finance using distribution arithmetic.BACKGROUND

[0002] In financial markets, the ability to accurately estimate the future price of a financial instrument, such as equities (e.g., shares of companies) or commodities (e.g., the price per barrel of oil), is of great importance to financial institutions. Such predictions are integral to decision-making processes for investment strategies, risk management, and pricing financial derivatives.

[0003] Price movements in financial markets are influenced by numerous unpredictable factors, including news events, changes in market sentiment, and broader economic conditions. Consequently, future price estimation requires a probabilistic approach that accounts for the inherent randomness in these influences. To this end, mathematical models, specifically stochastic models, are widely employed to simulate the dynamics of price changes. These models often rely on stochastic differential equations (SDEs) to describe the evolution of a financial instrument’s price over time.

[0004] Solving stochastic differential equations to predict future prices involves accounting for random market behaviors and fluctuations. A common approach is to perform multiple simulations of potential price trajectories by introducing slightlydifferent random variations in each simulation. Each individual simulation produces a “path” that represents a possible evolution of the instrument’s price over time. By running a large number of such paths and aggregating the results, it becomes possible to estimate the distribution and most likely outcomes for the future price of the financial instrument.

[0005] This simulation process is frequently implemented using Monte Carlo methods, which are well-suited for handling the random and iterative nature of stochastic models. Monte Carlo simulations involve generating hundreds of thousands, or even millions, of these price paths to ensure sufficient convergence on an accurate prediction. However, the computational demands of these repeated simulations are substantial, requiring significant processing time and resources.SUMMARY

[0006] In light of the above, a computer-implemented method for price prediction, a computer program comprising instructions for carrying out the computer-implemented method for price prediction, a computer-readable storage medium comprising instructions for carrying out the computer-implemented method for price prediction, and a data processing system for price prediction are provided.

[0007] It is an obj ect of the present disclosure to enable price prediction with reduced processing time and resource consumption through the use of a dedicated processor that is able to operate and perform arithmetic directly on distribution representations. Another object of the present disclosure is to extend the capabilities of commonly used processor architectures through the use of dedicated software that is able to operate and perform arithmetic directly on distribution representations.

[0008] The objects are solved by the features of the independent claims. Preferred embodiments are defined in the dependent claims.

[0009] According to an independent aspect of the present disclosure, a computer-implemented method for price prediction is provided. The method includes providinga stochastic model configured to predict future price information of a financial instrument; and executing the stochastic model in a single path having multiple steps by propagating a distribution of a price of the financial instrument through the multiple steps to predict the future price information of the financial instrument, wherein the distribution of the price indicates potential values (or changes) of the price, and wherein each step of the multiple steps of the single path performs calculations on the distribution of the price.

[0010] The term “financial instrument” as used throughout the present disclosure refers to an asset or contract that can be traded in financial markets and whose value (price) is subject to fluctuations over time. These instruments can be classified broadly into categories, each with distinct characteristics and risk profiles.

[0011] According to some embodiments, which can be combined with other embodiments described herein, the financial instrument is selected from the group including, or consisting of, equities, commodities, currencies, bonds and derivatives.

[0012] The term “stochastic model” as used throughout the present disclosure refers to a probabilistic framework designed to account for randomness and uncertainty in market movements. These stochastic models are based on the idea that price changes are not entirely deterministic but influenced by various unpredictable factors such as market sentiment, news, and macroeconomic conditions.

[0013] The stochastic model used in the embodiments of the present disclosure has multiple steps. This refers to a framework where the evolution of the price of the financial instrument is modeled as a sequence of calculations or simulations performed over discrete time intervals or steps. Each step represents a small increment of time in the progression of the model and incorporates the effects of deterministic trends and random fluctuations.

[0014] According to some embodiments, which can be combined with other embodiments described herein, the stochastic model is based on at least one stochasticdifferential equation. A stochastic differential equation (SDE) is a mathematical equation used to model the evolution of a system, such as the price of a stock or commodity, over time under the influence of both deterministic trends and random fluctuations. Specifically, SDEs describe how the price changes as a function of time, incorporating predictable factors like long-term growth or decay (drift) and unpredictable, random factors (volatility) arising from market noise or external events.

[0015] According to some embodiments, which can be combined with other embodiments described herein, the stochastic model is selected from the group consisting of a Geometric Brownian Motion Model, an Integrated Brownian Motion Model, a Correlated Brownian Motion Model, a Mean-Reverting Model, a Jump-Diffusion Model and a Stochastic Volatility Model.

[0016] According to some embodiments, which can be combined with other embodiments described herein, the distribution of the price which is propagated through the stochastic model to predict the future price information indicates potential values (or changes) of the price according to predictable factors and random factors.

[0017] Predictable factors refer to the components of the stochastic model that represent the systematic and relatively stable influences on a financial instrument’s price over time. These factors can be derived from historical trends, economic indicators and / or inherent characteristics of the asset and can be modeled deterministically.

[0018] Random factors refer to the stochastic elements that introduce uncertainty and volatility into the price of a financial instrument. These factors arise from external events and / or market behaviors that are inherently random and cannot be forecast with certainty.

[0019] In the embodiments of the present disclosure, the stochastic model is executed in a single path (also referred to as “iteration”). According to some embodiments, which can be combined with other embodiments described herein, thesingle path is a single path at processor level. At the processor level, a single path (or iteration) refers to the execution of a single cycle or pass of a loop or operation in a computational program, as executed by the processor’s hardware. This path involves, for example, the processor fetching instructions from memory, decoding them, performing arithmetic or logical operations, and then storing the results. In contrast thereto, at the computing level, a single path (or iteration) represents the completion of one entire cycle of a computational algorithm or simulation. In particular, the expression “at computing level” may refer, for example, to the level of the algorithm and / or the level of the application. This involves higher-level operations and may consist of multiple processor-level iterations working together to achieve a larger task. For example, a single path in a Monte Carlo Simulation could involve generating a random input, solving a stochastic differential equation, and recording the outcome as one path.

[0020] According to some embodiments, which can be combined with other embodiments described herein, the calculations on the distribution of the price are performed on a processor unit or processor architecture that is configured to represent and perform arithmetic on probability distributions.

[0021] According to some embodiments, which can be combined with other embodiments described herein, the distribution of the price of the financial instrument is in the form of a representation. The representation can be a representation of probability mass / density functions / distributions on which a processor unit or processor architecture can operate. In one embodiment, the representations enable commonly utilized processor architectures to operate on the distribution of the price of the financial instrument. Non-limiting exemplary methods for generating such representations from a data set are explained in WO 2022 / 248719 Al and WO 2022 / 248714 Al.

[0022] According to some embodiments, which can be combined with other embodiments described herein, the representation includes price position parametersand associated price probability parameters. In particular, a representation generation algorithm can be used to generate the representation of the distribution of the price of the financial instrument. In preferred embodiments, the representation generation algorithm can be configured to generate the representation according to the position parameter and the probability parameter, wherein each position of the representation is represented by a Dirac delta associated with a corresponding probability mass (i.e., the price position and price probability parameters correspond to the position and probability parameters of a Dirac mixture representation, respectively). In other words, each real number in the domain of a distribution may be described by a Dirac delta with some probability mass located at the value of the real number.

[0023] According to some embodiments, which can be combined with other embodiments described herein, the representation is a Telescoping Torques Representation (TTR). The Telescoping Torques Representation (TTR) builds on Dirac mixture representation and is based on the intuition that the mean value of a set of samples splits the samples into two sets with equal products of the distance of the sample from the mean and the sample probability. This allows the representation to capture the effects of unlikely, yet high-value samples even at small representation sizes.

[0024] According to some embodiments, which can be combined with other embodiments described herein, the representation is a Joint Telescoping Torques Representation (jTTR). The Joint Telescoping Torques Representation (jTTR) builds on TTR and further tracks correlations between distributional variables. Briefly summarized, jTTR effectively tracks correlations between distributional variables by tracking the origins of each distributional variable from other distributional variables. This capability allows jTTR to accurately propagate uncertainty through many arithmetic operations where TTR falls short. For instance, a conversion algorithm can be used to take in a sequence of multidimensional samples (each of the same dimensionality) and output a jTTR for each variable corresponding to a dimension.

[0025] According to some embodiments, which can be combined with other embodiments described herein, the calculations on the distribution of the price include, or are, arithmetic operations on the representation of the distribution of the price of the financial instrument.

[0026] According to some embodiments, which can be combined with other embodiments described herein, the arithmetic operation is selected from the group including, or consisting of, addition, subtraction, multiplication, division and exponentiation.

[0027] According to some embodiments, which can be combined with other embodiments described herein, the method further includes generating a tensor based on a result of the arithmetic operations. This tensor train decomposition avoids an exponential increase in memory required when combining distributions by means of arithmetic operations, by only storing the building blocks and not the complete final tensor. By only storing the most vital building blocks, such as the input Dirac delta distributions along with some arrays of the same length, any entry of the final output tensor can be accessed and operated on, via a set of rules provided by the tensor train decomposition, so that the entire tensor never has to be stored in the computer’s memory.

[0028] According to some embodiments, which can be combined with other embodiments described herein, the future price information obtained by executing the stochastic model is a distribution of the future price of the financial instrument. Accordingly, the output of the stochastic model is a distribution, particularly a probability distribution, rather than a single value.

[0029] According to some embodiments, which can be combined with other embodiments described herein, the method is deterministic, giving the same result each time. In contrast, Monte Carlo methods produce slightly different results with each execution due to their reliance on random sampling.

[0030] According to some embodiments, which can be combined with other embodiments described herein, the method provides at least 100% speed-up or at least 400% speed-up as compared with Monte Carlo simulations.

[0031] According to some embodiments, which can be combined with other embodiments described herein, the method operates without utilizing a Monte Carlo process.

[0032] According to another independent aspect of the present disclosure, a computer program is provided. The computer program has instructions which, when the program is executed by a computer, cause the computer to carry out the method for price prediction according to the embodiments described in this document.

[0033] According to another independent aspect of the present disclosure, a computer-readable storage medium is provided. The computer-readable storage medium has instructions which, when the program is executed by a computer, cause the computer to carry out the method for price prediction according to the embodiments described in this document.

[0034] Preferably, the computer-readable storage medium is a non-transitory computer-readable storage (or memory) medium. The term “non-transitory computer-readable storage (or memory) medium” includes, but is not limited to, magnetic or optical media, e.g., disks, hard drives, or DVDs / CD-ROMs, volatile or non-volatile media such as RAM (e g., SDRAM, DDR, RDRAM, SRAM, etc ), ROM, etc. The non-transitory computer-readable storage (or memory) medium may include other types of non-transitory memory as well or combinations thereof.

[0035] According to another independent aspect of the present disclosure, a data processing system for price prediction is provided. The data processing system includes one or more processors configured to execute a stochastic model configured to predict future price information of a financial instrument in a single path having multiple steps by propagating a distribution of a price of the financial instrumentthrough the multiple steps to predict the future price information of the financial instrument, wherein the distribution of the price indicates potential values (or changes) of the price, and wherein each step of the multiple steps of the single path performs calculations on the distribution of the price.

[0036] According to some embodiments, which can be combined with other embodiments described herein, the one or more processors of the data processing system are configured to perform the computer-implemented method for price prediction according to the embodiments described in this document.BRIEF DESCRIPTION OF THE DRAWINGS

[0037] So that the manner in which the above recited features of the present disclosure can be understood in detail, a more particular description of the disclosure, briefly summarized above, may be had by reference to embodiments. The accompanying drawings relate to embodiments of the disclosure and are described in the following:FIG. 1 illustrates a schematic view of a conventional processor unit;FIG. 2 illustrates a schematic view of a processor unit according to embodiments of the present disclosure;FIG. 3 shows a flowchart of a method for price prediction according to embodiments of the present disclosure;FIG. 4 illustrates the construction of an uncertainty representation using Telescoping Torques Representation (TTR) according to embodiments of the present disclosure;FIG. 5 illustrates a bit-level layout of an uncertainty representation according to embodiments of the present disclosure;FIG. 6 illustrates a comparison between price predictions generated via a conventional Monte Carlo simulation approach and those obtained according to embodiments of the present disclosure;FIG. 7 illustrates another comparison between price predictions generated via a conventional Monte Carlo simulation and those obtained according to embodiments of the present disclosure; andFIG. 8 illustrates codes used in a Monte Carlo simulation and methods according to embodiments of the present disclosure.DETAILED DESCRIPTION OF EMBODIMENTS

[0038] Reference will now be made in detail to the various embodiments of the disclosure, one or more examples of which are illustrated in the figures. Within the following description of the drawings, the same reference numbers refer to same components. Generally, only the differences with respect to individual embodiments are described. Each example is provided by way of explanation of the disclosure and is not meant as a limitation of the disclosure. Further, features illustrated or described as part of one embodiment can be used on or in conjunction with other embodiments to yield yet a further embodiment. It is intended that the description includes such modifications and variations.

[0039] This document includes references to “one embodiment” or “an embodiment”. The appearances of the phrases “in one embodiment” or “in an embodiment” do not necessarily refer to the same embodiment. Particular features, structures, or characteristics may be combined in any suitable manner consistent with this disclosure.

[0040] The term “comprising” is open-ended. As used in the appended claims, this term does not foreclose additional structure or steps.

[0041] Various units, circuits, or other components may be described or claimed as “configured to” perform a task or tasks. In such contexts, “configured to” is used to connote structure by indicating that the units / circuits / components include structure (e.g., circuitry) that performs those task or tasks during operation. As such, the unit / circuit / component can be said to be configured to perform the task even when the specified unit / circuit / component is not currently operational (e.g., is not on). The units / circuits / components used with the “configured to” language include hardware for example, circuits, memory storing program instructions executable to implement the operation, etc. Additionally, “configured to” can include generic structure (e.g., generic circuitry) that is manipulated by software and / or firmware (e.g., an FPGA or a general-purpose processor executing software) to operate in manner that is capable of performing the task(s) at issue.

[0042] As used herein, the terms “first”, “second”, etc. are used as labels for nouns that they precede, and do not imply any type of ordering (e.g., spatial, temporal, logical, etc.).

[0043] As used herein, the term “based on” is used to describe one or more factors that affect a determination. This term does not foreclose additional factors that may affect a determination. That is, a determination may be solely based on those factors or based, at least in part, on those factors. Consider the phrase “determine A based on B.” While in this case, B is a factor that affects the determination of A, such a phrase does not foreclose the determination of A from also being based on C. In other instances, A may be determined based solely on B.

[0044] As used herein, the term “uncertainty tracking” (or uncertainty propagation) is the process of monitoring how uncertainties in data affect the outputs of sequences of arithmetic operations that act on the data. For a given data item, such as a measurement, the term uncertainty quantification (UQ) is usually used to refer to the process of quantifying the uncertainty of that single data item.

[0045] FIG. 1 illustrates a schematic view of a conventional processor unit 100.

[0046] A conventional processor unit 100 takes as input integer and floating-point values and performs operations (e g., ADD, SUBSTRACT, MULTIPLY, XOR, etc.) on these data. The results of these operations are other integers or floating-point values.

[0047] Uncertainties in input data such as stock market data are not uncommon, but the conventional processor unit 100 can only deal with integer and floating-point values which are certain and unambiguous. Therefore, the software running over the conventional processor unit 100 eliminates the uncertainties e.g. by averaging the variations across readings into a single number or sampling from the input data, to get the data into a form that the conventional processor unit 100 can handle, before feeding the averaged data into subsequent steps.

[0048] However, in many cases, the accuracy of the processing result is critical. For example, the ability to accurately estimate the future price of a financial instrument, such as equities (e.g., shares of companies) or commodities (e.g., the price per barrel of oil), is of great importance to financial institutions. Such predictions are integral to decision-making processes for investment strategies, risk management, and pricing financial derivatives.

[0049] In such financial use cases, Monte Carlo methods can be employed, involving generating hundreds of thousands, or even millions, of these price paths to ensure sufficient convergence on an accurate prediction. However, the computational demands of these repeated simulations are substantial. In particular, repeating the calculations required for multiple potential paths of a financial instrument’s price can be time-consuming, and completing these calculations within a practical timeframe is not always feasible. For example, when attempting to predict the price of a financial instrument one hour into the future, the calculation must be completed in under an hour to ensure the prediction remains actionable. At the same time, it is crucial that the calculation is not only fast but also accurate, striking a balance between computational efficiency and precision.

[0050] The embodiments of the present disclosure address the aforementioned drawback by representing each price of the financial instrument as a distribution and propagating this distribution through a stochastic model in a single path (or iteration), enabling uncertainty tracking of the price. This approach eliminates the need for repeated Monte Carlo simulations and effectively reduces the process to calculating a single “path”. At each step along this path, the entire distribution of potential price movements, both upward and downward, is considered. This allows for prediction calculations that are not only faster but also more accurate, enabling users of financial models to make well-informed decisions and capitalize on accurately predicting future price movements.

[0051] In one exemplary embodiment of the present disclosure, the conventional processor unit 100 according to FIG. 1 can be extended without any hardware reconfiguration using the concept of representation described later, particularly the so-called Telescoping Torques Representation (TTR). TTR enables the uncertainty tracking of the present disclosure on the conventional processor unit 100 without any hardware re-configuration.

[0052] FIG. 2 illustrates a schematic view of a processor unit 200 according to embodiments of the present disclosure, which is capable of processing and operating on a distribution of the price of a financial instrument. The processor unit 200 is a dedicated architecture, such as the architecture disclosed in WO 2022 / 248719 Al operating on a representation according to WO 2022 / 248719 Al and WO 2022 / 248714 Al.

[0053] The processor unit 200 of the present disclosure can take in as input the same kinds of integer and floating-point values and perform the same kinds of operations (e g., ADD, SUBSTRACT, MULTIPLY, XOR, etc.) as the conventional processor unit. In addition to that, the values taken as input include additional information about distributions associated with each value, and the processor unit 200 may also, in someembodiments, enable a number of additional operations that act on the distribution information (e.g., BAYESLAPLACE, SAMPLEFROMDIST, etc ).

[0054] In some embodiments, the processor unit 200 includes additional and / or specifically configured hardware to enable the uncertainty tracking of the present disclosure. For example, the processor unit 200 may include a floating-point register file. The floating-point register file may include a first register file for storing particle data items and a second register file for storing distribution data. The distribution data represents distributions (e.g., TTR) that encode the uncertainty associated with the corresponding particle data items in the first register file.

[0055] A first arithmetic logic unit (ALU) can be configured to perform arithmetic operations on particle data items selected from the first register file, while a second arithmetic logic unit (ALU) can be configured to perform arithmetic operations on distribution data selected from the second register file. For example, the first arithmetic logic unit may execute an arithmetic operation (e.g., addition, subtraction, multiplication, or division) on two floating-point particle data items selected from the first register file and output the result. Simultaneously, the second arithmetic logic unit may execute the same arithmetic operation on two corresponding distribution data items selected from the second register file, which are associated with the particle data items used in the first arithmetic operation. The second arithmetic logic unit then outputs the result.

[0056] Notably, the arithmetic operation applied to the distribution data in the second register file is identical to the arithmetic operation applied to the particle data items in the first register file. Consequently, the output of the second arithmetic logic unit represents updated distributional data that quantifies the uncertainty associated with the result of the arithmetic operation performed on the particle data items.

[0057] A detailed description of an exemplary embodiment of the processor unit 200 is provided in WO 2022 / 248719 Al.

[0058] In some embodiments, the processor unit 200 of the present disclosure can be implemented as a virtualization layer. By moving the tracking of price uncertainty into a virtualization layer below the user’s software, applications can be significantly simplified. Furthermore, the implementation of the processor unit 200 as a virtualization layer means users can benefit from advances in the technology as it evolves, without needing to rewrite the user’s software.

[0059] FIG. 3 shows a flowchart of a method 300 for price prediction according to embodiments of the present disclosure.

[0060] The method 300 includes providing a stochastic model configured to predict future price information of a financial instrument (block 310); and executing the stochastic model in a single path (or iteration) having multiple steps by propagating a distribution of a price of the financial instrument through the multiple steps to predict the future price information of the financial instrument, wherein the distribution of the price indicates potential values (or changes) of the price, and wherein each step of the multiple steps of the single path (or iteration) performs calculations on the distribution of the price (block 320).

[0061] In some embodiments, the financial instrument is selected from the group including, or consisting of, equities, commodities, currencies, bonds and derivatives. Equities represent shares of ownership in a company, granting shareholders a stake in the company's assets and earnings. Commodities refer to tangible goods such as oil, gold, or agricultural products, whose values are primarily influenced by fluctuations in supply and demand, along with geopolitical and economic factors. Currencies refer to the exchange rates between different monetary units, such as USD / EUR, representing the relative value of one currency compared to another. Bonds are fixed-income securities that represent a loan made by an investor to a borrower, typically a corporation or government, with their prices fluctuating based on changes in interest rates and overall market conditions. Derivatives are financial contracts, such as optionsand futures, whose value is derived from the performance of underlying assets, such as stocks, commodities, or indices.

[0062] The stochastic model used in the embodiments of the present disclosure has multiple steps. This refers to a framework where the evolution of the price of the financial instrument is modeled as a sequence of calculations or simulations performed over discrete time intervals or steps. Each step represents a small increment of time in the progression of the model and incorporates the effects of deterministic trends and random fluctuations.

[0063] In some embodiments, the stochastic model is based on at least one stochastic differential equation. A stochastic differential equation (SDE) is a mathematical equation used to model the evolution of a system, such as the price of a stock or commodity, over time under the influence of both deterministic trends and random fluctuations. Specifically, SDEs describe how the price changes as a function of time, incorporating predictable factors like long-term growth or decay (drift) and unpredictable, random factors (volatility) arising from market noise or external events.

[0064] In some embodiments, the stochastic model is selected from the group consisting of a Geometric Brownian Motion Model, an Integrated Brownian Motion Model, a Correlated Brownian Motion Model, a Mean-Reverting Model, a Jump-Diffusion Model and a Stochastic Volatility Model.

[0065] The Geometric Brownian Motion (GBM) Model describes prices as evolving continuously with a combination of predictable drift and random noise. The mathematical representation involves the price at a given time, the expected return (or drift), volatility (which measures the variation of returns), and a Wiener process representing the randomness. GBM assumes that percentage changes in prices are normally distributed and that prices grow exponentially over time.

[0066] The Integrated Brownian Motion (IBM) Model refers to the process obtained by taking the (deterministic) time-integral of a standard Brownian motion (also called a Wiener process), and is a continuous Gaussian process with zero mean.

[0067] The Correlated Brownian Motion (CBM) Model refers to a collection (or vector) of Brownian motions that exhibit specified correlation relationships among their components. In other words, instead of having each component of the vector be an independent Brownian motion, these components have a prescribed correlation structure.

[0068] The Mean-Reverting Model, such as the Omstein-Uhlenbeck process, is particularly useful for assets that tend to oscillate around a historical average or equilibrium level. Here, the price dynamics include a term that pulls the price back toward the long-term mean, combined with a random noise component.

[0069] For scenarios where sudden, significant price changes occur - such as market crashes or major news events - Jump-Diffusion Models provide a more realistic representation by incorporating random jumps alongside the usual drift and volatility.

[0070] Stochastic Volatility Models allow the volatility itself to change over time. This is achieved by coupling the price process with another stochastic process that governs volatility. For instance, the Heston model is a widely used approach that captures the dynamic behavior of volatility and its impact on prices.

[0071] The distribution of the price which is propagated through the stochastic model to predict the future price information indicates potential values (or changes) of the price according to predictable factors and random factors. Predictable factors refer to the components of the stochastic model that represent the systematic and relatively stable influences on a financial instrument’s price over time. These factors can be derived from historical trends, economic indicators and / or inherent characteristics of the asset and can be modeled deterministically. Random factors refer to the stochastic elements that introduce uncertainty and volatility into the price of a financialinstrument. These factors arise from external events and / or market behaviors that are inherently random and cannot be forecast with certainty.

[0072] In the embodiments of the present disclosure, the stochastic model is executed in a single iteration. According to some embodiments, which can be combined with other embodiments described herein, the single iteration is a single iteration at processor level. At the processor level, a single iteration refers to the execution of a single cycle or pass of a loop or operation in a computational program, as executed by the processor’s hardware. This iteration involves, for example, the processor fetching instructions from memory, decoding them, performing arithmetic or logical operations, and then storing the results. In contrast thereto, at the computing level, a single iteration represents the completion of one entire cycle of a computational algorithm or simulation. This involves higher-level operations and may consist of multiple processor-level iterations working together to achieve a larger task. For example, a single iteration in a Monte Carlo Simulation could involve generating a random input, solving a stochastic differential equation, and recording the outcome as one path.

[0073] In some embodiments, the future price information obtained by executing the stochastic model is a distribution of the future price of the financial instrument. Accordingly, the output of the stochastic model is a distribution, particularly a probability distribution, rather than a single value.

[0074] According to some embodiments, which can be combined with other embodiments described herein, the distribution of the price of the financial instrument which is propagated through the stochastic model is in the form of a representation. The representation can be a representation of probability mass / density functions / distributions on which the processor unit shown in FIG. 2 can operate.

[0075] The representations also enable commonly utilized processor architectures, such as the one shown in FIG. 1, to operate on the distribution of the price of the financial instrument. In the same way that each value of a floating-point register is anapproximate representation of a real-valued number, a value in the representation is an approximation of an entire probability mass function.

[0076] A generation representation algorithm can be used to generate the representation of the distribution of the price of the financial instrument from a data set, such as stock market data. In some embodiments, two or more parameters can be used to construct the representation from the data set, i.e., a distribution of data items.

[0077] In preferred embodiments, the representation generation algorithm can be configured to generate the representation of the distribution of the price of the financial instrument according to a position parameter and a probability parameter, wherein each position of the representation is represented by a Dirac delta associated with a corresponding probability mass. In other words, each real number in the domain of a distribution may be described by a Dirac delta with some probability mass located at the value of the real number.

[0078] It should be noted that this representation of a distribution in terms of Dirac deltas within its domain is different from that of a probability mass function (PMF), where the domain of the distribution is by definition discrete-valued and integrals can be replaced with sums. Because the Dirac delta distribution is not a function but rather is a distribution, the representation of distributions as a collection of Dirac deltas is also different from the concept of a probability density function (PDF). In general, one can call a representation made up of Dirac deltas that integrate to 1.0 (i.e., their probability masses sum up to 1.0) as probability density distribution (PDD) which is a generalization of a probability density function (PDF) (i.e., a PDF is a PDD but not vice versa).

[0079] The so-called Telescoping Torques Representation (TTR) of preferred embodiments of the present disclosure is described below with reference to FIGs. 4 and 5 as an example of an uncertainty representation of a distribution in terms of Dirac deltas. The Telescoping Torques Representation is described in detail in WO 2022 / 248719 Al and WO 2022 / 248714 Al. It is to be understood that the presentdisclosure is not limited thereto and that other methods may also be used to construct representations, such as Moment-Quantized Histogram Representation (MQHR) described in WO 2022 / 248714 Al.

[0080] The Telescoping Torques Representation (TTR) builds on Dirac mixture representation of a set of samples of a data set, and is based on the intuition that the mean value of a set of samples splits the samples into two sets with equal products of the distance of the sample from the mean and the sample probability. This allows the representation to capture the effects of unlikely, yet high-value samples even at small representation sizes.

[0081] Let srcDM be a Dirac delta mixture representing a set of samples from a random variable X. Let N be a positive integer power of two. srcDM is converted to a distribution representation of size N using a recursive conversion algorithm of log2N steps. The number of steps is what is referred to as the “order” of the distribution representation. At each step, the algorithm splits a source Dirac mixture on its mean value m, creating two Dirac mixtures, which are the inputs of two new calls to the algorithm.

[0082] FIG. 4 illustrates an example for N = 4, starting from a SFCDM of 10 data samples (e.g., particles). The top part of FIG. 4 illustrates the source Dirac mixture. The mixture consists of 10 Dirac deltas, each one positioned at the value of each unique source sample. The mass of each Dirac delta is 0.1, equal to the relative frequency of each data sample (e.g., particle).

[0083] In step 0 of FIG. 4(a), the conversion algorithm creates the zeroth-order distribution representation of SI-CDM, which is a single Dirac delta located at the mean (m = 14.85) of srcDM (i.e., t / po^O] = 14.85) with probability mass equal to the sum of the probability masses of all Dirac deltas in srcDM (t / ma^[0] = 1.0).

[0084] In step 1 of FIG. 4(b), the conversion algorithm splits SVCDM into two mixtures (each with five Dirac deltas in this example) using m as a boundary, and repeats theprocess for each sub-mixtures. FIG. 4(b) illustrates the first-order distribution representation of SI-CDM, which has two Dirac deltas positioned at the mean value of the Dirac mixture splits of step 0, each one with probability mass equal to 0.5.

[0085] In step 2 of FIG. 4(c), the conversion process repeats, further splitting the sub-mixtures of step 1, into two parts. This results in four sub-mixtures and four Dirac deltas that represent them and form the second-order distribution representation of srcDM. FIG. 4(c) illustrates that the split in step 1 results in sub-mixtures that each contain a different number of source particle samples. Step 2 represents these submixtures with Dirac deltas with unequal probability masses. FIG. 4(c) illustrates that the outcome Dirac deltas retain the property of equal probability torque on the two sides of the mean value of the source Dirac mixture (m = 14.85).

[0086] A Dirac mixture requires only a series of positions x, and probability masses pt for its description. In the embodiments of the present disclosure, price position and price probability may correspond to these position and probability masses of the Dirac representation, respectively. It should be noted that the terms “position” and “support position” can be used interchangeably in this context. For a random variable d, its Dirac mixture of size N is an array of IN floating-point values. For the zth Dirac delta of the mixture, the position dpos\i\ and probability mass dmass[i\ are stored.

[0087] FIG. 5 illustrates a one bit-level layout for such a Dirac mixture. The set of bits in the layout allow a concise in-processor representation for the distribution of the price of the financial instrument.

[0088] The above sum-of-Dirac-deltas representation may be described in the form of a set of tuples (i.e., a set of n ^-tuples). A reference herein to a “tuple” may be considered to include a reference to a data structure consisting of multiple parts defining an ordered set of data constituting a record, as is commonly understood in the art. In mathematics, a tuple is a finite sequence of elements such as a sequence of k elements, where & is a positive integer (e.g., k=2).

[0089] In a preferred embodiment of the present disclosure, each tuple of the set of tuples is a real or integer value (i.e., the position parameter) paired with a probability (i.e., the probability mass, which is a real number between 0 and 1), and the probabilities in the set of tuples add up to 1. Accordingly, each tuple provides a data structure within which distributional information is encoded. These tuples are distinct from the probability distribution itself.

[0090] Tuples other than those described above may be used in further embodiments. For example, (m+l)-tuples could be defined where the first m numbers tell the m-dimensional position of a Dirac delta, and the last number is the probability mass of the Dirac delta.

[0091] The present disclosure is not limited to the above sum-of-Dirac-deltas representation, and other representations can be constructed from the real-world data set. Another example of such an uncertainty representation is an interval-based representation (i.e., a piecewise constant approximation of the probability density function, which is equivalent to the linear approximation of the CDF) having n intervals and k = 3 (start of interval, end of interval, density value).

[0092] Yet another example is a mixture of these, e.g., consider a distribution density with half of its mass in a uniform [0, 1] (thus, exactly represented by the 3 -tuple (0, 1, 0.5)) and the other half is degenerate at the value 2 (thus, exactly represented by the 2-tuple (2, 0.5)). A degenerate distribution can be understood as a probability distribution that has a non-trivial probability mass (i.e., greater than 0) concentrated in a lower dimensional manifold than the dimension of the range of the corresponding random variable (a.k.a., the domain of the distribution). In particular, any 1-dimensional PDD that contains a Dirac delta with probability mass greater than 0 is a degenerate distribution (which has nontrivial mass concentrated at a point, a 0-dimensional manifold).

[0093] The representations described above make it unnecessary to generate large numbers of samples in order to sample the tails of a distribution (i.e., to converge) inthe terminology of Monte Carlo methods. In particular, the representation (and subsequent computation using it) makes irrelevant the whole idea of having to keep running Monte Carlo iterations until one has sampled far enough out into the tails (of the inputs as well as of the results).

[0094] In further embodiments, the TTR is a Joint Telescoping Torques Representation (jTTR). The Joint Telescoping Torques Representation (jTTR) builds on TTR and further tracks correlations between distributional variables. Briefly summarized, jTTR effectively tracks correlations between distributional variables by tracking the origins of each distributional variable from other distributional variables. This capability allows jTTR to accurately propagate uncertainty through many arithmetic operations where TTR falls short. For instance, a conversion algorithm can be used to take in a sequence of multidimensional samples (each of the same dimensionality) and output a jTTR for each variable corresponding to a dimension. The Joint Telescoping Torques Representation is described in detail in PCT / EP2025 / 067326.

[0095] The memory layout of the Dirac deltas of a jTTR follows a tZ-dimensional discrete hypercube where d is the number of ancestors the jTTR has (each of the distributional variables that a given distributional variable originates from can be referred to as its ancestor, and the totality of such distributional variables as its ancestors'). The d ancestors are thought to make up the dimensions of the discrete hypercube. Each dimension of the discrete hypercube has N entries where N is the representation size for a TTR (which is equivalent to a single-dimensional jTTR). One can refer to N as the base representation size for the j TTR. In general, a j TTR will have NdDirac deltas, coming from the N possible distinct entries per dimension of the d-dimensional discrete hypercube. Based on this model, each Dirac delta of a jTTR will have a tZ-dimensional index that uniquely identifies it. The jTTR algorithm makes use of this indexing framework to track the origins of Dirac deltas in order to compute with them in a way that respects correlations.

[0096] The challenge with jTTR is the exponential computer memory and compute requirements as the number of ancestors d and representation size N increases. Some embodiments of the present disclosure represent the output tensor (hypercube) in computer memory as a tensor train, such as a low-rank tensor train. In particular, the hypercube can be decomposed according to a tensor train. This reduces the computer memory requirements from exponential to linear complexity. Another feature of the tensor train representation is that a large subset of statistics of the output distribution can be computed directly in tensor train format, avoiding the need to either form the full tensor (losing the memory gained) or access each element of the output tensor (exponential in compute). As a result, a large number of user applications can be addressed with linear complexity in memory and compute, without losing any correlation information.

[0097] The tensor can be represented by a collection of small three-dimensional cores that can be strung together to individually access elements of the final tensor. That means, a tZ-dimensional tensor in tensor train form will have d cores (one for each dimension) and that the index T{n, t2, , id}, can be computed by multiplying together slice number ix from the x-th core with the other slices. For this reason, the left dimension of the first core must be one and so also the right dimension of the last core, so that the output is a scalar.

[0098] The efficiency of the tensor train representation depends on the matrix rank of each respective dimension. Since jTTR multiplication is an outer product of two vectors, which is a rank 1 matrix operation, the tensor train representation can be kept in that exact rank 1 form. Addition, on the other hand, is a rank 2 operation, and hence the maximum dimension of the tensor train cores that involve an addition operation will be two. As a result, for latent variables formed by only multiplication or division, all d tensor train cores will have dimension 1 x Nx 1. For latent variables that have undergone an addition or subtraction operation as well, the extreme cores will have dimensions 1 xN x 2 and 2 xN x 1 respectively, whereas the central cores will havedimension 2 xNx 2. As a result, one ends up storing a linear number of A-vectors with an increase in the number of dimensions.

[0099] The tensor train concept is described in detail in EP25189608.

[0100] FIG. 6 illustrates a comparison between price predictions generated via a conventional Monte Carlo simulation approach (FIG. 6(a)) and those obtained according to embodiments of the present disclosure (FIG. 6(b)).

[0101] The Monte Carlo simulation involves generating hundreds of thousands, or even millions, of price paths over time t to ensure sufficient convergence on an accurate prediction. However, the computational demands of these repeated simulations are substantial. In particular, repeating the calculations required for multiple potential paths of a financial instrument’s price can be time-consuming, and completing these calculations within a practical timeframe is not always feasible. For example, when attempting to predict the price of a financial instrument one hour into the future, the calculation must be completed in under an hour to ensure the prediction remains actionable. At the same time, it is crucial that the calculation is not only fast but also accurate, striking a balance between computational efficiency and precision.

[0102] The embodiments of the present disclosure address the aforementioned drawback by representing each price of the financial instrument as a distribution and propagating this distribution through a stochastic model in a single path, enabling uncertainty tracking of the price. This approach eliminates the need for repeated Monte Carlo simulations and effectively reduces the process to calculating a single “path”, as shown in FIG. 6(b). At each step along this path, the entire distribution of potential price movements, both upward and downward, is considered. This allows for prediction calculations that are not only faster but also more accurate, enabling users of financial models to make well-informed decisions and capitalize on accurately predicting future price movements.

[0103] FIG. 7 illustrates a comparison between price predictions generated via a conventional Monte Carlo simulation approach (FIG. 7(a)) and those obtained according to embodiments of the present disclosure (FIG. 7(b)).

[0104] The traditional Monte Carlo-based path simulation is slower and requires hundreds of thousands of paths to reach convergence. The one-pass execution using distributions is faster and achieves more accurate results than the traditional Monte Carlo-based path simulation.

[0105] FIG. 8 illustrates codes used in a Monte Carlo simulation (FIG. 8(a)) and methods according to embodiments of the present disclosure (FIG. 8(b)). It is clear from the exemplary code snippet in FIG. 8(b) that, with the embodiments of the present disclosure, the outer loop requiring to iterate the algorithmic steps of the finance models through each output provided by each Monte-Carlo simulation is deleted and replaced by a single computation path for each algorithmic step.

[0106] Although the code in FIG. 8(b) employs a particular representation, the choice of which representation to implement can vary based on the desired balance between distribution accuracy and computational speed.

[0107] The methods described herein may be implemented in software, hardware, or a combination thereof, in different embodiments. In addition, the order of the blocks of the methods may be changed, and various elements may be added, reordered, combined, omitted, modified, etc. Various modifications and changes may be made as would be obvious to a person skilled in the art having the benefit of this disclosure. The various embodiments described herein are meant to be illustrative and not limiting. Many variations, modifications, additions, and improvements are possible. Accordingly, plural instances may be provided for components described herein as a single instance. Boundaries between various components, operations and data stores are somewhat arbitrary, and particular operations are illustrated in the context of specific illustrative configurations. Other allocations of functionality are envisioned and may fall within the scope of claims that follow. Finally, structures andfunctionality presented as discrete components in the example configurations may be implemented as a combined structure or component. These and other variations, modifications, additions, and improvements may fall within the scope of embodiments as defined in the claims that follow.

Claims

1. CLAIMS1. A computer-implemented method for price prediction, comprising:providing a stochastic model configured to predict future price information of a financial instrument; andexecuting the stochastic model in a single path having multiple steps by propagating a distribution of a price of the financial instrument through the multiple steps to predict the future price information of the financial instrument, wherein the distribution of the price indicates potential values of the price, and wherein each step of the multiple steps of the single path performs calculations on the distribution of the price.

2. A computer-implemented method of claim 1, wherein:the stochastic model is based on at least one stochastic differential equation; and / orthe stochastic model is selected from the group consisting of a Geometric Brownian Motion Model, an Integrated Brownian Motion Model, a Correlated Brownian Motion Model, a Mean-Reverting Model, a Jump-Diffusion Model and a Stochastic Volatility Model.

3. A computer-implemented method of claim 1 or 2, wherein the distribution of the price indicates potential values of the price according to predictable factors and random factors.

4. A computer-implemented method of any one of claims 1 to 3, wherein the single path is a single path at processor level.

5. A computer-implemented method of any one of claims 1 to 4, wherein the calculations on the distribution of the price are performed on a processor architecture that is configured to represent and perform arithmetic on probability distributions.

286. A computer-implemented method of any one of claims 1 to 5, wherein the distribution of the price of the financial instrument is in the form of a representation, in particular wherein the representation includes price position parameters and associated price probability parameters.

7. A computer-implemented method of claim 6, wherein the representation is in the form of a sum of Dirac deltas.

8. A computer-implemented method of claim 6 or 7, wherein the representation is a Telescoping Torques Representation.

9. A computer-implemented method of any one of claims 7 to 8, wherein the calculations on the distribution of the price include, or are, arithmetic operations on the representation of the distribution of the price of the financial instrument.

10. A computer-implemented method of claim 9, wherein the arithmetic operations are selected from the group consisting of addition, subtraction, multiplication, division and exponentiation.

11. A computer-implemented method of any one of claims 1 to 10, wherein the future price information obtained by executing the stochastic model is a distribution of the future price of the financial instrument.

12. A computer-implemented method of any one of claims 1 to 11, wherein:the method is deterministic, giving the same result each time; and / orthe method provides at least 100% speed-up or at least 400% speed-up as compared with Monte Carlo simulations.

13. A computer program comprising instructions which, when the program is executed by a computer, cause the computer to carry out the method of any one of claims 1 to 12.

14. A computer-readable storage medium comprising instructions which, when the program is executed by a computer, cause the computer to carry out the method of any one of claims 1 to 12.

15. A data processing system for price prediction, comprising one or more processors configured to:execute a stochastic model configured to predict future price information of a financial instrument in a single path having multiple steps by propagating a distribution of a price of the financial instrument through the multiple steps to predict the future price information of the financial instrument, wherein the distribution of the price indicates potential values of the price, and wherein each step of the multiple steps of the single path performs calculations on the distribution of the price.