Randomized Neural Networks for estimation of exposure profiles and Credit Valuation Adjustment (CVA) for American Equity Options
Pith reviewed 2026-06-25 21:54 UTC · model grok-4.3
The pith
Randomized neural networks estimate exposure and CVA for American options with the same accuracy as least-squares Monte Carlo but lower cost in high dimensions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The randomized feedforward neural network approach preserves convergence to the LSM benchmark when it is extended from pricing to exposure and CVA estimation, while its main advantage appears in high-dimensional problems, where it scales more efficiently and leads to lower computational cost.
What carries the argument
Randomized feedforward neural network trained on simulated paths to estimate continuation values for American exercise decisions inside Monte Carlo exposure and CVA calculations.
If this is right
- The method produces usable exposure and CVA numbers for both Black-Scholes and Heston American options.
- Portfolio netting effects are captured at the same accuracy level as LSM.
- Pathwise convergence and sensitivity tests remain stable when the network replaces LSM regression.
- Computational cost stays lower than LSM once the dimension exceeds a few assets.
Where Pith is reading between the lines
- The same network could replace LSM regression steps in other path-dependent claims such as Bermudan swaptions.
- Real-time risk systems for large equity baskets might become feasible if training cost stays sub-linear in dimension.
- Extending the approach to stochastic interest rates would require only adding the rate factors to the network inputs.
Load-bearing premise
Once trained on the same paths used by LSM, the randomized network produces continuation-value estimates whose statistical properties stay close enough to LSM that downstream exposure and CVA numbers do not diverge materially.
What would settle it
Run the identical high-dimensional test set and observe whether the neural-network CVA or exposure profile deviates from the LSM benchmark by more than Monte Carlo sampling error.
Figures
read the original abstract
This thesis studies the use of randomized neural networks for the estimation of exposure profiles and unilateral CVA of American options within a Monte Carlo framework. The analysis is carried out separately under both Black-Scholes and Heston dynamics, combining American option valuation, expected exposure and potential future exposure estimation, and unilateral CVA calculation with portfolio netting effects. The numerical experiment compares this approach with the classical Least-Squares Monte Carlo (LSM) used as a benchmark in both low-dimensional single-asset and high-dimensional multi-asset scenarios, and also includes a path convergence test and a sensitivity analysis. The results show that the randomized feedforward neural network approach preserves convergence to the LSM benchmark when it is extended from pricing to exposure and CVA estimation, while its main advantage appears in high-dimensional problems, where it scales more efficiently and leads to lower computational cost. These results support the use of randomized neural networks as a useful alternative for exposure and CVA estimation in high-dimensional American-style options.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes the use of randomized feedforward neural networks within a Monte Carlo simulation framework to estimate exposure profiles (EE, PFE) and unilateral CVA for American equity options. The method is tested separately under Black-Scholes and Heston dynamics, for single-asset and multi-asset portfolios (with netting), and is benchmarked against Least-Squares Monte Carlo (LSM). Numerical results include path-convergence tests and sensitivity analysis; the central claim is that the randomized NN approach converges to the LSM benchmark while offering computational scaling advantages in high-dimensional settings.
Significance. If the empirical convergence holds, the work supplies a practical, lower-cost alternative to LSM for CVA and exposure calculations on American-style claims in high-dimensional equity portfolios. The explicit inclusion of both dynamics, netting, and path-convergence diagnostics strengthens the applicability claim for risk-management use cases.
major comments (2)
- [§4.2] §4.2 (high-dimensional experiments): the reported CPU-time savings versus LSM are stated without accompanying standard errors or number of independent runs; it is therefore unclear whether the observed speed-up is statistically distinguishable from run-to-run Monte Carlo variability.
- [Table 5] Table 5 (CVA values under Heston with netting): the absolute differences between NN and LSM are on the order of 1–3 % of the LSM value, yet no formal statistical test (e.g., paired t-test across paths) is supplied to confirm that these differences are consistent with Monte Carlo noise rather than systematic bias in the continuation-value estimator.
minor comments (3)
- [Abstract] The abstract states that “a path-convergence test … were performed” but does not report the observed convergence rates or the number of paths used; adding a short quantitative summary would improve readability.
- [§3.1] Notation for the randomized NN (weights drawn from which distribution, activation function, number of hidden units) is introduced only in §3.1; a compact table summarizing the hyper-parameters used in each experiment would aid reproducibility.
- [Figure 3] Figure 3 (exposure profiles) uses different y-axis scales across panels; aligning the scales or adding a common reference line would make visual comparison with LSM easier.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments. We address the two major comments point by point below. We agree that the suggested additions of statistical information will strengthen the manuscript and plan to incorporate them.
read point-by-point responses
-
Referee: [§4.2] §4.2 (high-dimensional experiments): the reported CPU-time savings versus LSM are stated without accompanying standard errors or number of independent runs; it is therefore unclear whether the observed speed-up is statistically distinguishable from run-to-run Monte Carlo variability.
Authors: We agree that reporting the number of independent runs and standard errors is necessary for a rigorous interpretation of the timing results. In the revised manuscript we will state the number of independent timing runs performed in §4.2 and include standard errors for the reported CPU times, allowing readers to confirm that the observed speed-ups exceed run-to-run Monte Carlo variability. revision: yes
-
Referee: [Table 5] Table 5 (CVA values under Heston with netting): the absolute differences between NN and LSM are on the order of 1–3 % of the LSM value, yet no formal statistical test (e.g., paired t-test across paths) is supplied to confirm that these differences are consistent with Monte Carlo noise rather than systematic bias in the continuation-value estimator.
Authors: We accept that a formal test would provide additional reassurance. In the revision we will compute and report a paired t-test (or equivalent) on the CVA estimates obtained from multiple independent Monte Carlo path sets, demonstrating that the observed 1–3 % differences lie within the range attributable to Monte Carlo noise rather than estimator bias. The test results and updated discussion will be added to the presentation of Table 5. revision: yes
Circularity Check
No significant circularity; empirical results rest on external LSM benchmark
full rationale
The paper's core claims are established through direct numerical comparison of randomized NN continuation values against the independent LSM benchmark on the same simulated paths, under both Black-Scholes and Heston dynamics, including path-convergence tests and sensitivity analysis for exposure and CVA. No derivation step reduces a reported CVA or exposure statistic to a fitted parameter or self-defined quantity by construction; the randomized NN is trained once and then evaluated for statistical closeness to LSM, which is an external method. No load-bearing self-citation chains or uniqueness theorems imported from the authors' prior work appear in the derivation. The work is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
and Broadie, M
Andersen, L. and Broadie, M. (2004). A primal-dual simulation algorithm for pricing multidi- mensional american options.Management Science, 50(9):1222–1234
2004
-
[2]
and Oosterlee, C
Andersson, K. and Oosterlee, C. W. (2021). A deep learning approach for computations of expo- sure profiles for high-dimensional bermudan options.Applied Mathematics and Computation, 408:126332. Bank for International Settlements (2018). Counterparty credit risk in basel iii – executive summary. FSI Executive Summary. Accessed 2026-04-15. Bank for Interna...
2021
-
[3]
Becker, S., Cheridito, P., and Jentzen, A. (2019). Deep optimal stopping.Journal of Machine Learning Research, 20(74):1–25
2019
-
[4]
and Scholes, M
Black, F. and Scholes, M. (1973). The pricing of options and corporate liabilities.Journal of Political Economy, 81(3):637–654
1973
-
[5]
Boyle, P. P. (1977). Options: A monte carlo approach.Journal of Financial Economics, 4(3):323–338
1977
-
[6]
and Alfonsi, A
Brigo, D. and Alfonsi, A. (2005). Credit default swap calibration and derivatives pricing with the ssrd stochastic intensity model.Finance and Stochastics, 9(1):29–42
2005
-
[7]
(2013).Counterparty Credit Risk, Collateral and Funding: With Pricing Cases for All Asset Classes
Brigo, D., Morini, M., and Pallavicini, A. (2013).Counterparty Credit Risk, Collateral and Funding: With Pricing Cases for All Asset Classes. John Wiley & Sons, Chichester
2013
-
[8]
and Cao, M
Broadie, M. and Cao, M. (2008). Improved lower and upper bound algorithms for pricing american options by simulation.Quantitative Finance, 8(8):845–861
2008
-
[9]
and Glasserman, P
Broadie, M. and Glasserman, P. (2004). A stochastic mesh method for pricing high-dimensional american options.Journal of Computational Finance, 7(4):35–72
2004
-
[10]
Cao, W., Wang, X., Ming, Z., and Gao, J. (2018). A review on neural networks with random weights.Neurocomputing, 275:278–287. Carri` ere, J. F. (1996). Valuation of the early-exercise price for options using simulations and nonparametric regression.Insurance: Mathematics and Economics, 19(1):19–30
2018
-
[11]
(2009).Modelling, Pricing, and Hedging Counterparty Credit Exposure: A Technical Guide
Cesari, G., Aquilina, J., Charpillon, N., Filipovic, Z., Lee, G., and Manda, I. (2009).Modelling, Pricing, and Hedging Counterparty Credit Exposure: A Technical Guide. Springer, Berlin and Heidelberg
2009
-
[12]
Cortis, D. (2019). Evaluating credit counterparty risk of american options via monte carlo methods.Frontiers in Applied Mathematics and Statistics, 5:60
2019
-
[13]
(2004).Monte Carlo Methods in Financial Engineering, volume 53 ofApplica- tions of Mathematics
Glasserman, P. (2004).Monte Carlo Methods in Financial Engineering, volume 53 ofApplica- tions of Mathematics. Springer, New York
2004
-
[14]
(2010).Counterparty Credit Risk: The New Challenge for Global Financial Markets
Gregory, J. (2010).Counterparty Credit Risk: The New Challenge for Global Financial Markets. John Wiley & Sons, Chichester
2010
-
[15]
(2015).The xVA Challenge: Counterparty Credit Risk, Funding, Collateral, and Capital
Gregory, J. (2015).The xVA Challenge: Counterparty Credit Risk, Funding, Collateral, and Capital. John Wiley & Sons, Chichester. 59
2015
-
[16]
Herrera, C., Krach, F., Ruyssen, P., and Teichmann, J. (2024). Optimal stopping via randomized neural networks.Frontiers of Mathematical Finance, 3(1):31–77
2024
-
[17]
Heston, S. L. (1993). A closed-form solution for options with stochastic volatility with applica- tions to bond and currency options.The Review of Financial Studies, 6(2):327–343
1993
-
[18]
Huang, G.-B., Chen, L., and Siew, C.-K. (2006). Universal approximation using incremental constructive feedforward networks with random hidden nodes.IEEE Transactions on Neural Networks, 17(4):879–892
2006
-
[19]
Hull, J. C. (2012).Options, Futures, and Other Derivatives. Pearson, 8 edition
2012
-
[20]
and Oosterlee, C
Jain, S. and Oosterlee, C. W. (2015). The stochastic grid bundling method: Efficient pricing of bermudan options and their greeks.Applied Mathematics and Computation, 269:412–431
2015
-
[21]
and Lelong, J
Lapeyre, B. and Lelong, J. (2021). Neural network regression for bermudan option pricing. Monte Carlo Methods and Applications, 27(3):227–247
2021
-
[22]
Longstaff, F. A. and Schwartz, E. S. (2001). Valuing american options by simulation: A simple least-squares approach.The Review of Financial Studies, 14(1):113–147. Lukoˇ seviˇ cius, M. and Jaeger, H. (2009). Reservoir computing approaches to recurrent neural network training.Computer Science Review, 3(3):127–149
2001
-
[23]
Merton, R. C. (1973). Theory of rational option pricing.The Bell Journal of Economics and Management Science, 4(1):141–183
1973
-
[24]
Rogers, L. C. G. (2002). Monte carlo valuation of american options.Mathematical Finance, 12(3):271–286
2002
-
[25]
Schrauwen, B., Verstraeten, D., and Van Campenhout, J. (2007). An overview of reservoir computing: Theory, applications and implementations. InProceedings of the 15th European Symposium on Artificial Neural Networks, pages 471–482
2007
-
[26]
Tsitsiklis, J. N. and Van Roy, B. (1999). Optimal stopping of markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives.IEEE Transactions on Automatic Control, 44(10):1840–1851
1999
-
[27]
Tsitsiklis, J. N. and Van Roy, B. (2001). Regression methods for pricing complex american-style options.IEEE Transactions on Neural Networks, 12(4):694–703
2001
-
[28]
and Pykhtin, M
Zhu, S. and Pykhtin, M. (2007). A guide to modelling counterparty credit risk.GARP Risk Review, pages 16–22. Issue 37. 60 List of Tables 1 Vanilla American call results under Black-Scholes dynamics. . . . . . . . . . . . . 39 2 Vanilla American call results under Heston dynamics. . . . . . . . . . . . . . . . 40 3 American max-call results under Black-Sch...
2007
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.