Modern Economy
Vol. 06, No. 05 (2015), Article ID: 56558, 14 pages
10.4236/me.2015.65055
Combining Internal Data with Scenario Analysis
Elias Karam, Frédéric Planchet
Laboratoire SAF EA2429, ISFA, Université Claude Bernard Lyon 1, Université de Lyon, Lyon, France
Email: ek.eliaskaram@gmail.com, frederic.planchet@univ-lyon1.fr
Copyright © 2015 by authors and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/



Received 8 April 2015; accepted 18 May 2015; published 22 May 2015
ABSTRACT
A Bayesian inference approach offers a methodical framework for combining internal data with experts' opinions. Joining these two elements with precision is certainly one of the challenges in operational risk. In this paper, we are interested in applying a Bayesian inference technique in a robust manner, so as to estimate a capital requirement that best approaches reality. In addition, we illustrate the importance of a consistent scenario analysis by showing that coherent expert opinions lead to a robust estimation of risk.
Keywords:
Bayesian Inference, Operational Risk, MCMC

1. Introduction
Under the new regulations of Basel II and Solvency II, many financial institutions have adopted a Loss Distribution Approach (LDA), consisting of a frequency and a severity distribution based on their own internal losses, to estimate their aggregate operational risk capital charge. Yet, basing our models on historical losses only might not be a robust approach, since no forward-looking information is taken into consideration, which can make the capital charge, defined as the 99.9% quantile of the loss distribution, biased with respect to reality. On the other hand, adding scenario analysis given by the experts provides, to some extent, a forward-looking vision.
The main idea in this article is the following: a Bayesian inference approach offers a methodical framework that combines internal data with scenario analysis. We first seek to integrate the information generated by the experts with our internal database; by working with conjugate family distributions, we determine a prior estimate. This estimate is then modified by integrating internal observations and experts' opinions, leading to a posterior estimate; risk measures are then calculated from this posterior knowledge. See [1] for more on the subject.
In the second half, we use the Jeffreys non-informative prior and apply Markov Chain Monte Carlo (MCMC) with the Metropolis-Hastings algorithm, thus removing the conjugate family restrictions and developing, as the article shows, a generalized application to set up a capital evaluation. For a good introduction to non-informative prior distributions and MCMC, see [2].
Combining these different information sources for model estimation is certainly one of the main challenges in operational risk.
Modelling frequency and severity of losses to estimate the annual loss distribution is a well-known actuarial technique, also used to model solvency requirements in the insurance industry; see e.g. [3] [4]. More literature on operational risk and Bayesian inference techniques can be found in [5] - [8].
2. Combining Two Data Sources: The Conjugate Prior
In our study, our data, related to the retail banking business line and the external fraud event type, is of size 279 losses, collected over 4 years. The data fits a Poisson(5.8) frequency distribution and a lognormal severity distribution. Applying Monte Carlo simulation [9] with these fitted parameters, we obtained a Value-at-Risk at 99.9% using internal losses only.
On the other hand, working with the scenario analysis, our experts gave us their assumption for the frequency parameter λ. As for the severity, our experts provided a histogram reflecting the probability that a loss falls in a given interval of losses (see Table 1 below).
If we consider our severity distribution to be lognormal with parameters μ and σ, the objective is to find the parameters (μ, σ) that adjust our histogram so as to approach as closely as possible the theoretical lognormal distribution. For this we can use the chi-squared statistic, which allows us to find the (μ, σ) that minimize the chi-squared distance
$$\chi^{2}(\mu, \sigma) = \sum_{k} \frac{\left(\hat{p}_{k} - p_{k}(\mu, \sigma)\right)^{2}}{p_{k}(\mu, \sigma)},$$
where $\hat{p}_{k}$ and $p_{k}(\mu, \sigma)$ are respectively the empirical and theoretical probability of the k-th loss interval. Our experts provided the histogram of Table 1, and by applying the chi-squared minimization we obtained our lognormal parameters and the corresponding scenario-based Value-at-Risk at 99.9%.
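To make this calibration step concrete, the following is a minimal sketch of fitting lognormal parameters to an expert histogram by minimizing the chi-squared distance above; the bucket boundaries, probabilities and starting point are illustrative placeholders, not the values of Table 1.

```python
# Sketch: fit lognormal parameters to an expert loss histogram by minimizing the
# chi-squared distance. Bucket edges and probabilities are hypothetical placeholders.
import numpy as np
from scipy.optimize import minimize
from scipy.stats import lognorm

edges = np.array([0.0, 1e3, 5e3, 1e4, 5e4, 1e5, np.inf])   # loss intervals
p_emp = np.array([0.30, 0.30, 0.20, 0.12, 0.05, 0.03])     # expert probabilities

def chi2_distance(params):
    mu, sigma = params
    # Theoretical probability of each bucket under Lognormal(mu, sigma)
    p_th = np.diff(lognorm.cdf(edges, s=sigma, scale=np.exp(mu)))
    p_th = np.clip(p_th, 1e-12, None)                        # avoid division by zero
    return np.sum((p_emp - p_th) ** 2 / p_th)

res = minimize(chi2_distance, x0=[8.0, 1.5], method="Nelder-Mead")
mu_hat, sigma_hat = res.x
print("fitted lognormal parameters:", mu_hat, sigma_hat)
```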
The high spread between the two VaR values can cause a problem in allocating an unbiased capital requirement. In the next sections, we will apply Bayesian inference techniques, thus joining our internal observations with the experts' opinions.
2.1. Modelling Frequency Distribution
We are going to work with the Poisson and lognormal distributions, since they are the most commonly used distributions in operational risk [10].
Consider the annual number of events N for a risk in a bank, modelled as a random variable from the Poisson distribution $Poisson(\lambda)$, with a conjugate $Gamma(a_0, b_0)$ prior on λ (shape $a_0$, rate $b_0$).


Table 1. Scenario analysis.
We have: $P(N = k \mid \lambda) = e^{-\lambda}\,\frac{\lambda^{k}}{k!},\ k = 0, 1, 2, \dots$

As for the likelihood function, given the assumption that the observed counts $n_1, \dots, n_T$ are independent, we have $L(\lambda \mid n_1, \dots, n_T) = \prod_{i=1}^{T} e^{-\lambda}\,\frac{\lambda^{n_i}}{n_i!}$, where T is the number of observation periods and $n_i$ the number of historical losses in period i.
Thus, the posterior density would be $\pi(\lambda \mid n_1, \dots, n_T) \propto \lambda^{a_0 + \sum_i n_i - 1}\, e^{-(b_0 + T)\lambda}$, which is a $Gamma\!\left(a_0 + \sum_i n_i,\ b_0 + T\right)$ density. So we obtain again a Gamma distribution: combining a Gamma prior with a Poisson likelihood yields a Gamma posterior.
To apply this, and since the only unknown parameter is λ, we need to determine the prior parameters $a_0$ and $b_0$ from the experts' opinions.

The experts may estimate the expected number of events, but cannot be certain of the estimate. Our experts therefore specify the expected number of events, $E[\lambda] = a_0/b_0$, together with a measure of the uncertainty surrounding it (for instance, a coefficient of variation or an interval in which λ is believed to lie with a given probability), which provides a second equation in $a_0$ and $b_0$. Solving the above equations would give us the prior distribution parameters $a_0$ and $b_0$.
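As a concrete illustration, a minimal sketch of this calibration and of the conjugate update follows; it assumes the expert gives a mean and a coefficient of variation for λ and uses the rate parameterization of the Gamma distribution, with placeholder numbers rather than this study's values.

```python
# Sketch: derive a Gamma(a0, b0) prior (rate parameterization) from an expert's
# expected frequency and coefficient of variation, then update it with observed
# monthly counts. All numbers are placeholders.
import numpy as np

expert_mean = 6.0   # expert's expected number of events per period (hypothetical)
expert_cv = 0.5     # expert's coefficient of variation on lambda (hypothetical)

# For a Gamma(a0, b0) prior: E[lambda] = a0/b0 and Vco[lambda] = 1/sqrt(a0)
a0 = 1.0 / expert_cv**2
b0 = a0 / expert_mean

# Internal data: monthly loss counts over the observation period (placeholder)
monthly_counts = np.random.default_rng(0).poisson(5.8, size=48)

# Conjugate update: posterior is Gamma(a0 + sum(counts), b0 + number of periods)
a_post = a0 + monthly_counts.sum()
b_post = b0 + len(monthly_counts)
print("posterior mean of lambda:", a_post / b_post)
```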



• (a) Using the estimated posterior $Gamma\!\left(a_0 + \sum_i n_i,\ b_0 + T\right)$, generate a value of the frequency parameter λ.
• (b) Generate the number of monthly losses N from the frequency distribution $Poisson(\lambda)$.
• (c) Generate N losses from the severity distribution and sum them.
• (d) Repeat steps b and c for the 12 months of the year and sum the monthly losses to obtain the annual loss.
• (e) Repeat steps a to d many times (in our case $10^5$) to obtain the annual aggregate loss distribution.
• (f) The VaR is calculated by taking the 99.9th percentile of the aggregate loss distribution (a sketch of this simulation is given after the list).
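A minimal sketch of these steps, with placeholder posterior and severity parameters rather than the fitted values of this study, could look as follows.

```python
# Sketch: aggregate annual loss simulation with lambda drawn from its posterior.
# Posterior and severity parameters below are placeholders.
import numpy as np

rng = np.random.default_rng(1)
n_sims = 100_000
mu, sigma = 8.0, 1.5            # hypothetical lognormal severity parameters
a_post, b_post = 280.0, 48.0    # hypothetical posterior Gamma parameters

annual_losses = np.empty(n_sims)
for i in range(n_sims):
    lam = rng.gamma(shape=a_post, scale=1.0 / b_post)                 # step a
    monthly_counts = rng.poisson(lam, size=12)                        # steps b, d
    severities = rng.lognormal(mu, sigma, size=monthly_counts.sum())  # step c
    annual_losses[i] = severities.sum()

var_999 = np.quantile(annual_losses, 0.999)                           # step f
print("99.9% VaR:", var_999)
```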
We notice that the resulting Value-at-Risk is close to the VaR generated by the internal losses alone, since the only parameter treated as unknown was the frequency parameter λ.



2.2. Modelling Severity Distribution
Assume that the loss severity for a risk is modelled as a random variable from a lognormal distribution $LN(\mu, \sigma)$, so that $\ln X \sim N(\mu, \sigma^{2})$, with σ known and μ the only unknown parameter. Taking a conjugate $N(\mu_0, \sigma_0^{2})$ prior for μ, we know that the posterior distribution will also follow a normal distribution, $N(\mu_n, \sigma_n^{2})$. By identification we obtain
$$\mu_n = \frac{\mu_0/\sigma_0^{2} + \sum_{i=1}^{n} \ln x_i/\sigma^{2}}{1/\sigma_0^{2} + n/\sigma^{2}}, \qquad \sigma_n^{2} = \frac{1}{1/\sigma_0^{2} + n/\sigma^{2}},$$
where $x_1, \dots, x_n$ are the observed internal losses.
As for the frequency parameter, the prior parameters are obtained from the experts. Since the only unknown parameter is μ, our experts provide an estimate of the typical loss size together with a measure of the uncertainty surrounding it, which translates into two equations in $\mu_0$ and $\sigma_0$. Solving these two equations, we find that the prior distribution of μ is $N(\mu_0, \sigma_0^{2})$. Hence, using the formulas stated above with the internal losses, we find the posterior distribution $N(\mu_n, \sigma_n^{2})$.
At the end, using the posterior distribution of μ together with the internal estimates of the remaining parameters, we run the Monte Carlo simulation of Section 2.1 and obtain the corresponding Value-at-Risk. The same analysis applies here as well: since the only unknown parameter is μ, the resulting VaR stays close to the one generated by the internal losses alone.
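A minimal sketch of this conjugate update for μ, assuming σ is known and using placeholder prior values and simulated log-losses, is shown below.

```python
# Sketch: conjugate Normal update for mu (sigma treated as known), combining a
# Normal(mu0, sigma0^2) expert prior with log-losses. All values are placeholders.
import numpy as np

rng = np.random.default_rng(2)
sigma = 1.5                      # severity sigma, treated as known (placeholder)
mu0, sigma0 = 9.0, 0.5           # hypothetical expert prior on mu
log_losses = np.log(rng.lognormal(mean=8.0, sigma=sigma, size=279))

n = len(log_losses)
precision_post = 1.0 / sigma0**2 + n / sigma**2
mu_post = (mu0 / sigma0**2 + log_losses.sum() / sigma**2) / precision_post
sigma_post = np.sqrt(1.0 / precision_post)
print("posterior of mu: Normal(%.3f, %.3f^2)" % (mu_post, sigma_post))
```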

2.3. Modelling Frequency and Severity Distributions
In the two previous subsections, we illustrated the cases of modelling the frequency and the severity distributions with, respectively, an unknown λ and an unknown μ. In this subsection we treat both λ and μ as unknown. Joining these two distributions is relatively simple, since the hypothesis of independence between frequency and severity allows us to estimate the two posterior distributions, and hence the parameters, independently.
As such, we have already derived the posterior densities: a Gamma distribution for λ and a normal distribution for μ, calculated exactly as in Sections 2.1 and 2.2.

By simulating these two laws using Monte Carlo simulation (cf. Section 2.1), we obtain a Value-at-Risk of 1,199,000.00 using the estimated posterior Gamma and normal distributions.
The result is interesting: with the two parameters λ and μ treated as unknown, the capital requirement already reflects a combination of the internal data and the experts' opinions.



The general case, where all parameters are unknown, will not be treated in this section, since it is more complex to tackle with conjugate prior distributions.
2.4. Sensitivity Analysis
Working with this method is generally simple since a conjugate prior is involved, yet one of the main questions is how to ensure that the experts' opinions are consistent and relevant and capture the situation well, since they may otherwise introduce a model error. In this work we did not take this aspect into consideration and the experts' opinions were treated as correct. To improve our results, we can run a sensitivity test on our prior parameters. On the other hand, we only have the mean of the number of losses given by our expert, so it appears difficult to obtain the full distribution of λ from the expert alone.
We are going to test our prior parameters given by the experts and highlight the direct consequence on our capital requirement. To start with the first case, where we are working with an unknown λ, we vary the prior parameters $a_0$ and $b_0$ (Figure 1).



As for the case of an unknown μ, we vary the prior parameters $\mu_0$ and $\sigma_0$ (Figure 2).



The following figures show the stability of our Value-at-Risk calculations regardless of the changes in our prior parameters.
3. MCMC-Metropolis Hastings Algorithm
In this section, we will use a non-informative prior, more particularly the Jeffreys prior [11], which attempts to represent a near-total absence of prior knowledge and is proportional to the square root of the determinant of the Fisher information, $\pi(\theta) \propto \sqrt{\det I(\theta)}$,
Figure 1. Sensitivity for a0 and b0 respectively.
Figure 2. Sensitivity for μ0 and σ0 respectively.
where $I(\theta)_{ij} = -\mathbb{E}\!\left[\frac{\partial^{2} \ln f(X \mid \theta)}{\partial \theta_i\, \partial \theta_j}\right]$ is the Fisher information matrix (see Appendix 5.1).
Then we are going to apply an MCMC model to obtain a distribution for the parameters and generate our capital requirement at 99.9%. This will allow us to compare both methods' results and develop a generalized application to set up our capital allocation, since no restriction is made regarding the distributions. We treat the parameters successively: first the frequency parameter λ, then the severity parameter μ, and finally the general case where all parameters are unknown.
3.1. MCMC with Poisson(λ) Distribution
Assuming that the parameter λ is the only unknown, the Jeffreys prior for the Poisson distribution is $\pi(\lambda) \propto \lambda^{-1/2}$ (see Appendix 5.1), so that the posterior is $\pi(\lambda \mid n_1, \dots, n_T) \propto \lambda^{\sum_i n_i - 1/2}\, e^{-T\lambda}$. So, applying the Metropolis-Hastings algorithm (check Appendix 5.2.1 for the detailed algorithm) with this posterior as the objective density and with a uniform proposal density, we simulate the chain shown in Figure 3.

We have removed the first 3000 iterations so that the chain is stationary (burn-in effect) [13].
Figure 3. MCMC for the parameter λ.
We obtain a 99.9% Value-at-Risk of 1,000,527.00.
The result is close to the VaR obtained with the conjugate-family approach.
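For illustration, a minimal sketch of such a random-walk Metropolis-Hastings sampler for λ under the Jeffreys prior is given below; the monthly counts, proposal width and chain length are placeholders.

```python
# Sketch: random-walk Metropolis-Hastings for lambda with the Jeffreys prior
# pi(lambda) ~ 1/sqrt(lambda) and a Poisson likelihood. Data are placeholders.
import numpy as np

rng = np.random.default_rng(3)
counts = rng.poisson(5.8, size=48)       # placeholder monthly counts

def log_posterior(lam):
    if lam <= 0:
        return -np.inf
    # Poisson log-likelihood (constants dropped) plus log of the Jeffreys prior
    return np.sum(counts * np.log(lam) - lam) - 0.5 * np.log(lam)

n_iter, burn_in, width = 20_000, 3_000, 0.5
chain = np.empty(n_iter)
lam = counts.mean()                       # initial value
for t in range(n_iter):
    proposal = lam + rng.uniform(-width, width)          # symmetric uniform proposal
    if np.log(rng.uniform()) < log_posterior(proposal) - log_posterior(lam):
        lam = proposal
    chain[t] = lam

posterior_sample = chain[burn_in:]        # drop burn-in iterations
print("posterior mean of lambda:", posterior_sample.mean())
```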
3.2. MCMC with the Lognormal(μ, σ) Distribution
Assuming now that the severity parameter μ is the unknown, we use the Jeffreys prior for the lognormal distribution (see Appendix 5.1), thus finding the posterior distribution of μ given the internal losses. So, applying the Metropolis-Hastings algorithm (check Appendix 5.2.2 for the detailed algorithm) with this posterior as the objective density and with a uniform proposal density, we simulate the chain shown in Figure 4.

We obtain a Value-at-Risk of 1,167,060.00.
Comparing this with the same case generated with the conjugate prior, we can see that the two values are close.
In the next subsection, we will tackle the general case where all parameters are unknown; this case was not treated with conjugate prior distributions since it would be more complicated.
3.3. MCMC: The General Case
We are going to assume the general case, where all the parameters $(\lambda, \mu, \sigma)$ are unknown, each with its Jeffreys prior (see Appendix 5.1).





Figure 4. MCMC for the parameter μ.
So, applying the Metropolis-Hastings algorithm (check Appendix 5.2.3 for the detailed algorithm) with the resulting joint posterior as the objective density and with a uniform proposal density for each parameter, we simulate the chains of the three parameters.





We have removed the first 3000 iterations here as well, so that the chains are stationary (burn-in effect). We obtain a Value-at-Risk of 3,061,151.00.
The general case clearly generates a good combination between internal data and experts' opinions, with a capital requirement of $3,061,151.
3.4. Confidence Interval Calculation
To recapitulate all the calculations, Table 2 summarizes all the Value-at-Risk figures generated. As for the calculation of the confidence interval, since we are working with order statistics, the interval $[X_{(r)}, X_{(s)}]$ covers the true quantile $q_p$ with probability $\sum_{k=r}^{s-1}\binom{n}{k} p^{k}(1-p)^{n-k}$, where $X_{(1)} \le \dots \le X_{(n)}$ are the ordered simulated annual losses [14].
In our calculations, we took $p = 0.999$ and chose the ranks r and s around $np$ so as to reach the desired coverage probability.
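As an illustration of this interval construction, the following sketch selects the ranks r and s for the 99.9% quantile from a simulated sample; the losses and the 95% coverage target are placeholders.

```python
# Sketch: nonparametric confidence interval for the 99.9% quantile using order
# statistics, with exact binomial coverage. Simulated losses are placeholders.
import numpy as np
from scipy.stats import binom

rng = np.random.default_rng(4)
losses = np.sort(rng.lognormal(12.0, 1.0, size=100_000))   # placeholder sample
n, p = len(losses), 0.999

# Widen [r, s] around n*p until P(X_(r) <= q_p <= X_(s)) reaches ~95%
center = int(n * p)
r, s = center, center
while binom.cdf(s - 1, n, p) - binom.cdf(r - 1, n, p) < 0.95:
    r, s = max(r - 1, 1), min(s + 1, n)

coverage = binom.cdf(s - 1, n, p) - binom.cdf(r - 1, n, p)
print("VaR estimate:", losses[center - 1])
print("confidence interval:", losses[r - 1], losses[s - 1], "coverage:", round(coverage, 4))
```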





Table 2 clearly shows the usefulness of the Bayesian inference techniques. The results of both methods are close and comparable; the conjugate prior approach is simple but the distributions are restricted to the conjugate family, whereas with the Jeffreys non-informative prior and the MCMC Metropolis-Hastings algorithm we have wider options and generate a good combination between internal data and experts' opinions.
In addition, we are going to use the Monte Carlo confidence intervals approach [15] in order to compare it with the previous approach and confirm the similarity of the results.
Consider a parameter X with its consistent estimator Y, with cumulative distribution function F. The approach consists of generating a large number of independent Monte Carlo replicates of Y, ordering them, and taking as interval limits the order statistics whose ranks correspond to the chosen confidence level; when these ranks are not integers, we can round them to the nearest integer values or even use linear interpolation.
Figure 5. MCMC for the parameters μ and σ.
Table 2. Value at risk and confidence intervals for all cases treated.
We seek to calculate the actual confidence level attained by the interval $[X_{(r)}, X_{(s)}]$, namely $F(X_{(s)}) - F(X_{(r)})$, where F is the true distribution function. Since $F(X_{(k)})$ is distributed as a uniform order statistic, this actual confidence level has a beta distribution with parameters $s - r$ and $n - (s - r) + 1$, and its quantiles give an interval for the actual confidence level around the nominal one.
In our case, by using the confidence level of 99.9% and by applying the previous calculations, we obtained an approximate 95% interval for the actual confidence level, close to the previous interval calculation in Table 2.
Furthermore, Figure 6 and Figure 7 illustrate the calculation of our Value-at-Risk for different confidence levels. Figure 6 clearly shows that the general-case curve lies between the internal-data and scenario-analysis curves. On the other hand, the conjugate prior cases in Figure 7 point out the closeness of the curves, which supports our previous analysis that the conjugate-prior estimates remain close to those based on internal data alone.
We have to note as well that the concept of scenario analysis based on expert opinion deserves more clarification. Roughly speaking, when we refer to experts' judgments, we express the idea that banks' experts and experienced managers have some reliable intuitions on the riskiness of their business and that these intuitions are not entirely reflected in the bank's historical, internal data. In our case, experts' intuitions were directly plugged into the severity and frequency estimations through the building of a loss histogram.


Figure 6. Different VaR calculation for all MCMC cases, internal data and scenario analysis.
Figure 7. Different VaR calculation for all conjugate prior cases.
3.5. Bayesian Approach Reviewed
In this part, we are going to replace the experts' opinions by assuming that the experts' parameters are set using the Basel II Standardized Approach calculation. Hence, the experts' opinion becomes questionable, in the sense that when it is used in this way we shift the inputs of the Markov chain process, which can cause problems.
3.6. Standardized Approach Reviewed
In the Standardized Approach (SA), banks' activities are divided into 8 business lines [16] [17]: corporate finance, trading & sales, retail banking, commercial banking, payment & settlement, agency services, asset management, and retail brokerage. Within each business line, there is a specified general indicator that reflects the size of the bank's activities in that area. The capital charge for each business line is calculated by multiplying gross income by a factor β assigned to that business line (see Table 3).
The total capital charge is calculated as a three-year average over all positive annual gross income (GI) figures as follows:
$$K_{SA} = \frac{\sum_{\text{years } 1\text{-}3} \max\!\left[\sum_{j=1}^{8} GI_j \times \beta_j,\ 0\right]}{3}.$$
Hence, the application of the Standardized Approach generates the capital requirement that we treat as the experts' benchmark in what follows.
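A minimal sketch of this calculation, using the Basel II beta factors of Table 3 and placeholder gross income figures, is given below.

```python
# Sketch of the Basel II Standardized Approach capital charge: three-year average
# of the yearly sums of gross income times the business-line betas, floored at
# zero each year. Gross income figures are placeholders.
import numpy as np

betas = {
    "corporate finance": 0.18, "trading & sales": 0.18, "retail banking": 0.12,
    "commercial banking": 0.15, "payment & settlement": 0.18,
    "agency services": 0.15, "asset management": 0.12, "retail brokerage": 0.12,
}

# Hypothetical gross income per business line over years 1-3
gi = {line: np.array([50e6, 55e6, 60e6]) for line in betas}

yearly_charge = sum(beta * gi[line] for line, beta in betas.items())
capital_sa = np.maximum(yearly_charge, 0.0).sum() / 3.0
print("Standardized Approach capital:", capital_sa)
```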
Numerical Results for Expert Opinion Treated as SA
Setting the parameters so as to reproduce the same Standardized Approach capital requirement, and treating them as the expert parameters, gave us the results reported in Table 4.


We note that the Standardized Approach from Basel II is the benchmark to rely on when it comes to calculating the VaR with its confidence interval. It is interesting to compare the results in Table 2 and Table 4, where we notice
Table 3. Business lines and the beta factors.
Table 4. Value at risk and confidence intervals for all cases treated.
that the VaR results in the cases with Bayesian unknown parameters are strongly influenced by this change in the experts' parameters.






4. Conclusions
Using the information given by the experts, we were able to determine all the parameters of our prior distributions, leading to the posterior distributions with the use of internal data, which allowed us to compute our own capital requirement. This approach offers a major simplicity in its application through the employment of conjugate distributions, allowing us to obtain explicit formulas for our posterior parameters. Yet, the application of this approach cannot be generalized, since it is restricted to the conjugate family.
On the other hand, the Jeffreys prior with the MCMC Metropolis-Hastings algorithm provided us with wider options and generated a satisfactory result regarding all three unknown parameters λ, μ and σ.



Yet, treating the experts' outputs the same as Basel's operational risk Standardized Approach illustrated the necessity of paying attention to the judgments given. In our application, judgments were needed to make sensible choices, but these choices influence the results. Understanding this influence should be an important aspect of capital calculations, since it creates an estimation risk that highly influences our capital requirement; see [18] on judgment under uncertainty. Moreover, we did not take external data into consideration, which might be interesting to elaborate and apply in practice; more on this subject can be found in [19].
References
- Shevchenko, P.V. (2011) Modelling Operational Risk Using Bayesian Inference. Springer, Berlin. http://dx.doi.org/10.1007/978-3-642-15923-7
- Robert, C.P. (2007) The Bayesian Choice: From Decision-Theoretic Foundations to Computational Implementation. 2nd Edition, Springer, Berlin.
- Solvency II (2010) Quantitative Impact Study 5 (Final Technical Specifications). http://www.eiopa.europa.eu
- Gamonet, J. (2006) Modélisation du risque opérationnel dans l’assurance. Mémoire d’actuaire, CEA.
- Berger, J.O. (1985) Statistical Decision Theory and Bayesian Analysis. 2nd Edition, Springer, New York (NY). http://dx.doi.org/10.1007/978-1-4757-4286-2
- Moosa, I.A. (2007) Operational Risk Management. Palgrave Macmillan, Basingstoke. http://dx.doi.org/10.1057/9780230591486
- Moosa, I.A. (2008) Quantification of Operational Risk under Basel II: The Good, Bad and Ugly. Palgrave Macmillan, Basingstoke. http://dx.doi.org/10.1057/9780230595149
- Santos, C.H., Kratz, M. and Munoz, F.V. (2012) Modelling Macroeconomic Effects and Expert Judgements in Operational Risk: A Bayesian Approach. Research Center ESSEC Working Paper 1206.
- Frachot, A., Georges, P. and Roncalli, T. (2001) Loss Distribution Approach for Operational Risk. Groupe de Recherche Opérationelle, Crédit Lyonnais, France.
- Shevchenko, P.V. and Wüthrich, M.V. (2006) The Structural Modelling of Operational Risk via Bayesian Inference: Combining Loss Data with Expert Opinions. The Journal of Operational Risk, 1, 3-26.
- Jeffreys, H. (1946) An Invariant form for the Prior Probability in Estimation Problems. Proceedings of the Royal Society of London, Series A, Mathematical and Physical Sciences, 186, 453-461. http://dx.doi.org/10.1098/rspa.1946.0056
- Hastings, W.K. (1970) Monte Carlo Sampling Methods Using Markov Chains and Their Applications. Biometrika, 57, 97-109. http://dx.doi.org/10.1093/biomet/57.1.97
- Gilks, W.R., Richardson, S. and Spiegelhalter, D. (1996) Markov Chain Monte Carlo in Practice. Chapman & Hall, London.
- David, H.A. and Nagaraja, H.N. (2003) Order Statistics. 3rd Edition, John Wiley & Sons Inc., New York. http://dx.doi.org/10.1002/0471722162
- Buckland, S.T. (1985) Monte Carlo Confidence Intervals. Journal of the Royal Statistical Society, Series C (Applied Statistics), 34, 296-301.
- Basel Committee on Banking Supervision (2006) International Convergence of Capital Measurement and Capital Standards. BCBS, 144-146. http://www.bis.org
- Basel Committee on Banking Supervision (2002) Quantitative Impact Study 3 Technical Guidance. http://www.bis.org/bcbs/qis/qis3tech.pdf
- Tversky, A. and Kahneman, D. (1974) Judgment under Uncertainty: Heuristics and Biases. Science, New Series, 185, 1124-1131. http://dx.doi.org/10.1126/science.185.4157.1124
- Lambrigger, D.D., Shevchenko, P.V. and Wüthrich, M.V. (2008) Data Combination under Basel II and Solvency 2: Operational Risk Goes Bayesian. Bulletin Français d'Actuariat, 8, 4-13.
5. Appendix
5.1. Jeffreys Prior Distribution
Jeffreys prior attempts to represent a near-total absence of prior knowledge; it is proportional to the square root of the determinant of the Fisher information, $\pi(\theta) \propto \sqrt{\det I(\theta)}$, where $I(\theta)_{ij} = -\mathbb{E}\!\left[\frac{\partial^{2} \ln f(X \mid \theta)}{\partial \theta_i\, \partial \theta_j}\right]$.
5.1.1. Jeffreys Prior for Poisson(λ) and Lognormal(μ, σ) Distributions
Let $N \sim Poisson(\lambda)$. Its Fisher information is $I(\lambda) = 1/\lambda$, and consequently the Jeffreys prior is $\pi(\lambda) \propto \lambda^{-1/2}$.
Let $X \sim LN(\mu, \sigma)$, so that $Y = \ln X \sim N(\mu, \sigma^{2})$. Hence, by letting $\theta = (\mu, \sigma)$, the Fisher information matrix is $I(\theta) = \operatorname{diag}(1/\sigma^{2},\ 2/\sigma^{2})$. As a consequence, the Jeffreys prior is $\pi(\mu, \sigma) \propto \sqrt{\det I(\theta)} \propto 1/\sigma^{2}$.
5.2. MCMC Metropolis-Hastings Algorithm
5.2.1. Applying MCMC with Metropolis-Hastings Algorithm for λ
• Initialize $\lambda^{(0)}$, for example at the empirical mean of the observed counts.
• Update from $\lambda^{(t-1)}$ to $\lambda^{(t)}$, $t = 1, 2, \dots$, by:
- Generating a candidate $\lambda^{*}$ from the proposal density centred at $\lambda^{(t-1)}$;
- Defining the acceptance probability $\alpha = \min\!\left(1, \frac{\pi(\lambda^{*} \mid \text{data})}{\pi(\lambda^{(t-1)} \mid \text{data})}\right)$, the uniform proposal being symmetric;
- Generating $u \sim U(0, 1)$;
- If $u \le \alpha$, setting $\lambda^{(t)} = \lambda^{*}$; otherwise setting $\lambda^{(t)} = \lambda^{(t-1)}$.
• Remove the first 3000 iterations, so that the chain is stationary (burn-in effect).
5.2.2. Applying MCMC with Metropolis-Hastings Algorithm for μ
• Initialize $\mu^{(0)}$, for example at the empirical mean of the log-losses.
• Update from $\mu^{(t-1)}$ to $\mu^{(t)}$, $t = 1, 2, \dots$, by:
- Generating a candidate $\mu^{*}$ from the proposal density centred at $\mu^{(t-1)}$;
- Defining the acceptance probability $\alpha = \min\!\left(1, \frac{\pi(\mu^{*} \mid \text{data})}{\pi(\mu^{(t-1)} \mid \text{data})}\right)$;
- Generating $u \sim U(0, 1)$;
- If $u \le \alpha$, setting $\mu^{(t)} = \mu^{*}$; otherwise setting $\mu^{(t)} = \mu^{(t-1)}$.
• Remove the first 3000 iterations, so that the chain is stationary (burn-in effect).
5.2.3. Applying MCMC with Metropolis-Hastings Algorithm for ω = (μ, σ)
• Initialize $\mu^{(0)}$ and $\sigma^{(0)}$, for example at the empirical mean and standard deviation of the log-losses.
• Update from $\omega^{(t-1)} = (\mu^{(t-1)}, \sigma^{(t-1)})$ to $\omega^{(t)}$, $t = 1, 2, \dots$, by:
- Generating a candidate $\omega^{*} = (\mu^{*}, \sigma^{*})$ from the proposal densities centred at $\mu^{(t-1)}$ and $\sigma^{(t-1)}$;
- Defining the acceptance probability $\alpha = \min\!\left(1, \frac{\pi(\omega^{*} \mid \text{data})}{\pi(\omega^{(t-1)} \mid \text{data})}\right)$;
- Generating $u \sim U(0, 1)$;
- If $u \le \alpha$, setting $\omega^{(t)} = \omega^{*}$; otherwise setting $\omega^{(t)} = \omega^{(t-1)}$.
• Remove the first 3000 iterations from both chains, so that the chains are stationary (burn-in effect). A sketch of these steps is given below.
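The following is a minimal sketch of these steps, with the Jeffreys prior $\pi(\mu, \sigma) \propto 1/\sigma^{2}$, a lognormal likelihood, and placeholder data and proposal widths.

```python
# Sketch: joint random-walk Metropolis-Hastings for omega = (mu, sigma), using
# the Jeffreys prior pi(mu, sigma) ~ 1/sigma^2. Data and widths are placeholders.
import numpy as np

rng = np.random.default_rng(5)
log_losses = rng.normal(8.0, 1.5, size=279)    # placeholder log-losses

def log_posterior(mu, sigma):
    if sigma <= 0:
        return -np.inf
    n = len(log_losses)
    loglik = -n * np.log(sigma) - np.sum((log_losses - mu) ** 2) / (2 * sigma**2)
    return loglik - 2.0 * np.log(sigma)         # add log of the Jeffreys prior

n_iter, burn_in = 20_000, 3_000
chain = np.empty((n_iter, 2))
mu, sigma = log_losses.mean(), log_losses.std()
for t in range(n_iter):
    mu_new = mu + rng.uniform(-0.2, 0.2)        # symmetric uniform proposals
    sigma_new = sigma + rng.uniform(-0.1, 0.1)
    if np.log(rng.uniform()) < log_posterior(mu_new, sigma_new) - log_posterior(mu, sigma):
        mu, sigma = mu_new, sigma_new
    chain[t] = mu, sigma

mu_post, sigma_post = chain[burn_in:].mean(axis=0)   # drop burn-in iterations
print("posterior means of mu and sigma:", mu_post, sigma_post)
```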




























































