‘Technical’ Category
Learning the Difference Between Risk and Uncertainty, or not
Update: if you’re looking for an academic distinction, see: The Difference between Risk and Uncertainty. In the post below, we criticize firms for not changing their practices despite the recent failures of their estimations, methodologies, and models. Of course, we think that both are worth reading.
Every Monday morning for the past several years, we’ve received an e-mail from http://jobs.phds.org that lists available positions according to our specifications, which are:
“Send Weekly emails containing jobs…
…for PhDs in: Business /Finance /Economics
…of types: Contract /Project /Temporary, Employee, Non-tenure-track faculty, Postdoctoral researcher, Tenure-track /tenured faculty
…in sectors: all
…located: in United States
…with keywords: none
Generally, there’s 20 — 40 positions listed each week, and most of those involve quantitative finance, usually in the NYC area. For the past year or so, we’ve been particularly interested to see if the job descriptions would change given the failure of many quantitative trading strategies, modeling techniques and risk measures. (Yeah, we know they didn’t actually “fail.” Recent results were just plain bad luck that no one could have predicted. The models worked perfectly, except when they didn’t.)
Unfortunately, our parenthetical sarcasm seems to be the implicit position of many financial firms – without the sarcasm, of course. We say because we haven’t observed any change in the posted job descriptions in the jobs.phds.org emails or any of the other ones that we receive from recruiters who regularly send similar descriptions.
Now, we’ve been meaning to write about this observation for a few months but were finally motivated to do so because of several other items we read this morning, including two opinion columns and one article.
The article, Computer-Trading Models Meet Match in The Wall Street Journal, describes how several algorithmic-based hedge funds have lost money recently because of “the recent high volatility.” So, we guess their models aren’t flawless.
One of the op-ed pieces is by L. Gordon Crovitz, and it is also in the Journal: In Finance, Too, Learning Entails Risk. In it, Mr. Crovitz attempts to relate “financial engineering” to other types of engineering, e.g., mechanical engineering, and he seems to imply that it’s still a young discipline; so, give it time, but we think that his argument ultimately fails and is unconvincing.
That’s because “financial engineering” isn’t really engineering, which we’d define as the thoughtful application of science or technology to (or in) a well-understood, physical environment. Finance is a subset of a social “science.”
Mr. Crovitz writes in his last paragraph that: “The measure of innovators is not in the mistakes they make, but in the lessons they learn. We now know that our complex markets need better models, which should include more humility, acknowledging that some risks are still too uncertain to measure and should be avoided.” We’d argue with the “still too” in the last sentence as we doubt that such social uncertainty can be resolved or precisely measured. (By the way, we also disagree with his conclusion in that sentence that “some risks…should be avoided.” We have no problem with folks taking wild or uncertain gambles; however, we see no reason that we should subsidize their losses when those gambles go bad.)
To his main point, however, we don’t see much learnin’ goin’ on. It seems to be business as usual at many firms and funds.
A much more critical op-ed piece is by Michael Barone, and it’s entitled ‘Formulas’ for certain failure, and his first sentence is “Beware of geeks bearing formulas.” He discusses (and criticizes) financial models, global warming/climate change models, and health-care models, and it reads much like our post from six months ago, Global Warming and the Mortgage Crisis. Remember that this is Michael Barone, who is very well-known for using statistical data in the analysis of politics and demographics.
As usual, we point new readers to our essay, Uncertainty Management, which details our perspective and philosophy on these issues as well as any number of related posts: see our blog archives. The main point is that not all uncertainty is measurable, i.e., that measurable uncertainty, or risk, is a proper subset of uncertainty and unknowing. (In other words, specific mathematical conditions must be met for uncertainty to be risk. So, uncertainty is a more general term, i.e., all risk involves uncertainty, but not everything that is uncertain is risky because not all uncertainty is measurable, which a specific mathematical definition.)
As we read the evidence, many institutions and their ‘quants’ will continue to solve mis-specified risk problems, because they don’t know how to treat more diffuse and difficult uncertainty problems; so, they assume them away and treat them as risk problems. We’re clearly not underestimating the difficulties these folks face nor the necessity of making trade-offs, but we’re not sure if they understand the nature of the problem or trade-off. As we’ve written many times before, if they don’t understand them, then they are ignorant, and if they do, then they are cynical., e.g., Our Eternal Question: Cynical or Naïve? Neither charactistic is appealing or useful.
Ignoring the larger epistemological issues and the problem of induction, here’s a simple example of the difficulty of making inferences and finding useful information. Even when a distribution can be perfectly known, it’s moments – like the mean and variance – need not exist. (Look a Cauchy distributions and, more generally, certain stable distributions. While one can calculate historical means and variances from a time series, those “estimates” may be nonsensical. (They can’t estimate something that doesn’t exist.) The arithmetic can be performed, but the notion is empty.
As we see it, too often if one has a (risk) hammer, then everything looks like a (risk) nail, and it’s easy to pound away, especially when the alternate is to admit that a solution doesn’t exist, which too often sounds like, “I don’t know.” So, while various numbers can be calculated – even calculated very precisely, earnestly, and diligently – to do so is to apply technology, but it’s not engineering nor is it very smart and it can be very harmful.
Calculating Counterparty Credit Reserves
Implied Risk Neutral Default Rates Versus Historical Default Rates
For some problems, there is no good or true solution, but something must be done or estimated. Such is the case with calculating credit reserves because real default rates can never be known, but risk-neutral implied or historical default rates can be calculated and used, but both are flawed.
Generally, when we discuss this topic, we have reduced-form models in mind (as opposed to structural ones, but there’s no shortage of assumptions in structural models, either).
We’ve written about implied default rates on several occasions, and we recently had a conversation with someone who mentioned that for trading credit reserve calculations, regulators are requiring firms to use implied default rates from risk-neutral pricing models rather than historical default rates. To be precise, these implied default rates would be derived/inferred from CDS (credit default swap) prices using risk-neutral models and any number of quite arbitrary assumptions.
Presumably, given recent high prices for protection – the credit default swaps – implied defaults rates are substantially higher than historical rates, and the regulators are just trying to be “conservative.” Oh well, so much for our motto of thought before calculation.
Now, it’s true that for all else equal, if investors are risk-averse, implied, risk neutral default rates will always be greater than actual default rates, and the “more” risk-averse investors are, the higher the implied, risk-neutral, probability of default. There’s a variety of ways it can be stated, but using CDS prices, we can roughly say that for a fixed gamble with a fixed probability of default, the more risk-averse the insurer, the higher the price required to compensate him for bearing that risk, the higher the price of guaranteeing default, the higher the risk-neutral probability of default.
As we mentioned above, in the real-world, actual (future) rates are never known, but sometimes, historical default rates can be used as proxies for actual rates, especially if the analyst believes that the environment is unchanged. Below, we’ll briefly explain why we think that is preferable to use implied, risk-neutral default rates.1
Estimating a Credit Reserve
Like a loan-loss reserve, which is a bank’s estimate of the expected loss of default by its borrowers, a trading organization must also calculate a credit reserve for its trades. For trading, that means making a guess or estimate of the expected loss associated with default (by the counterparty) when the trade is in one’s favor.2
Conditional Expected Values
Roughly, that means to calculate a credit reserve, it’s necessary to determine when the trade is in one’s favor and then assume or estimate the probabilities of that occurring over the life of the trade.3
By knowing that range of winning values and using estimates of the probabilities of those values, one can then calculate the conditional expected gain from the trade – the average trading gain given that one has gained (and not lost). (We’re thinking of a a discrete-time, single-period problem here.)
Ignoring collateral agreements for the moment, which would reduce the potential credit exposure when one is ahead, the conditional expected value represents the average amount that the other party will owe at the end of the trade (if it owes anything).
So, that conditional expected value is a reasonable estimate of the credit exposure at, say, the end of the accounting period. It’s very similar to the estimated utilization of a credit line at a future date, which is needed to calculate a loan-loss provision and reserve. Whether for loans or trades, one needs to estimate the exposure at default, which those in the industry abbreviate as EAD.
EADs, LGDs, and PDs
Once that expected exposure at default is estimated, one needs two more estimated values to calculate the counterparty credit reserve (for a single trade): the probability of default (PD) and the loss given default (LGD) rate.
The product of the three – the (EAD × LGD rate) × PD – is the reserve for that trade.4 Also, note that the product of the first two terms is the (expected) lost given default.
For completely collateralized trades, the loss given default is nearly zero. There are some timing issues, so sharp changes in values and the lags in posting collateral could create a small chance of loss, but that’s a relatively small level of exposure compared to a similar but uncollateralized trade.5
In the past, institutional LGD rates were particularly difficult to estimate because there were so few observations of bankruptcies of firms within particular industries and of particular sizes. (By the way, it’s worth noting that there is some evidence that banks recover more than bond-holders – banks are better organized for the contingency – so they have lower LGD rates.)
Now, because LGD rates are difficult to estimate, they’re usually assumed. With an assumption about the LGD rate, one can then solve for the default rate in most risk-neutral models. For more on this topic, please see our post from last summer: Implied Default Probabilities and Risk Neutral Models, particularly the graph that shows a simple relationship between the LGD rate and the implied default rate (in a simple model).
Now we have enough to compare two options for proxies of default rates that firms can use to calculate credit reserves.
Of course, and to reiterate, we’d like to know the actual, true default rate during some future period for a firm, but we can never know that for sure. (Actually, we’d like to know if the trade is a winner and whether the counterparty will default.) In fact, for a single firm, after that future period, we’ll know if the firm defaulted or not, but we won’t know the true probability of default, and even with a cross-section of firms, we’ll be able to calculate a realized average rate, but that will be only one possible average rate, not the true average rate so-to-speak. So, we can estimate (1) a historical average default rate across firms, which may or may not be stationary and/or meaningful, or (2) an implied default rate, which depends upon any number of assumptions, including an assumption about the LGD rate, and which–by definition–is hypothetical and does not reflect reality.
Which One is Better?
Of the two, we’d prefer to use actual observed rates rather than implied, risk-neutral rates. Why? For a few reasons. First, there are settings in which the observed historical default rate is a reasonably proxy of the unknown, “true” default rates. That’s never the case with risk-neutral, implied default rates, which could only equal “true” rates if investors were risk-neutral, which they are not. (Some of our related posts provide examples of this difference. They are stark and easy to follow.)
Second, the credit reserve is supposed to represent the expected loss – or discounted expected loss in a multi-period setting – not the price of the expected loss. (Risk neutral models permit prices to be viewed as expected values, rather than as expected utilities (of unknown utility functions). That’s the benefit of risk-neutral models; they “simplify” the math.) By definition, the reserve is the bank’s best guess of its expected loss over some time horizon. It’s not the price the bank would pay to eliminate default risk. Those are two clearly separate notions, and the difference would be the risk premium.
Third, as we mentioned two paragraphs above, using observed or historical rates does require assumptions about the validity of the past representing the future. That’s a huge problem – the Problem of Induction – but in our mind that’s cleaner – and more likely to remain in one’s mind – than are the many additional, specific, mathematical assumptions to derive/infer risk-neutral default rates, the LGD.
As we mentioned at the beginning of the post, there’s no good solution, but we think that using historical rates is the better solution. We think that when others disagree it’s because they think that implied, risk-neutral rates are more than the really are, i.e., the “market’s” estimate of default rates – not the “market’s” estimate of default rates under a risk-neutral measure, which means that they’re hypothetical, not real.
We’ll likely edit this post during the next few days.
Copyright © 2009 Spero Consulting.
Footnotes:
- Here’s a common fallacy in the field: often, to solve a challenging and interesting problem, it’s necessary to perform a large number of tedious and possibly complicated calculations. However, performing a bunch of tedious and possibly complicated calculations does not ensure that an interesting or challenging problem has been solved. Too often, folks confuse the two. ↩
- We’re thinking of a simple formula for a single counterparty rather than for a portfolio. ↩
- That’s true regardless of the nature of the underlying or traded variable, i.e., equity prices, interest rates, commodities, etc. Note, that we’ll ignore the whole problem of finding these probabilities, which follows a process similar to finding implied default probabilities. Likewise, we’ll assume that default probabilities are unrelated to the gain from the trade and probability that the trade is in one’s favor – that the gains and counter-party default probabilities are independent. ↩
- Remember, we’re considering a very simple, single-period case and ignoring discounting. If it’s a multi-period setting, the problem isn’t much different, but there is more discounting, multiplication, and addition. See Good Column, Bad Math for a discussion of analogous probabilities through time. ↩
- In addition, any entity guaranteed by the government would have a LGD rate of nearly zero. It could be slightly positive if it took a long time to be made whole. ↩
Multi-period Bond Price Implied Default Rates and CDS
Implied Under the Assumption of Risk Neutrality
We have several posts related to the calculation of price-implied default rates under the assumption of risk neutrality and several posts related to simple CDS calculations.
Those posts have involved discrete, single-period problems, where there are only two dates of interest: today and a future date where an uncertain claim or cash flow will be realized, i.e., when bankruptcy would occur.
We’ve focused on binary models and will continue to do so here. In fact, to analyze a two-period problem, we’ll just build upon our latest post from December 2: Price Implied Default Rates.
We think that needless detail obfuscates the central points while providing no marginal explanatory power: either in a statistical or pedagogical sense. So, we like to keep things simple.
Note that we’re providing examples of simple, reduced-form models à la Jarrow and Turnbull (1995) or Hull and White (2000), not a structural Merton model like KMV. We’ll do that when we have the time.
In our December 2nd post, we considered a risky, one-year, zero-coupon bond. We assumed a face value of $1,000, a risk-free rate of 5%, and the risky bond’s yield to be 8%. We could have stated that last assumption as the bond has a price of $925.93.
From those assumptions, and the additional assumption that the owner of the bond would recover 60% of the face value, we calculated the risk-neutral-model-implied default rate of 6.94%.
Now the calculation of that default rate depends upon all of the assumptions, and obviously the answer will vary with changes in any of the assumed variables: the bond’s price or yield, the risk-free rate, and the loss given default rate.
Obviously, it also depends upon the applicability of risk-neutral valuation, which allows us to impose two very important considerations (versus reality). It allows us to (1) treat the bond’s price as the expected value of its cash flows, which is only valid if the creditor (in the model, not in real life) is risk-neutral, and (2) use the risk-free rate as the proper discount rate for a risk-neutral person. Those assumptions allow us to work with expected cash flows, rather than curvy preferences. We’ll focus on calculations in this post and not on applicability.
Finally, the answer also depends upon our choice of probability functions. Here, the only uncertainty involves full payment or not; so, that credit risk is easily modeled as a binary function, but it is important to note that risk-neutrality does not imply a particular probability function. Once the analyst has chosen from a family of distribution functions, the assumption of risk neutrality will determine (imply) particular parameter values, but that is all. For the more mathematically inclined, that is the change-of-measure that is referred to in the texts. (Probabilities are weights. Different parameter values within a distribution cause possible events to be weighed differently; ergo, the measure is changed.)
In this problem, we’ll keep the same assumptions as in our previous post for the first of our two periods. So, here is the setting: We have two zero-coupon, risky bonds issued by the same firm and each with a face value of $1,000: one matures in one-year and the other matures in two years. Imagine that there are two risk-free bonds, too.
The one-year risky bond is described as above; so, it will have a price of $925.93. If that bond were risk-free, it would have a price of $952.93. In a risk-neutral model, the difference in prices is the present value of the expected loss (of the risky bond, of course).
The risk-free rate in the second period is 7%. Note that there is no market risk – that is, no interest rate risk – so there is no evolution of interest rates or any type of rate process in our humble, little example. (We’re just making up numbers to illustrate a few basic ideas.)
The bond that matures in two years has a yield-to-maturity of 9.982%, which for all intents and purposes – and for everyone except the truly anal – is 10%.1
As an aside, with our two sets of interest rates, we can calculate an overall yield-to-maturity from our term structure of forward, risk-free rates, and for risky rates, we can determine the structure of forward rates from our risky yield curve.
Risk-free yield-to-maturity: we don’t really need to calculate this, so you can skip it is you want, but if the risk-free bonds are priced to earn 5% in the first year, and a two-year bond is priced to earn 7% in the second year, then the geometric average return for the zero-coupon, risk-free bond better be close to the arithmetic mean of 6%. That yield-to-maturity is simply:
[(1 + r1)·(1 + r2)]1⁄2 — 1 = [1.05·1.07]1⁄2 — 1 = 5.995%
So, the yield on a two-year, zero-coupon, riskless bond is about 6%: just like we knew before we did the calculation.
Risky forward rate: now, given the risky yield-to-maturity is about 10% on the two-year, zero coupon, bond, and given a first-year risky rate of 8%, then the implied forward rate for the second period must be:
[(1 + 0.08)·(1 + r2)]1⁄2 — 1 ≈ 10% implies r2 = 1.12 /1.08 - 1 = 12%
So, if (and only if) the two-year, risky bond yields (about) 10%, then its price is:
$1,000 ÷ 1.12 = $826.45 ≈ $826.72.
By the way, we’re off by 26¢ by using the easy 10% instead of the more precise 9.982%, but the lesson is free; so, the reader really shouldn’t complain.
Notice that credit spread increased from 3% (8% — 5%) in the first year to 5% (12% — 7%) in the second. All things equal, we should expect that the risk-neutral, price-implied, default rate will increase, too. Let’s see if that happens.
Three Probabilities of Default (or default rates): when we move to a multi-period problem, we have to be careful to specify the default rate to which we’re referring. There are conditional, marginal, and cumulative probabilities of default, and that is true whether we’re discussing actual (but unknown) probabilities of default or risk-neutral-implied probabilities of default like we’re doing here.
The conditional probability of default for a period, t, is the easiest notion to understand: given that the firm has survived until the beginning of that period, it is the probability that the firm can’t pay its bills during the next interval of time; here, we’re using one year as the time interval. We’ll denote conditional probabilities as pt for every period t.
The marginal probability of default is the probability that the firm will default in period t. Now, the firm only has the opportunity to default in period t, if it hasn’t already defaulted; so, the marginal probability considers the probability of surviving until that point and the conditional probability of default. If p1 is the (marginal) probability of default in the first period, the (1 — p1), then the marginal probability of default is:
(1 — p1)·p2,
For our little problem, we won’t introduce any special notation for the marginal probabilities of default.
Finally, the cumulative probability of default is the sum of all the marginals: p1 + (1 — p1)·p2 in a two-period problem. We wrote about longer term cumulative probabilities of events in this post, Good Column, Bad Math, where we talk about 100-year floods.
So, let’s find the conditional probability of default in the second period. Given that there was no default at the end of the first period, what is the probability of default in the second period implied by the bond’s price?
Well, with one period remaining, the price of the only remaining bond is:
$1,000 ÷ 1.12 = $892.86.
So, we can find the conditional probability of default in the second-period, p2, the same way that we found the probability in our one-period problem.2
price = $892.86= (1 — p2) × ($1,000 ÷ (1 + 0.07)) + p2 × (600 ÷ (1 + 0.07))
$892.86= (1 — p2) × $934.58 + p2 × 560.75.
So, if the firm survives the first period, there is an 11.16% conditional probability of default in the second period. That means that the marginal probability of default for the second period is the probability that the firm survives the first period multiplied by the conditional probability of default in the second:
(1 — p1) ·p2 = (1 — 0.0694) · 0.1116 = 10.385%
The cumulative probability of default is the sum of the two marginals: 6.94% + 10.39 = 17.33%.
Note that at the end of the first period the difference between the risk-free bond’s price of $934.58 and the risky bond’s price of $892.86 is $41.72. The $41.72 represents the risk-neutral, “present value” at the start of the second period of the conditional expected loss in the second period of the two-period bond. So, the $41.72 is related to the conditional probability of loss and the potential loss of $400:
($400 × 11.16%) ÷ 1.07.
But the second period will be experienced only if there was no default in the first period! So, in a risk-neutral world, a creditor will only experience the opportunity to lose (a discounted average) of $41.72 if there is no default in the first period: with probability (1 — 6.944%).
And the value of that today – at the start of it all – must be discounted by the first period’s risk-free rate of 5%. So, the present value of that expected loss that
$41.72 × (1 — 0.06944) ÷ 1.05 = $36.97.
Is our analysis correct? Let’s see. A two-year, risk-free, zero-coupon bond would have a price of $890.08. Our risky bond has a price of $826.45. That means that in a risk-neutral world – given all of our assumptions – the present value of the sum of the expected losses is the difference: $890.08 — $826.45 = $63.63.
In the first year, the present value of the expected loss on debt with a face value of $1,000 is $26.67. That means that the present value of the expected loss in the second period must be: $63.63 — $26.67 ≈ $36.97. Hey, where did we see that number before? That’s right — a few inches above where we discounted the expected present value of the second-period loss.
What about CDS?
To protect against loss, the CDS should provide $400 in case of default at the end of each period.
If the CDS policy were sold period-by-period, i.e., one-year terms, the first year’s premium would have to be at least $26.67 and the second year’s if sold today would cost at least $36.97. The actual cost, like everything else in the real world, would depend upon how badly creditors want to protect against loss, but those values are actuarially fair in a risk-neutral setting.
Also note that if the CDS policy were sold at the start of the second period, the premium would have be to at least $41.72 to be actuarially fair in a risk-neutral world. So, if purchased consecutively, the insurance premiums would need to $26.67 today and $41.72 next year in our risk-neutral world.
What if the insurance were purchased for two periods? What would the constant premium be? In that case, there is a chance that one or both premiums will be received (or paid). If there is no bankruptcy in the first period, then the premium will be paid twice; so, we need:
premium + (1 — 0.06944) premium ÷ 1.05 = $63.63
premium (1.0 +0.93056 ÷ 1.05) = $63.63
premium = $33.74
We assumed that the premium was paid at the beginning of each period; so, it is like an “annuity due” and actually is like a random, annuity due. It’s random because it is a constant stream of cash flows, but the ending date is unknown. In this simple two-period example, the “stream” could be one or two payments.
Also remember that risk-averse creditors should be willing to pay more than that, i.e., a risk premium, too.
And remember, we’ve said absolutely nothing about probabilities in the real world that our example represents. Risk neutral probabilities and default rates are derived from a set of assumptions that permits (relatively) easy calculation, but those probabilities and rates only work in our model, and they do not represent real frequencies. For more on that, please see our other posts on the topic.
As we hope that you can see, CDS is identical to term life insurance – except millions and millions of similar firms don’t die each year; so, there is little empirical evidence of various factors, including loss given default rates.
By the way, we’ve ignored counter-party risk and a host of other complicating assumptions.
As with many of our longer posts, we’ll likely edit this one in the near future.
Copyright © 2008 Spero Consulting.
Footnotes:
- By the way, can you imagine the number of folks who would scream that 9.982% isn’t 10%; so, they would indict us for not being precise thus we are wrong, wrong, wrong. That might be despite the fact that they may have been involved in allowing their organizations to accumulate billions of dollars of losses all the while arguing for precision. We do love those ironies of life. Also, the fact that we’ve made life simple by not continuously compounding would upset a few, too. ↩
- Just to be clear, we could have found the “future value” of the price by multiplying $892.86 by 1.07 and using the face value of $1,000 and the recovery (upon default) value of $600. In other words, we could have solved: $955.35714= (1 — p2) × $1,000 + p2 × $600. ↩
Price Implied Default Rates
Update: December 12, 2008. While none of our analysis or calculations was incorrect, we did have a minor error in the penultimate paragraph. We should of said “first” not “last.” To make amends, here is a multi-period problem, Multi-period Bond Price Implied Default Rates and CDS, but it won’t make sense without reading this one first. We also added a few paragraphs below, which should help explain the multi-period case.
Further update: April 14, 2008. We also have a new, related post on default rates. It is Calculating Counterparty Credit Reserves from April 8, 2009. Much of that post involves default rates, too.
We see that we’re getting a number of hits from search engines for folks looking for information about price-implied default rates – possibly college students with homework assignments or people trying to understand the various types of default rates they may encounter in their jobs or readings.
We have a number of posts on risk-neutral default rates, including Implied Risk Neutral Probabilities (of Default) , implied RISK NEUTRAL probability of default, redux, Risk Neutral Valuation: There Are at Least Two Expected Values, but we doubt if those settings are the ones that all guests want to see, especially those looking for help on their homework. (Of course, we think they are all worth reading.) So, as a public service, we offer an example of a simple, one-period bond problem. (It is single-period because it is gratis, after all.)
Suppose that a zero-coupon, risky bond with a face value of $1,000 matures in exactly one year. (Yeah, we said it was simple.) We’ll ignore compounding issues and assume that the annual risk-free rate is 5%. We’ll also assume that this risky bond’s yield-to-maturity is 8%.
Let’s calculate and discuss a few things before we provide additional assumptions.
We’ll calculate the bond’s price that corresponds to an 8% yield, and we’ll calculate the bond’s price if it were riskless; of course, by riskless we mean free of default risk or credit risk, only. Our simple one-period model doesn’t really permit interest rate risk, which is a type of market risk.
The bond’s price with a 8% annual yield is: $1,000 ÷ (1 + 0.08) = $925.93.
Now, if the bond were risk-free, its price would be $1,000 ÷ (1 + 0.05) = $952.38,
which is $26.45 higher. So, the price drops and the yield increases (over their risk-free equivalents) because the owner(s) of the bond is forced to bear some type of credit risk or probability of loss.
That $26.45 will appear again later, but at this point we can’t say much more than it is the difference in the prices of a one-period risk-free bond and our one-period risky bond.
The problem with simple calculations – whether in one or multiple periods – is that they ignore all of the factors that actually affect and determine prices. In other words, we’ve completely ignored the market dynamics and factors that would cause the price to be $925.93.
The market-clearing price would depend upon supply and demand considerations.1 Those considerations would depend upon the preferences, beliefs, and endowments of actual and potential sellers and buyers. In our simple setting, the important preferences would be risk and time preferences, which could possibly be expressed as utility functions; beliefs would involve the probability of default as well as other probabilities associated with each agent’s wealth in other assets if they exist – i.e., their endowments.
So, we can think of the price of $925.93 as a “function” of preferences, U(·); beliefs, f(·); and endowments, w.2 Unfortunately, in real life, we don’t know those factors; so, we’ll never be able to solve the actual problem, but we can solve a substitute problem.
All we know is that the price is $925.93, and it can be expressed as a yield-to-maturity – or a yield curve for multi-period problems – of (our assumed) 8%. So, the yield could be viewed as a function of the price if you want, but they’re really determined simultaneously.
As we’ve written many times before in related posts, because of several clever researchers in economics and finance, we can actually do more than just discuss the tautologies of price and yield.
In certain cases, we can assume that market participants are risk-neutral – that takes care of U(·) and makes the w irrelevant – and we can assume a particular form of a density or distribution function of outcomes, f(·). Very importantly, with those assumptions, if we don’t know one of the parameters of f(·) we can solve for it if we know everything else. That would be like solving for the misnamed implied vol or implied default rate, which is what we will do here.3
Here’s the key to all risk-neutral pricing: under certain assumptions, if agents are (assumed to be) risk-neutral, then we can treat prices as equal to the expected value of the asset’s cash flows according to an associated density function. That’s the only time we can treat prices as expected cash flows, rather than expected utilities, but depending upon the level of the course, some profs are pretty bad at explaining that fact.4
So, there are three things to consider. First, if agents are risk neutral, we can assume that they care only about expected values.
Second, if agents are risk neutral, then they won’t pay a premium for taking risk like risk-lovers would, nor will they need to be paid a premium for taking risk like risk-averse agents would need to be paid.
Third, that means we can assume that risk neutral agents are satisfied earning the risk-free rate. 5 So, given all of our words above, that means that risk neutral agents would value assets at the discounted value of the expected cash flows – discounted at the risk-free rate.
So, as we showed above, if the bond were actually risk-free, then price would have been $952.38, but the price is $925.93. That means that market participants must expect to receive less than the face value of $1,000 at least some percentage of the time, and that percentage is the probability of default.
Let’s see exactly how much less than $1,000, but first note that we could write the price of a risk-free bond in a slightly expanded way. Risk-free means 100% chance of getting $1,000; so,
Equation A:
$952.38 = 100% × ($1,000 ÷ (1 + 0.05)) + 0% × (value given default ÷ (1 + 0.05))
We did nothing but add zero to our previous calculation of a risk-free bond.
Let’s make it risky. Let p represent the probability of default, then for a risk-neutral person, we could write that same line as:
price = (1 — p) × ($1,000 ÷ (1 + 0.05)) + p × (value given default ÷ (1 + 0.05))
Thus, with a price of $925.93, we could write:
$925.93 = (1 — p) × ($1,000 ÷ (1 + 0.05)) + p × (value given default ÷ (1 + 0.05))
There are two unknowns: the probability of default, p, and the value of the bond given default, which has to be less than $1,000. In fact, we could put a determine a upper bound that is less than $1,000 if we wanted to do so. (How?)
Now, look at the last equation. Once we know or assume the value given default, we could find the probability of default, p, or vice versa.
Usually, one assumes the value given default and solves for p. There’s not really a good reason for doing it other than that’s what just about everyone does. (Don’t let anyone attempt to fool you with some lame justification. It’s tradition, custom, convention. Regardless of the word, it is arbitrary.)
So, let’s make-up – er, we mean assume – a value given default. This is often given in terms of a loss given default, a loss given default rate, or a recovery rate, but they’re all equivalent as one can see in the following relationships.
value given default = $1,000 — loss given default
value given default = $1,000 — loss given default rate × $1,000 = $1,000 × (1 — loss given default rate)
value given default = $1,000 × (1 — loss given default rate) = $1,000 × recovery rate
The loss given default is often abbreviated LGD. Unfortunately, the loss given default rate is sometimes abbreviated as LGD. Don’t let the bad notation fool you. Now, where were we?
That’s right. Let’s suppose that the loss given default rate is 40%. That means the recovery rate is 60%, which is its complement. Regardless, of how that assumption is stated, that means that the value given default is $600. So, now we have another number to put into our equation:
$925.93 = (1 — p) × ($1,000 ÷ (1 + 0.05)) + p × (600 ÷ (1 + 0.05))
or,
Equation B:
$925.93 = (1 — p) × $952.38 + p × 571.43.
If we did the arithmetic correctly, then solving for p gives a probability of default of almost 7%: 6.94%. Clearly, all things equal, which means holding everything else constant, as the loss given default increases, the probability of default decreases. One can make a graph of that relationship as we did in Implied Default Probabilities and Risk Neutral Models in June, 2008.
Now, under the assumption of risk-neutral agents, the difference between the two bond prices of $26.45 can be express as the difference in the present value of their expected cash flows. The difference in the present values of the expected cash flows in Equations A and B is the present value of the expected loss. The loss given default is $400. The undiscounted expected loss is: 0.0694 × $400 = $27.76. The present value of the expected loss is – not surprisingly – $27.76 ÷ 1.05 = $26.45.
That’s not the most someone would spend for insurance. That insurance premium depends upon the person’s risk-aversion.
Multi-period problems aren’t that much different, but they require bonds of multiple maturities if one is attempting to derive a credit curve, and one works for from the last first period forward solving maturity-by-maturity. Otherwise, one can find an “average” annual marginal probability of default. (We talk about a similar issue in Good Column, Bad Math.) So, in our multi-period example, we’ll explain the price of a two-year bond as the difference in present values between a risky and risk-free two-year bond. Then we’ll say much much of that can be attributed to the first period and then the second period.
Note: WE“VE SAID ABSOLUTELY NOTHING ABOUT THE REAL PROBABILITY OF DEFAULT! If all of the agents are risk-averse, then the unknown real probability of default will be less than the risk-neutral rate, but that’s not too helpful, is it? Some of our older posts do illustrate this idea.
Good luck with the assignment.
Copyright ©2008 Spero Consulting.
Footnotes:
- That’s quite a vacuous statement. ↩
- We are purposely using U(·) for preferences to remind readers of utility functions; f(·) for beliefs to remind individuals of probability density functions; and w for endowments to remind of their other wealth. Also, we put the quote around function, because we’re definitely not using it in its strict mathematical sense. ↩
- The implied is misnamed; it is inferred. It’s implied by the model selected, but it is inferred or imputed by the analyst. ↩
- Risk neutrality is actually slightly more general than that. ↩
- That’s why the actual yield is greater than the risk-free rate because market participants tend to be risk averse, but we don’t know the exact form of that aversion. ↩
Risk Neutral Valuation: There Are at Least Two Expected Values
But You’ll Never Know the One
We also have a newer post, Price Implied Default Rates, that provides an example more like a risky bond, and this one: Multi-period Bond Price Implied Default Rates and CDS. And we’ll have more related posts soon.
We’ve noticed that our few posts on risk neutral probabilities and implied default probabilities have been among our most popular content for readers throughout the world. (And it is cool to write “throughout the world.”)
So, we’ve finally starting composing a longer essay to cover continuous density functions, but an earlier post this morning – November 13–about means and medians reminded us of a source of confusion regarding risk neutral pricing and valuation methods, and that is the fact that there are (at least) two different means to consider.
In fact, there are (at least) two different distributions to consider: the real one, which can never been known, and the assumed one, which permits calculations to be made based upon market prices (and many assumptions). Actually, there may be far more than two, but we can illustrate our point with only two.
What Makes a Market: Markets permit individuals with different preferences; beliefs about future uncertainties; endowments;and planning horizons to exchange resources and claims mutually maximize (some measure of) each person’s prospective satisfaction (according to their individual preferences or tastes).
Those beliefs about future uncertainties can be thought of as subjective (or personal) probabilities of events (or combinations of events) that affect the individual or the world.
Generally, those subjective probabilities are represented as distribution functions (and combinations of events as joint distribution functions).
As one could well imagine, knowing those beliefs or uncertainties or subjective probabilities or distribution functions along with knowing the market participants’ preferences would be quite useful for predicting future prices. Unfortunately, there’s no economic way to do so. (In fact, we’d argue that in the real world, there is no way to do so – partly because many individuals can’t clearly specify preferences or beliefs and partly because there’s not enough time to do it.)
Regardless – and we’ll write in the singular – the real distribution function and preferences are unknown.1
What we do know – or, more precisely, what we can model – is that the observed price of a good or security is some “function” of preferences, planning horizon, endowments, and the “true” distribution of outcomes. (We put “function” in scare quotes because we’re using that term quite loosely and not in a strict, mathematical sense.)
So, if we observe a price of, say, a claim against a stream of cash flows, we know that it is the result of combining those factors mentioned above for both actual or would-be market participants: think supply and demand curves.
We also know (from Jensen’s Inequality) that if all of the participants are risk-averse, then price will be less than the distribution’s expected value–although we don’t know that “true” expected value of the cash flows; so, we don’t know the difference between the two.
Now, that expected value of the cash flows is one of the two expected values referenced in the post’s title. The “real” expected cash flow from a stock or bond or option or other financial claim. Again, it is something that we cannot observe in the real world. (Of course, for certain distributions, expected values do not exist, but that is another topic, and our goal here is provide a bit of intuition.)
The other expected value and – more generally, the other distribution – is known but is not “real” so-to-speak. That distribution function is an assumption, which can be considered a figment of the analyst’s imagination. (It is very, very sad to know how many practitioners confuse that assumption with the real world, but we shan’t attempt to ruin the hopes and dreams of anyone today. Plus, we’ve written about it in other posts.)
A Brief Accounting: So, there is a real, but unknowable distribution function, and imaginary, but knowable one. (Real distribution functions are only truly known for games of chance like dice or the lottery.) Since we can’t know it, we must assume one to go any further.
Unfortunately, assuming a distribution function isn’t very useful without also knowing somthing about preferences – in terms of, say, a utility function. If we did have both a distribution function and utility functions, then with additional assumptions about market mechanisms and horizons and endowments, we could calculate expected utilities, which would allow us to calculate market prices.
So, what to do? Through a few clever applications involving the mathematical notion of change-of-measure and economic notion of no arbitrage (via costless replication of a position), researchers showed that one could assume that, say, investors were risk-neutral and go from there. (Technically, as we understand it, one could use square-root utility pricing if they wanted to, but it would just complicate matters, and risk neutral preferences are so, so, nice and linear.)
So, if investors were assumed to be risk neutral, then they’d only care about expected cash flows, and one could then assume that those risk-neutral investors valued expected cash on a util-for-dollar basis. (Technically, risk neutrality means linear preferences but not necessarily util-for-dollar preferences; they could be multiples or fractions.)
Now with assumed preferences and a distribution function, the mean or expected value of the assumed distribution could be set equal to the observed price, and one could then work with that preference-distribution combination rather than the true unknown ones. (Note that we were a bit loose with the first clause of last sentence. Technically, it involves moving from today’s price to a future price and then discounting backwards to get a present value. In continuous time models, this shows as multiplying by both ert and e–rt, respectively, but it is obscurred in the usual slide-rule presentation of Black-Scholes.)
So, the selection of a distribution function – which hopefully represents something that we’ve inferred about the true but unknowable one – and the assumption of risk neutrality allows us to treat prices as expected cash flows, which both permits and simplifies calculations. However, as any practitioner can tell you, that doesn’t mean that the calculations are simple.
So, setting the price equal to the mean of the assumed distribution function is the second expected value referenced in the title.2 And that is okay IF (and that’s a big IF) the claim against cash flows can be replicated or hedged with other instruments. (And that’s hedged, not nedged or sledged.)
Finally, and briefly, as we noted back on June 22, when a parameter value of the assumed distribution is unknown, it can often be inferred or found if enough other information is available. Unfortunately, these inferred parameters are often called “implied” as in implied volatilities. They’re implied by the assumption of the particular distribution function and by the assumption that market participants are risk neutral, but one needs to make inferences to find them.
We hope this helps those struggling with the concepts, especially those in math-finance programs who are hindered by a weak background in economics. If it is not, send a note and let us know why or ask a question of us. It is likely that we’ll continue to edit this post.
- Each participant could have their own distribution or joint distribution function to specify future uncertainties, but we can illustrate our point assuming they share an identical one. Also note that we are assuming that such uncertainties can be measured and represented as distribution functions but that’s a different topic. ↩
- It’s only the mean of the assumed distribution, not the mean of the real distribution. ↩
implied RISK NEUTRAL probability of default, redux
Update: we have newer posts on the topic, too, including Risk Neutral Valuation: There Are at Least Two Expected Values, that describes the difference between real and risk neutral distributions. We also have: Price Implied Default Rates that provides an example more like a risky bond, and a multi-period example: Multi-period Bond Price Implied Default Rates and CDS.
The Wall Street Journal has an article about Iceland’s financial problems in today’s paper: Aftershocks Felt From Iceland. It turns out that the country has more problems than being a small, cold island in the middle of the North Atlantic.
Any way, we’re not writing about its climate, especially since Western PA’s is probably worse and we have no beaches and few tall blonds. No, we’re writing about the graph in the article and the blurb that states, “Trading in the credit default swap market puts the probability of a default by Iceland on its debt at a little over 50%.”
As presented, that statement is highly misleading and nonsense, and the purpose of this post is to explain why.
We’ve written about Implied RISK NEUTRAL probabilities of default a few times. In the aptly titled Implied Risk Neutral Probabilities (of Default) we provided an example that illustrated the difference between the actual probability of default, which is never known in the real world, and the model–implied probability of default, which could be calculated from ANY model–regardless of its validity–that permits at least two outcomes, e.g., survival and failure of the entity. Such a model may or may not assume risk neutrality, but risk neutrality makes the calculation simpler.
Regardless of whether Iceland goes bankrupt or not, we provide several examples that distinguish the risk-neutral, implied default rate from the true default rate.
In our earlier post, Implied Default Probabilities and Risk Neutral Models, we commented on a similar graph in another WSJ article from last June, and mentioned many of the factors that would be involved in such a calculation. Unfortunately, we recently and accidentally deleted a very nice comment about that post, which expanded the analysis to include counterparty credit risk: the risk that the purchaser of a CDS contract would not get paid (the insurance proceeds) in case of bankruptcy because the insurer or CDS writer was also insolvent – kind of like AIG.
In this post, we’ll provide another numerical example with a different assumed, risk-averse, utility function for the insurance buyer.
We’ll again assume a single period, but we will not use the 50% probability of bankruptcy that we did in the earlier post; it would be too confusing. In fact, the 50% probability of default mentioned in the article is likely the cumulative probability of default over the five years. It may or may not be based on equal marginal probabilities of default for each of the five years, regardless the annual marginal probability of default is not 10%; the 50% mentioned for five years was not found by multiplying five years times 10%.
Readers interested in an example of a discrete-time, multi-period survival problem that illustrates these issues should see Good Column, Bad Math. Readers interested in a calculation-intensive, similarly-structured, discrete-time problem, should see our research paper on moral hazard: Deadlines as Management Control Devices, which is based upon our dissertation. In that paper, the game ends with success, rather than failure, but the outcome tree is very similar.
So, will provide a couple examples similar to our square root problem in August.
Case 1: Assume that the person has natural logarithmic utility, which is strictly concave funtion and makes him risk-averse. We’ll also assume that the person has an initial endowment of $75.858, which we choose for convenience as you’ll see below. We’ll ignore time-value-money calculations and interest rates today; they’re inessential.
Assume that a firm will be worth $100 if it survives and $10 if if fails. That makes the loss given default (LGD) $90, and the loss given default rate $90/$100 equal to 90%. In the real world, e don’t know the loss given default until a default occurs, the firm’s assets are liquidated, and the residual cash is paid to the debtholders. LGD rate is always assumed in CDS and other similar calculations and, from our experience, seems to be considered much less than implied default rates.
Assume that the actual probability of default is 12%, i.e., the probability of getting $10 from the investment is 12%. REMEMBER, two items that we never know in real life are the market participants preferences – expressed here as a ln(·) utility function – and the actual probability of default, 12%. It is crucial never to forget this ignorance.
Also, we generally don’t know the person’s entire endowment, specifically his other wealth independent of the gamble. In this first case, we cleverly chose the person’s endowment so that his other wealth, not tied up in this particular investment, is zero. (You’ll that fact below.)
We’ll do what we need to do to calculate the risk-neutral probability of default and then later we’ll change a few assumptions to see how those changes affect the answer.
First, we’ll calculate the person’s expected utility with the investment. Now, with logarithmic utility it is:
10% × ln($10) + 90% × $ln($100) = 4.375 utils.
Now, to get the same 4.375 utils of satisfaction from a certain gamble (involving no risk), the person should be willing to spend up to:
e4.375 = $75.858.
So, that $75.858 is his certainty equivalent, or the most he would pay for the uncertain investment. (That’s why we cleverly set his initial wealth at the same $75.858, so there would be no money left-over after the investment.) With the same hand-waving (about market interactions) that we performed in August, we’ll suppose that the $75.858 is also the price, i.e., competition among similarly-preferenced and endowed buyers drive the price to the break-even point; technically, it is an indifference point but only pedantics like ourselves care.
Now, a risk neutral person could–but need not – be modeled as caring only about expected cash flows on a dollar-for-dollar basis; so, for a risk-neutral person, we could set his utility equal to dollar values and expected dollar values. In other words, he would value $10, $75.858, and $100 as 10 utils, 75.858 utils, and 100 utils, respectively. (We wrote “but need not” above, because we could add a constant and multiply by a positive number without changing the essence of the analysis.)
Remember, in the real world, we don’t know the 12% or the actual market participant’s preferences, which we assumed to be logarithmic here, or his starting wealth, BUT if we assumed that he was risk neutral in our dollar-for-dollar way, then we solve for the corresponding probability of default, i.e., find p such that:
p × 10 + (1 — p) × 100 = 75.858.
Rearranging and solving for p, we get the risk neutral-implied probability of default, p, equals about 26.83% (versus the real probability of default of 12%, which, again, we never know in real life).
So, the WSJ writer or editor is calling that 26.83% the probability of default, when it is, in fact, the implied probability of default assuming that market participants were risk-neutral. (Here, our “model” is so simple as to be innocuous, but in more robust settings – with more details – that’s not the case.)
That risk-neutrality, which provides linearity of preferences, is what allows the analyst to view the price and set it equal to the expected value of the cash flows in the possible outcomes, e.g., survive or fail, for a possible probability, p. In real life, analysts would use different distributions to calculate an implied probability of default based upon their specific model in much the same way that they would calculate a model-implied volatility when using Black-Scholes or a variant. (Provide market variables or guesses about those variables, provide a model, and solve for the last remaining unknown. Notice that there are quite a lot of assumptions in such a process.)
(By the way, for those with a little knowledge of stochastic processes, setting the price equal to the expected value (under risk neutral valuation) is why the phrase Martingale Method is used. That’s what a Martingale is: a process where the value today is equal to the expected value in the future, and it doesn’t really change if we add interest rates and discounting.)
Now please note, unlike in real-life, in this example, we know that the true probability of default is 12%. To an outside observer, without our information to construct the calculations, there is no clear relationship between the 12% and the 26.83%. In other words, knowing only the 26.83% says nothing about the true probability of default, and that is the error that the journalist makes in today’s article.
Because the 50% for Iceland is such a large number, the graph and the blurb seem almost designed to insight hysteria; however, actual – albeit unknown rate – could be substantially lower.
We’re sure that many WSJ readers along with the article’s writer misinterpret that number. We were and continue to be amazed (and shocked) at the number of folks who work or trade in the area who do not understand it. Thus, we view this post as a public service.
It is about 5:00 EDT, and proofread the post like we promised. We’ll add to this post this later today with more examples; so, please check back for updates that show why the price could drop and the implied RISK NEUTRAL probability of default could rise despite the TRUE probability remaining at 12%. (Note: the true default rate has little or nothing to do with the historic default rate. We’ve written a lot about that notion, too. See our essay on uncertainty management for that discussion.)
Case 2: let’s keep everything the same, but make the person “more” risk-averse. In microeconomics, that has a particular, technical meaning having to do with the concavity (the curvedness) of the utility function, but here we’ll avoid the issue by reusing the natural logarthmic function recursively, i.e., our utility function is now ln(ln(·)).
In such a problem, the additional concavity reduces the certainty equivalent of the gamble, and possibly the price. We’ll wave our hands again as a way to stay on course, and assume that the price falls to the new certainty equivalent. To make it work, without trying to hard, we’ll arbitrary assume that as soon as the person purchases the firm, his preferences, via utility function, (and risk aversion) changes to the double-log thing, ie.,
12% × ln(ln($10)) + 88% × ln(ln($100)) = 1.444 utils.
For the changed person to get the same 1.444 utils of satisfaction for sure, he’d be willing to sell it for:
eexp(1.444) = $69.243.
(As his risk aversion increases, the value of a gambles decreases.) Now, in the real world, a decrease in a potential seller’s reservation price doesn’t necessarily change the market price, but we’ll assume that it does. So, immediately, the price is $69.243. We can now find the revised risk-neutral probabilities:
p × 10 + (1 — p) × 100 = 69.243.
Solving for p yields a new, risk-neutral, implied probability of default of 34.175%. So, a change in risk preferences will change the implied probability of default. You may call it the market implied probability of default, but it is really the implied probability of default using the market price and assuming that buyers are risk-neutral, but that gets kind of long. The real probability of default is still 12%.
Case 3: Now, let’s go back to our first case, where we used the natural log, ln, only once, not twice. Let’s assume that right after the purchase, the new owner discovers that the loss given default is really $99 dollars, not the $90 that (it was assumed that) the market knows.
In that case, the new expected utility is 0 + .88 × ln(100) or 4.145 utils. Taking the inverse gives e4.053 = 57.544.
Now, IF everyone knows that the two states are {$1, $100}, then the risk-neutral probability of default satisfies:
p × 1 + (1 — p) × 100 = 57.544,
and equals 42.9%. Remember the actual probability of default is still 12%, but the low outcome is particularly low for a log utility function. So, the implied, risk-neutral probability of default is more than 3.5 times the true probability of default.
Case 4: let’s take Case 3, and assume that the buyer knows that the loss given default has increased from $90 to $99, but a trader or analyst at another firm has not observed that change but has observed the new price of $57.544. In that case, the analyst very likely keep the same LGD assumption and solve for a new implied probability of default of (using the erroneous, but assumed $10, rather than the correct $1:
p × 10 + (1 — p) × 100 = 57.544.
In that case, solving for p gives an model-implied, under the assumption of risk-neutrality probability of default of 47.2%. Of course, once again, the real probability of default is 12%.
The difference between the 47.2% and $42.9% implied default rates is solely attributed to the (incorrect) assumption about the loss given default. In our experience, the LGD is the least-challenged, least-investigated assumption used to price CDS and related products. In real-life, it would be extremely common to maintain that assumption in the face of falling prices.
We’ll probably refine this post in the coming days, but our four simple cases should be sufficient to cast deep suspicion on Iceland’s reported probability of default, when it is really a model-implied, default rate under the assumption of risk neutrality. Remember in all of our cases, the real probability of default is 12%. The models used to calculate that rates involve more variables and more calculations, but apply no more knowledge than do our simple examples here.
If you have any questions or comments, please write.
Copyright © 2008 Spero Consulting.
Forced Mergers? Bigger Is Not Necessarily Better!
We read in Monday’s (September 15) Wall Street Journal that the Federal Reserve nudged Merrill Lynch towards a merger: Crisis on Wall Street as Lehman Totters, Merrill Seeks Buyer, AIG Hunts for Cash.
We think it is a mistake, and we’re not certain of the Fed’s goal when its propose such arrangements. Presumably, such a nudge is rationalized on the basis of “stabilizing” the financial system, but we’re not so sure that such a rationalization is a justification. It seems like a knee-jerk reaction for the sake of “temporary stability.” We believe that it has the potential to lead to wider swings – meaning greater losses – in the future.
We’re sure that with all that is happening in the financial markets, such a merger seems convenient and, in some sense, reduces by one the number of problems that the regulators must currently confront. A quick scan of the papers reveals comments like “saving a relatively healthy patient,” “one less firm to worry about,” etc. But given the small number of such firms, we don’t believe that expedience justifies the long-term increase in systemic risk to the economy. Unless, of course, the “patient” is in much worse health than we have been led to believe.
While most are focusing on the short-term we shall emphasize the long horizon. We do this primarily because it is our nature and because we do not view the self-inflicted problems on Wall Street to be an indication of the general health of the economy. As we have argued many times during the past several months, it seems that the losses are particularly concentrated this time. (We argue that it has to do with poorly-aligned incentives, which led to “excessive” risk-taking within the industry and possibly a wisening up” of some of its customers.) While there are and will continue to be indirect effects that harm the economy, e.g., overly-tight credit standards and high risk premia, etc, those frictions seem sustainable.[1.In fact, it is possible that illegal immigrants have hit the hardest by the reduction in home sales and building, and their incomes are rarely reported to the government.] (Note also that interest rates are low because there is plenty of available capital.)
Our argument is against the merger is based upon a classic centralization/decentralization argument combined with a diversification argument. However, in the interest of brevity, we don’t expand on the centralization/decentralization argument that much.
For all the bravado and despite some firms insisting that they are “a breed apart,” (moo) there is a substantial amount of herd behavior in the financial services industry, particularly among the large, most heavily-regulated firms. We attribute much of that mimicry to the profit-motive but also to governmental regulation, where the regulated insists to the regulator that all is well because it is just like its peers (in terms of ratios, concentrations, etc). (Those outside of the industry would likely be amazed the number of senseless, nearly content-free, peer-group studies performed.) Despite those regulation-induced similarities, there is still some benefit to decentralization and diversification; not all firms have fallen on difficult times and those in trouble are in trouble to varying degrees.
We won’t provide a detailed statistical model, but we have in mind a typical sum-of-normal-variables-type of argument. Think of 100 small, equally-sized, independent firms each making their own investment decisions and earning normally-distributed returns.
Now think of that number being reduced due to, say, acquisitions. There are many ways to implement such a sequence, but we’ll take the simplest one: one firm begins buying another and another and another, etc. At each point, it successfully integrates the new subsidiary. That means the acquirer begins making all of the investment decisions for the new subsidiary.
Now in real life, this isn’t quite true. As firms merge they’re not always able to merge systems, etc; so, certain portfolios may remain separate and separately directed. Eventually, though, one would expect that asset allocation decisions would be shifted to a central authority; so, our argument would seem to hold in the long-run.
After the first acquisition, the acquirer has twice the market power as anyone else, and its concentrated returns will be weighed twice as heavily as any other firm’s. As it gets larger and larger, its idiosyncratic is no longer idiosyncratic; it becomes systemic. At the extreme, when it is the only remaining firm, its return is the market return. So, the idiosyncratic is the systemic. (It is like increasing the covariance among the original 100 firms.)
Now if the government wants financial system security, is having one mega-firm the best way to achieve it? We doubt it. Especially, if by stability, one means a VaR-type argument of not losing more than some amount some percentage of the time – if such tail events can even be calculated. (We assume the returns were normally; so, they could be.)
Without performing any calculations to determine the efficient frontier, it seems that the remaining firm would have to be some type of “super-investor” for the expected gains (from higher average returns and lower costs) to outweigh the potential for massive correlated losses, and we’ve seen little evidence of super-investors. We’d prefer a setting with less efficient, stumbling investment committees – with possibly more localized knowledge – making relatively small mistakes and having those mistake cancel-out to a setting with a few large megafirms making – possibly – fewer mistakes but larger and more correlated ones.
We’d like to say more, but we are powerless to do so today. Look for more as power returns.
No one can predict the future; see our St. James’ quote on our quotes page. It summarizes our outlook.
At the end of the day, it is still individual tastes, preferences, and desires that matter. At investment committee meetings, the argument is not always won by the most knowledgeable or most thoughtful or even the luckiest. Given a grown-up’s understanding of human nature and luck, does such concentration seem wise?
We view this concentration of decision-making to be problematic. Not for an sinister reason. As we mentioned, it could simply be a matter of (bad) luck. We joked in April when UBS tried to pin $37 billion in losses on one person. As firms grow larger and larger, such an outcome would not be a joke.
Good Column, Bad Math.
The Good: In today’s (September 3) The Wall Street Journal, Holman Jenkins has a nice Business World column entitled, The Inflation Hurricane.
The print version’s blurb perfectly summarizes the essay: “Does the federal government have to be responsible for everything?” By now, through constant media reinforcement, we know that everything that goes wrong is Bush’s fault, but Mr. Jenkins seem to mean why do we taxpayers — at, say, 1,200 feet above sea level — need to subsidize folks who choose to live below sea level?
Ignore the poor folks, who may be helpless, for whom we have a bit of compassion. Instead focus of the wealthy with vacation homes or condos in flood– or storm-prone coastal areas. Exactly, why should we subsidize their lifestyles? Yeah, we couldn’t come up with a good reason, either. Please read the column for yourself. However, before doing so, note that Mr. Jenkins does make a few math errors.
The Bad: Mr. Jenkins writes very well and we often agree with his point-of-view, but his knowledge of probability and math seems to no better than that of other journalists. (That wasn’t intended as a compliment.) In the essay, he writes about the probabilities of a “100-year flood.” Now, ignore the statistical issues like sample size and stationarity, which affect what constitutes a “100-year flood,” and focus solely on his numerical example.
He puts the probability of a 100-year flood at one-percent per year. So, far, so good, but then he puts the probability of a 100-year flood in ten years to be 10%, and 30 years to be 30%. Extrapolating, that would mean that the probability of a flood in 130 years is 130%, and that doesn’t make sense.
Mr. Jenkins’ mistake is that he is adding the probabilities as if they were independent events, i.e., 1% + 1% + 1%…up to 10 years or 30 years, but he is misinterpreting what the one percent represents. The one percent is the conditional probability of a flood in any given year.
Note that if the flood occurs in, say, two years, there are no more trials and the experiment ends. Mr. Jenkins is not interested in the probability of having, say, three “100-year” floods in the first, say, 50 years. He is interested in only one flood — the next one — and it seems that he is mistaking the probability of the flood occurring in year T with the conditional probability of the flood occurring in year T, given that it didn’t occur in any period from one to T — 1.
The probability of the flood occurring in the first year is indeed 1%. The probability of it occurring in the second year is the probability that it didn’t occur in the first year, ( 1 — 1%), multiplied by the conditional probability of it occurring in the second year, 1%. That means that the probability of the flood occurring in the second year is: (1 — 1%) ·1% = 0.99% = 0.0099. So, the probability of a flood within the first two years is the sum: 1% + 0.99% = 1.99%. That isn’t much different than the 2% that Mr Jenkins would have calculated, but the difference grows through time as we multiply the conditional probability of a flood in period T, which is one percent (1%), by the probability of no flood in the previous T — 1 periods, which is 0.99(T — 1), which of course gets smaller as T gets bigger.
So, the probability of the flood occurring in year ten is: (1 — 1%)9 · 1% = 0.94%. Summing those probabilities for each of the ten years gives 9.6% chance that it would have flooded in the first ten years.. Still not too far from Mr. Jenkins’ 10%, but clearly not going in the right direction. Here is one way to write the sum of 9.6%:
= 1% + (1 — 1%) ·(1% + (1 — 1%) · (1% + (1 — 1%) · (1% + (1 — 1%) · (1% + (1 — 1%) · (1% + (1 — 1%) · (1% + (1 — 1%) · (1% + (1 — 1%) · (1% + (1 — 1%) · 1%)))))))))1
Or, if we factor correctly, we have p(T), the cumulative probability of a flood by period T, we have:
p(T) = 9.6% = 1% ·(1 + 0.991 + 0.992 + 0.993 + 0.994 + 0.995 + 0.996 + 0.997+ 0.998 + 0.999).
For those who care, we could write it symbolically with p taking the role of 1%, and p(T) representing the cumulative probability of a flood by year T.

By thirty years, Mr. Jenkins sum is off by about 4%, i.e., 30% versus the correct 26.03%. At 100 years, he is off by almost 37%. Presumably, he would have 100%, when in fact it is 63.40%. It takes about 460 years to get a 99% percent chance that a flood would have occurred by that time; very different than 99 years that he would say.
So, while we enjoy reading Mr. Jenkins’ columns, but the next time there is math involved, we do hope that he asks someone. We’ll do it for free and confidentially if he asks.
Finally, note that our cumulative probability of a flood, p(T), is exactly the same as the cumulative default if, say, New Orleans were a firm and p were the constant, conditional probability of default (in a discrete time setting).
- We think we have enough parentheses. If not, let us know. ↩
Implied Risk Neutral Probabilities (of Default)
An Illustration with Only One Simple Equation.
Update: we have several newer, related posts: Implied RISK NEUTRAL probability of default, redux, expands the analysis presented here by using a different utility function and considering a couple of different situations;Risk Neutral Valuation: There Are at Least Two Expected Values describes a common source of confusion: the various distributions involved; and Price Implied Default Rates provides an example that is more like a risky bond.
We get a decent number of hits on the little that we have actually posted about risk neutral probabilities. We take that as a sign that many folks have questions about them, and that most extant explanations are poor. We could write an entire post — make that an essay — on why that is the case, but we will defer that to another day.
While we would like to complete our planned essays on common risk management concepts and calculations, we doubt that it will be possible in the near future. (Those readers who are thinking, “no rest for the wicked” should be ashamed of themselves, and those readers thinking we are too busy to accept new clients should be doubly-ashamed of themselves. We shall set up a PayPal account so that they can buy indulgences.)
So, to fill the gap between what we have written and what we will write, we’ve prepared a numerical example to illustrate what risk-neutral probabilities are and how they differ from reality. To master the topic, one needs to know a decent amount from a variety of topics in math, probability, and economics. Our numerical example won’t provide mastery, but will provide enough knowledge to be dangerous (and annoying) to others, and that in itself is an accomplishment.
Before that, though, we note that we once had a professor correct our use of English, particularly our word choice. (Now, the reader who thinks, “only one prof?” should also be ashamed of himself or herself.) We were corrected because we said that we were surprised by someone’s behavior. He replied that there was no reason to be surprised—it was perfectly predictable — but one should be shocked by it. We find the widespread misinterpretation of risk neutral probabilities to be another example of a unsurprising, though shocking phenomena.
In a similar vein, our hero, Nassim Nicholas Taleb, and his colleague, Dan Goldberg, of Decision Research Laboratory, show how volatilities (standard deviations) are also misinterpreted by a surprising number of financial professionals: We Don’t Quite Know What We are Talking About When We Talk About Volatility. Shocking, but no longer surprising.
Again, we’ll have more to say about the topic of risk neutral probabilities, particularly the assumptions and applicability of such analysis, but our emphasis here is to provide just enough of a framework so that our simple calculations make sense. So, we begin as always, at the beginning…
On Nedges and Sledges and Paving the Road to Hell
Or when is a “hedge” not a hedge? —when it is a nedge or a sledge or a wild*** guess, of course.
To paraphrase St. Francis de Sales, the road to hell is paved with good intentions because execution matters! (Elsewhere he scolds perfectionism, too, and argues for a balance: do not be rash, do not over-analyze. Realize that it is not a sin to be imperfect, but it is a sin to do wrong.)
Back on May 21, we posted On N’edges and Sl’edges and Billions Lost in reference to a WSJ article, “Trouble Hid in the Hedges.” That article cited the likely continuation of losses at large investment banks to due ineffective hedges, particularly due to the firms using various CMBX indices to hedge CMBS (commercial mortgage-backed security) investments. Some of those losses have now been recognized: “Lehman Talks a Rosy Talk.” A few weeks ago on June 10, we posted They’re Losing $Billions, But Doesn’t She Have Nice Clothes and Shoes? in which we again mentioned nedges and sledges and alluded to losses from ineffective hedges.
We define nedges as near hedges and sledges as somewhat–like hedges. (Presumably, if we were younger and hipper, we could have thought of iHedges for ineffective hedges, but we would prefer that the word “hedge” not appear in it’s entirety without scare quotes or italics, and we are not clever enough to come up with iHedge.)
We invented those terms to indicate that while some statistical relationship might exist between the two items — in the commercial mortgage case, a credit index and bonds — only a fool would believe that buying one and selling the other (in whatever “optimal” proportion) would eliminate the risk of loss. Unless one is buying and selling the same item at the same moment in time, then the complete elimination of value or return risk is not possible either in the short run or in the long run. Moreover, it is worth nothing that eliminating market risk usually comes at the cost of additional counter-party risk. That exchange of one risk for another might remind the reader of a variation of the arcade game, whack-a-mole, with the presumed goal to have smaller and smaller rodents pop-up after each hit or transaction. (Of course, one small mole can still do tremendous damage to a well-manicured front lawn.)
In the long run, if the same transaction could be repeated ad infinitum, average losses would likely be reduced if the relevant, return probability distribution function were well-behaved. Unfortunately, in the real world we can never be quite sure of that fact. We can, in theory, think of nice probability functions that arise from, say, repeated coin flips (where one gains a dollar for heads or loses a dollar for tails on each flip). At the limit when the number of flips approaches infinity, extreme losses would be rare, but avoiding any loss is not assured. In such an experiment, break-even might be expected, but even there it is not guaranteed.
In the short-run, no such asymptotic rule comes into play, and like the CMBS example, there is the chance — in some cases quite a high chance — of losing on both legs of the trade.
We believe that excluding fraud, the willful ignorance of first principles is the reason behind many huge trading losses. Such losses, while often attributed to bad luck, are not usually due to the lack of advanced “knowledge” or “sophistication.” So, it doesn’t take a PhD. It takes an PhD or MBA forgetting, ignoring, or never internalizing the basics, especially when some level of pseudo-sophistication is combined with a healthy dose of hubris, possibly due to the misspecification of past good fortune. We′ll have other posts on this issue in the near future because we find it quite annoying and extremely dangerous.
In our example below, we use simulated values rather than real return data so that we can construct it just the way we want it. We provide the example as an EXCEL spreadsheet to also show the mechanics of a simple simulation — as well as make our point about slopes, linear correlations, nedges, and sledges. Moreover, it would be difficult to provide an example using CMBX and CMBS relationships because the data are so lacking. In fact, many such instruments were not marked on a daily basis until this past winter. Furthermore, note that marking values on a daily basis is quite different than witnessing daily transactions and observing new daily prices. Deep and liquid competitive market prices do not exist for many of these securities, bonds, and instruments; so, often it is mark-to-untested-quote, rather than mark-to-market transaction.
Because our example is an introductory level statistics problem, some academics might consider it to be a straw man, i.e., a weak opponent designed by us to be easily defeated (by us). Fear not; we take pride in our cleverness and humility, but we try hard not to be devious to fool others or to be so dull that we fool ourselves.
Instead, we would argue that academics that would make that criticism are truly academic and haven’t spent much time in firms or other organizations, where it is possible to overhear comments like, “We don’t have time to think about a strategy. We have to do something!” Or, the equally bewildering, “We have to do something, or it will look like we don’t know what we are doing!” As it turns out, without a substantial degree of luck or divine intervention, such decisions rarely pay-off. As a girls basketball coach, such comments remind us of closely-contested, middle school games, when the panic sets into the leading team, teammates begin to play “hot potato” with the ball (“Ouch, it burns. I don’t want it. You take it.”) and the girls forget to breathe in their desire not to make a mistake. (Don’t worry arbiters of sexism. We’ve seen it happen with boys and men, too, but just we haven’t experienced it while coaching them.)
In this file which we have protected, we (1) generate three correlated random variables, and (2) regress two of those variables, n and sl, against the other one, x. For illustrative purposes, we do this in a very simple fashion to show that by design the betas, b, or line slopes should be the same due to the common covariance that x shares with n and sl. (In a simulation, they may not be exactly equal.) With x as the regressor or independent variable, the slopes, bn, and bsl, are equal to cov(x, n) ÷ var(x) and cov(x, sl) ÷ var(x), respectively.
So, for example, when traders or risk managers don’t have time to think because they MUST ACT, they might select either n or sl as a suitable hedge for x. Focusing only on the expected, least-square minimizing relationship, they would be indifferent between the two. Moreover, such methods and thinking might allow the trader or manager to underestimate or ignore the scale of the potential risks that they may be inflicting upon the firm via their “hedging activities,” and this is where the road leads to hell. For example, if the firm owns x and uses n or sl to hedge by short-selling either of those instruments, then low values of x combined with high values of n or sl would be particularly damaging, and the chance and magnitude of the loss is related to the total variance of n or sl—not just the covariance. With suitable leverage, it is possible to lose big, with a nedge like n, whereas with sl it is possible to generate large, two-legged loses without leverage.
This can be seen in the following graph where we have chosen many of the parameter values so that the two dependent variables can be easily distinguished. Notice, also, that as promised, both n and sl have the same slope with respect to x, and in theory, either one could be used to “hedge” x, as both would have the same expected benefit. It is just that there is a difference between expectation and realization.

Now, some may argue that they are more sophisticated than our portrayal and would not employ such a simplistic technique unless no other alternatives existed. However, given its pervasive use, we find that difficult to believe that it is a method of last resort. More importantly, while there are different and more complex hedging strategies, unless the market risk is eliminated via forward purchases or sales, all probability and statistics-based strategies suffer from the same underlying problem that we illustrate here — there is residual randomness and the possibility to lose on both legs.
With more complicated hedging strategies it might be more difficult to see this problem and while some additional variation might be reduced, some still remains. Thus, the old adage of losing sight of the forest for the trees seems particularly relevant here. We also note the many folks spend much time and energy analyzing retrospective relationships to determine such hedging strategies, and, of course, such relationships need not be persistent — the oft-mentioned problem of induction. Dynamic, unstable relationships will invalidate historical analyses as will stable relationships with rare events and small sample histories. CMBX — CMBS series have extremely small samples. So, basing hedging strategies and positions on historical analyses poses additional risk due to misspecification. That is why the recent and relatively long period of low volatility in many markets was so damaging to those who forgot or ignored this fact — per our point of ignoring first principles.
In addition, while CMBX offers protection on a basket of CMBS, if either (1) your firm holds securities not in the CMBX basket or (2) your firm holds only some securities in the basket, your firm is likely to have sledges rather than hedges when using an index to try to off-set a position.
Finally, a slightly more technical criticism is based on our observation that some analysts seem to forget that risk-neutral pricing methods don′t actually eliminate risk; they just, in some sense, ignore it for certain purposes. (Out-of-sight is often out-of-mind for the harried and/or unwitting.) We won′t dwell too much on this issue here or anywhere else until we publish more reference material, including a simple explanation of risk neutral valuation. However, please do note that some folks do tend to believe that only expectations matter, and often these same folks tend to also forget or repress the social and behavioral element of trading, especially if they haven’t shown a previous interest in human nature in their careers or education. Such observations make us wonder whether their hiring managers are cynical or naively-ignorant? See this post, Caveat Emptor, for a related complaint.
P.S. As we mentioned, the EXCEL simulation file is protected; so, other than clicking the link to our web site or pressing the function key, F9, to generate a new batch of random numbers, there is little that one can do. Interested parties should contact us directly for a non-protected version.
Implied Default Probabilities and Risk Neutral Models
Update: If this topic is of interest, please see our more recent posts. Many provide better numerical illustrations of risk neutral probabilities: Implied Risk Neutral Probabilities (of Default) from August; implied RISK NEUTRAL probability of default, redux from October 9; Risk Neutral Valuation: There Are at Least Two Expected Values from November 13; and Price Implied Default Rates from December 2. We also have a new multi-period example: Multi-period Bond Price Implied Default Rates and CDS from mid-December. All of our posts are designed to illustrate and communicate the basic notions so that the reader has the requisite foundation to consider additional details per their specific interest.
This weekend′s WSJ has an article entitled, “Bond Insurers Inflict Further Pain on the Market.” While the article is interesting for several reasons, in this post we focus on one tiny aspect of it, which will still take a long time to explain. In the article, there is a graph labeled “Default Fears,” and it includes the subtitle, “The annual cost to buy protection against default on $10 million of MBIA and Ambac debt for five years.” According to the graph, such protection costs about $2.0 million per year for each firm. The purpose of this post is to describe the meaning of the $2.0 million, including various interpretations.
To simplify matters, we assume that the protection, i.e., the insurance contract, lasts only one year. In many ways, such a policy is almost identical to term life insurance. (For those interested in more advanced topics, we will try to add a post that analyzes a multi-year, term contract, but we want to start with the simplest case.)
The insurance premium will depend upon the (1) preferences, (2) beliefs, and (3) endowments of the buyers and sellers. In the near future, look for a rather detailed essay on these topics, but for now think of (1) preferences as the parties′ likes and dislikes, including their levels of risk aversion; think of (2) beliefs as forecasts or specifications of what might happen, which hopefully can be expressed as probability distributions; and think of (3) endowments as the parties′ current and future resources.
We mention these factors because ceteris paribus, which means everything else equal, we know that (a) the more risk averse the person, the more they are willing to pay for a given amount of protection, and (b) the greater the belief in the likelihood of something bad happening — in this case default — the more they are willing to pay for a given amount of protection. If we could express these preferences and beliefs as mathematical functions, and if the functions possessed certain characteristics, we could then determine the exact price that the participants would be willing to pay or receive.
Unfortunately, in real life, it is impossible to have this knowledge. One of the great things about a free economy is that this information isn’t needed to transact and trade. (Within the past few years, researchers have won Nobel prizes determining just how much (or how little) information is needed for such a mathematical verson of the economy to function properly.)
In the real world, one can assume that most people are risk-averse much of the time, but as we mentioned and reiterate, no one knows these preferences. In addition, no one except God knows the true probability distribution function of future events. All we do know is that through combinations of beliefs, preferences, and endowments possessed by the potential and actual buyers and the potential and actual sellers, we see a price, which on Friday was a premium of $2.0 million for $10 million of credit protection for one year.
Can anything further be said? Thanks to some very clever researchers, yes, we can say more as long as we understand the hypothetical nature of such statements.
Risk Neutrality: These financial economists realized that there was no hope discovering actual (risk-averse) preferences or the true way that randomness occurs (the actual probability functions), but these researchers were smart enough to realize that certain problems could be solved in an easier manner if they assumed that individuals involved in economic activities were risk neutral, rather than risk averse, and then proceeded with their analyses under that assumption.
Assumed Distributions & Implied Values: The assumption of risk neutrality allows observed prices to be viewed as the expected values of possible cash flows. That is generally not true in the real world, but it is okay in a hypothetical, risk neutral world.
So, once preferences were assumed to be risk neutral, and once the researcher further assumed a specific form for the probability distribution function, e.g., normal or log-normal or exponential or whatever, he could set the price equal to the expected value and solve for any unknown probability distribution parameters, like the mean or the variance of a normal distribution. Those implied values could then be used for other purposes, i.e., valuing other things that behave in a similar way.
If the reader is familiar with options, this is the same idea behind implied volatilities, which is short-hand for the implied standard deviation of the assumed distribution under the assumption that market participants are risk neutral. (Of course, it is actually inferred, but we′ll ignore that fact.)
Unfortunately, there are many market participants who use implied volatilities and wonder why they don’t equate to actual, historical volatilities. Ignoring the obvious prospective-retrospective differences, it is in some sense like wondering why ten miles isn’t equal to ten kilometers or wondering why a black-and-white photograph doesn’t capture all aspects of a colorful setting. We will have more to say about this below, but now we get back to the issue at hand.
Assumed Expected Loss: If we observe a premium of $2.0 million on $10 million of debt for a one-year policy, and if we assume that participants are risk-neutral, then in a competitive market (another big assumption that we will ignore) we can then conclude that the (risk neutral) expected loss from default is $2.0 million.
In fact, if real buyers of protection are isk-averse, then the premium of $2.0 million implies that they expect the loss to be less than $2.0 million. This has to do with the fact that being risk-averse, the buyers don′t like to take chances; so, they will pay a (risk) premium above the actual expected loss to stabilize their wealth, i.e., to avoid the risk of losing greater amounts.
When all goes right, this is how insurance companies make money. Buyers are willing to pay more than their expected losses to avoid larger, catastrophic losses. On average, and when losses aren′t too correlated, the insurance company profits by the sum of the differences, which is the sum of the risk premiums.
Probability of Default: Fortunately, in cases of default (in discrete time), we have very simple probability distributions — either the firm defaults, or it does not; so, we don′t have to choose a distribution function, A simple binary one works. We will let p represent the probability of default, and (1 — p) is the probability of survival or no default.
Face Value & Loss Given Default: We assume that the $10 million mentioned above is the face value of the debt, which we will label as F. That means that at the bond′s maturity, if the firm still exists, it will have to repay $10 million.
In the article, the author isn′t very clear whether the $10 million is the loss given default or the face value, but such protection contracts are usually specified on the notional or face value of the debt; so, that is what we will assume.
As it turns outs, except in cases of complete fraud, creditors usually recover some portion of loan or bond value. Banks usually collect or recover more than bond-holders because they are better organized for such events; it’s always ad hoc for a particular coalition of bondholders. Regardless, to estimate or infer a probability of default, we need to assume a recovery rate, or its complement, a loss given default rate. We will use L to represent the loss given default rate.
Taken together, the probability of loss, p, multiplied by the loss given default rate, L, multiplied by the face value, F, is equal to the expected loss assuming risk neutrality, which under that assumption is equal to the price. (We’re ignoring counterparty credit risk in this example.)
price = expected loss under risk neutrality = p · L · F + (1 — p) · 0 · F = p · L · F
Without a default there is no loss; so, the second addendum is equal to zero.
Thus, hypothetically at least, the premium is equal to the product of the risk-neutral probability of default, the loss given default rate, and the face value. We observe the premium of $2.0 million and the face value of $10 million, So, we have 2 = p · L · 10 or p · L = 0.20 = 20%. With two variables and one equation, if we assume one, we can solve for the other.
Generally, folks assume a loss given default rate, L, and solve for the risk-neutral, implied probability of default, p. We are pedantic and also believe that proper notation informs and reminds; so, we will write this implied (or solved for) probability of default as p(L). Suppose the loss given default rate is 40%, then it is easy to see that in our case, the risk-neutral probability of default would then be 50%. The graph below shows the implied probability of default for each assumed loss given default rate.
.jpg)
Now, there are a couple of points worth making. The first regards the particular situation described in the article.
Generally, the loss given default rate is less than 100% (of face value). How could it not be? Because MBIA and Ambac guarantee the debt of others, it is easy to imagine that some organizations could lose more than the face value of their MBIA and Ambac bonds if either firm defaults, and so it might seem that the loss rate in greater than 100%. (But that’s because the notionals of other, guaranteed items might not be included.)
Usually, firms will assume that the other exposure can be converted into equivalent notional value of bonds rather than set a higher loss rate, and the conversion requires a rather substantial assumption. That is one of the reasons that say, $1 billion worth of protection might be written on $100 million bond issue.
The article cited above (and many other articles last winter, spring, and summer) have described the losses that large banks have suffered on their inventories of municipal bonds and other securities due to the decreased creditworthiness of the insurers.
In fact, the article mentioned that those banks may face another $10 billion of losses due to the recent ratings downgrades of the insurers.
Many readers will recall that during this past winter, a consortium of banks attempted to stabilize the capital positions of the insurers in hope of preventing further writedowns. Of course, the circularity of some of these transactions should remind one of a snake eating it′s own tail, i.e., we will invest in them so that they (who are now us) guarantee that we lose nothing if bad things happen. Not that different than writing life insurance on one′s own life…if I die, come to the house so that I can pay you…
The second point is more general. As we mentioned above, many people expect to see risk-neutral probabilities and volatilities in real life. That is kind of like using a map to navigate and then believing the world is flat because the map is useful within some relevant range or being surprised the states aren’t actually red, green, blue, or yellow per the map.
Aside: With our own ears, we′ve heard managers argue that their subordinates were wrong because the latter′s historical, default rates didn′t equal the former’s risk-neutral, implied default probabilities. Your apple doesn′t look like our orange. Your orange must be wrong! Where did you get it? Do you know what you are doing? (The fact that there need not be any relationship between historical and prospective rates is a separate issue.) As we’ll explain, there different notions measured in different ways.
Consider that the more risk-averse the buyer, the more he is willing to pay for insurance; so, the higher the implied, risk-neutral probability of default, and thus the greater the difference between a fixed “true” or buyer-believed default rate and the implied default rate based on the premium. (Again, this assumes all else equal and some notion of knowing the true rate and having a clear notion of more risk averse, etc.)
So, why would anyone think that the two notions (actual and risk neutral rates) should be equal — that would happen only if risk neutral folks set prices in the real world, and as it turns out that because they are NEUTRAL or indifferent towards risk, they tend not to be the most promising insurance customers.
We will likely add to this post as we refine this last criticism and recall forgotten events where the methodology was abused and misunderstood. Also, look for several risk-related and valuation-related essays in the coming weeks. After we publish those essays, we will have a platform from which to criticize similar shoddy thinking (but in more complicated setttings.) At some point in the future, we will likely turn this post into an essay, and list it under the fallacies page.
On N’edges and Sl’edges and Billions Lost
Today’s Heard on the Street commentary in The Wall Street Journal, “Trouble Hid in the Hedges,” reminded us that we have been meaning to write about nedges and sledges for some time. The regular reader may have noticed that we are rather pedantic, particularly about the meaning of words. In financial risk-related activities, three words are frequently misused, and two of them are almost never used properly. Those three words are: (1) market, (2) arbitrage, and (3) hedge. Per the title, we will propose a few alternatives to “hedge.” In another post we will have more to say about the overuse of the word “market,” particularly when it is used as a label for a haphazard sequence of sporadic exchanges.
During our doctoral studies, we learned that “arbitrage” meant “riskless profits.” Usually such (an economic) rent accrued through the simultaneous purchase and sale of the same item with two different parties, possibly in two different locations. If such opportunities do exist, they are usually ephemeral. As we lament the passing of our youth, we think their transitory nature is well-captured by the refrain of Bruce Springsteen’s song, Glory Days: “…Glory days in the wink of a young girl’s eye…” (Although we prefer “blink” to “wink” as it seems more tragic.)
In real life, offsetting transactions are usually sequential rather than simultaneous, and the time difference can be substantial. In these cases, individuals often claim that their positions are “hedged.” Now, when we learned the definition of “arbitrage,” we also learned that “hedged” meant that the offset was complete. Either via an immediate offsetting transaction, or through a sequence of costless future ones, the arbitrageur was immunized against the possibility of loss.
Now, there are, in fact, ways to eliminate risk and be fully-hedged, but for many traders, these tactics are expensive relative to the level of expected profits. (Makes one wonder about the efficiency of that desk’s risk-return proposition.) Instead, approximate hedges are often used. These approximations may reflect the direction of the linear relationship, i.e., the correlation, between instruments but they ignore the strength of the relationship. Two dependent variables may have the same regression coefficient or β (or slope) with respect to an independent variable, but the confidence intervals around the regression line can be substantially different. These differences impose risk, which may be very costly. As today’s column explains, several firms have recently lost billions of dollars on bad hedges. (Time permitting, we’ll update the post with a graph to reinforce the fact that regression lines are probabilistic in nature.)
We refer to these approximate hedges as “n’edges,” which is the contraction of “near hedges,” or more generally as “sl’edges,” which is the contraction of “somewhat like hedges.” While use of the former term seems to give the risk mitigator the benefit of doubt, we hope that the latter term conjures the image of the use of a large, blunt and, possibly, inappropriate instrument. We believe our hyperbole is justified given the relative ease in which firms and individuals claim risk immunity due to such sledges. For more on the technical aspects of replication, we point the interested reader to many of Taleb’s papers dealing with replication strategies at www.FooledByRandomness.com. We will shortly post an essay and a data set to illustrate a common analytical mistake when perfoming statistical analysis. That essay also allows us to justify our motto: Cogitatio Ante Computation.
Bubble Economics and Subsidiary Valuation
There is a decent article in today’s The Wall Street Journal about econ professors at Princeton researching financial market bubbles. We write “decent” because there is much with which to both agree and disagree. In fact, one could write a book on the topic, and several have. Today, we offer only a short post. Our goal is to relate the theory to the over-valuation of equity investments, particularly where the float is very low. We have in mind partially corporate-owned, partially publicly-owned subsidiaries where the unrestricted public owns a tiny fraction of the shares outstanding.
Besides the WSJ article we — again — refer the interested reader to Rick Bookstaber’s excellent book, A Demon of Our Own Design, especially his discussion of the bursting of the dot-com bubble on about page 170. One will find many similarities between his description and examples in the WSJ article. We emphasize a crucial similarity by quoting from today’s article, “In markets with lots of disagreement about values, the optimists are better able to dominate when there are fewer shares available.” The idea is that when there are few shares available for purchase, say, when one or two large shareholders hold the vast majority of shares — or as Bookstaber discusses when many outstanding shares are restricted — the optimists on the demand curve set the price.
The implication for the valuation of such subsidiaries should be clear, and makes us wonder about an antonym for synergy? 1 Our point: the market value of 100% of the subsidiary is likely to less than the sum of its parts. To be more explicit, suppose the float and daily trading volume are the same and equal 1% of the outstanding. It is unlikely that the 99% restricted or non-tradeable piece is worth 99 times that one percent’s market value. (Try placing that order in the market.)
Depending upon who owns the 99% and in what fractions, it is unlikely that the accounting treatment would be mark-to-market, but if it were, our implication adds an additional layer of complexity to any such marking. More importantly, it makes one wonder whether CEOs and CFOs should act with greater care when they report the market value of nearly-wholly-owned subsidiaries by extrapolating the price of very thinly-traded shares.
Okay, we can’t resist making three other comments.
- We suspect that Prof Wei Xiong is attempting to use non-technical language and also to be hyperbolic when he is quoted as saying, “The two most important characteristics of a bubble…People pay a crazy price and people trade like crazy.” However, it is worth noting that almost any nonparticipant could make those same statements about any trading activity and most other transactions, as well. Without restating the assumptions behind it, think of the potential buyers and sellers at the top points of a typical X formed by the intersection of supply-demand curves on an inverse demand curve. Moreover, dear reader, think of all the things that one doesn’t buy and sell daily. It is just crazy. The actual trades are at the margins of usually large populations of buyers and sellers. Such activity doesn’t seem normal.
- The article also states that “Mr. Brunnermeier and Stanford’s Stefan Nagel found that hedge funds on the whole “skillfully anticipated price peaks” in individual tech stock…” We haven’t read their work, but we encourage interested parties to: (1) read Mr. Bookstaber’s book, (2) read about survivorship bias, and (3) contemplate whether such findings will hold if the past ten months were studied.
- Like Alan Greenspan, we thought technology stocks were unreasonably valued in the mid-90s, too. We didn’t make a speech about it in 1996 or coin a phrase like “irrational exuberance,” and for at least three years, the market seemed to ignore both of us. How could anyone reasonably expect that a 70-year-old-man, or even a younger and much handsomer man, could fully grasp all of the technological and financial implications of everything that was happening then (or now)? Were we correct and like Cassandra in 1996, or we were the just lucky in 1999? We know someone who in the summer of 2003 claimed the 10-year Treasury rates were going above 5%, possibly to 6% — 8% and staying there for a long time. If and when that happens, we are sure they will take credit for their prediction. While there have been ups and downs, the rate (~4%) is the same as at the time of the prediction. Back then, “everybody knows that rates are going up.”
Footnotes:
- One source had “antinergy,” another “antergy,” a third “dysergy,” but the best — by very far the best — had synergy, but with scare quotes: “synergy.” ↩
Caveat Emptor
We recently met an individual who is working towards a PhD in mathematical finance. He had a masters degree in something technical, too, and if we recall correctly, has been in the PhD program for four years.
He wanted us to read his paper regarding the volatility of volatility. We didn’t have the time, but a quick skim revealed that it was suitably loaded with math and graphs — quite technical. 1
When individuals discuss volatility, they usually speak of one of two kinds: (1) historical or realized vol or (2) implied vol. Historical or realized vol is the measured randomness in a past sequence of observations of a particular variable.
Implied vol is an estimate of future randomness, and it is called “implied” because it is found by solving a model, which will have any number of assumptions and restrictions. For example, imagine a pricing model (a function) of, say, input five variables. 2 Now, imagine that one can observe the actual price and four of the five input variables. Then, under suitable conditions, one can solve for the fifth, possibly unknown, variable that is implied by the function or model.
If one is using the Black-Scholes option pricing model or one of its variants, then (usually) the price of the option is known, and four of the five input variables are known or can be independently estimated: (1) the exercise value of the underlying variable, (2) the current value of the underlying variable, (3) the time until expiration, and (4) the risk-free rate. Using the appropriate algorithm, one can then find the implied vol that sets the function equal to the observed option price.
All of that just to say that when folks are concerned about the vol-of-vol, it almost always involves options, and when people are concerned about options or instruments with embedded options, they almost always use Black-Scholes or some variant. So, Black-Scholes and vol-of-vol are kind of like bread and peanut butter. A lot things can be eaten with bread (Black-Scholes), but peanut butter (vol-of-vol) is used almost exclusively with bread (B-S). Of course, we know about peanut butter pie and crackers and celery, but it is a decent analogy, and we are always willing to hear of better ones.
So, what does all of this have to do with our new acquaintance? We asked him to describe the Black-Scholes model, which is like the bread of any vol sandwich. His reply: when he had left his country of birth — about six years ago — they did not have equity options; so, he didn’t really know it well enough to explain it to others. Not the answer that we sought. Morals: (1) make certain that you have the right person asking the right questions, and (2) thought before calculation is the preferred order for the two.
Footnotes:
- Now for those of you who are unaware or unconcerned about the volatility of volatility, it basically involves the randomness of the randomness of an underlying (random) variable. For some market variables, we may have periods of great variability interspersed with other periods of near certainty — little change from day-to-day. Our short-term measures of uncertainty — like, say, the standard deviation over thirty days — would then vary, too. ↩
- These five variables will determine the price or value. ↩
