The Equity Premium Puzzle

A simple guide from basics to theory

The Numbers

Over about 100 years in the U.S., stocks returned ~7% per year. Treasury billsi returned ~1%. The gap is the equity premiumi.

What we see in data
6.18%
What the model can do (max)
0.35%

The model can only explain a small premium. The real premium is about 18 times larger. That is the puzzle.

The puzzle in one picture

First: stocks return ~7%, bonds ~1%. The difference (6%) is the equity premium.

Stocks (S&P 500)
~7%
Bonds (T-bills)
~1%

The gap between the two bars = equity premium (~6%).

The model can only produce 0.35% premium. Same scale:

Observed (data)
6.18%
Model (max)
0.35%
0% ———————————————————————— 7%
Observed in data Model maximum

What 6% really means

Six percent per year sounds small. But compounded over 90 years (1889-1978), $1 becomes very different amounts. Click to see.

Stocks
$1
T-Bills
$1
1889 — Click "Go" to start

Returns by decade

Average real annual returns. The premium exists in nearly every decade.

1889-99
S: 8.7% B: 4.5%
1900-09
S: 8.4% B: 2.0%
1910-19
S: 0.9% B:-3.5%
1920-29
S:15.0% B: 5.4%
1930-39
S: 0.5% B: 2.6%
1940-49
S: 6.3% B:-4.0%
1950-59
S:14.8% B: 1.0%
1960-69
S: 5.7% B: 1.3%
1970-78
S: 0.8% B:-0.3%
Stocks T-Bills

Stocks beat bonds in 7 of 9 decades. Even the two exceptions (1930s, 1970s) show tiny stock returns, not large bond returns. The premium is persistent, not a fluke of one era.

Find your risk aversion

You have $10,000. A fair coin is flipped. Would you take each of these bets?

Question 1 of 6
Heads: you win $6,000. Tails: you lose $1,000.

Risk aversion and the formula

Risk aversioni is how much you dislike risk. In the model it is measured by αi (the coefficient of relative risk aversion). Higher α means you dislike consumption going up and down more.

The equity premium in the model comes from this formula:

ln(1 + premium) = α × σ²

Here σ² is the variance of log consumption growth (~0.00125 in U.S. data). So: premium = exp(α × 0.00125) − 1.

When consumption growth varies, stocks are risky because they pay off more when consumption is already high (and you value that less). The premium is the reward for bearing that risk. It grows with α: more risk-averse people demand a bigger premium to hold stocks.

In the data, the variance of consumption growth is small (~0.00125). So to get a 6% premium, α must be large. Move the slider to see how the formula changes.

10
With α = 10:
Premium = exp(α × 0.00125) − 1 = 1.26%

What this risk aversion means in practice

What world would fix the puzzle?

The puzzle exists because consumption is too smooth. Change the data parameters below and see what happens. What would the economy need to look like for the model to work with reasonable risk aversion (α ≤ 10)?

3.6%

Actual U.S. data: 3.6%. This is how much consumption growth varies year to year.

0.40

Actual: ~0.40. How closely stock returns move with consumption.

16.7%

Actual: ~16.7%. How volatile stock returns are.

α needed to match 6.18%
51
plausible: ≤ 10
Cov(stocks, consumption)
0.0024
actual: ~0.0024

The Story in Three Papers

What they asked

Can standard theory explain why stocks return so much more than safe bonds?

Main equations

Objective:

max E0 Σt=0 βt (ct1−α − 1) / (1 − α)

E0 = expectation at time 0

β = discount factor (how much you value future vs. today)

ct = consumption at time t

α = coefficient of relative risk aversion

Euler equation (pricing condition):

Et[β (ct+1/ct)−α Rt+1] = 1

Et = expectation given information at time t

Rt+1 = gross return on an asset (stocks or bonds)

Equity premium (with lognormal consumption growth):

ln(1 + premium) = α × σ²

premium = equity premium (extra return on stocks over bonds)

σ² = variance of log consumption growth

Assumptions

  • One representative agenti with power utilityi
  • α (risk aversion) between 0 and 10; β (discount factor) between 0 and 1
  • Complete marketsi
  • Frictionlessi trading
  • Consumption growth follows a two-state Markov chain that matches U.S. data (mean, variance, serial correlation)
  • Dividend growth is perfectly correlated with consumption growth

The setup

  • Pure exchange economy (Lucas tree)
  • Stocks = claim on the tree; bonds = risk-free claim

Where does the formula come from?

Click through each step to see how Mehra and Prescott derive the equity premium formula.

Step 1. The investor wants to maximize lifetime utility:

max E0 Σ βt U(ct)

They choose how much to consume now vs. save/invest for tomorrow.

Step 2. At the optimum, the investor is indifferent about buying one more unit of any asset. This gives the Euler equation:

U'(ct) = β Et[U'(ct+1) Rt+1]

The cost of giving up consumption today (left) must equal the expected benefit of investing and consuming tomorrow (right).

Step 3. Plug in power utility U(c) = c1-α/(1-α), so U'(c) = c:

ct = β Et[ct+1 Rt+1]

Divide both sides by ct:

1 = β Et[(ct+1/ct) Rt+1]

Step 4. Write this for both stocks (Rs) and bonds (Rb), subtract:

Et[(ct+1/ct) (Rst+1 - Rbt+1)] = 0

The equity premium must satisfy this condition.

Step 5. If consumption growth is lognormal, take logs and use the approximation:

E[Rs] - Rb ≈ α × Cov(Rs, Δ ln c)

With i.i.d. growth this simplifies to:

ln(1 + premium) = α × σ²

And that is the equity premium formula. With σ² ≈ 0.00125 and α ≤ 10, the right side is tiny. The puzzle.

The result

With plausible risk aversioni (α ≤ 10), the model gives at most 0.35% premium. Observed: 6.18%.

Consumptioni does not move enough. To justify 6%, people would need to be very risk-averse (α ≈ 50).

What Weil tried

Can we separate risk aversioni from intertemporal substitutioni? He used Kreps-Porteusi (or Epstein-Zini) preferences.

Main equations

Recursive utility (Kreps-Porteus / Epstein-Zin):

Ut = [(1−β)ct1−ρ + β(Et Ut+11−γ)(1−ρ)/(1−γ)]1/(1−ρ)

Ut = utility at time t

β = discount factor

ct = consumption at time t

γ = coefficient of relative risk aversion (separate from ρ)

ρ = parameter for intertemporal substitution (1/ρ = IES)

Et = expectation given information at time t

With i.i.d. consumption growth, the equity premium depends only on γ:

ln(1 + premium) = γ × σ²

premium = equity premium

γ = risk aversion (ρ does not appear)

σ² = variance of log consumption growth

Assumptions

  • Same economy as Mehra-Prescott (Lucas tree, Markov dividend process)
  • But: Kreps-Porteusi preferences instead of time-additive expected utility
  • Two separate parameters: γ (risk aversion) and ρ (intertemporal substitution)
  • Complete marketsi, frictionlessi trading
  • Same calibration as Mehra-Prescott (two-state Markov, U.S. data)

The result

With i.i.d.i growth, the equity premiumi depends only on risk aversion (γ), not on intertemporal substitutioni. So the fix does not work.

He also added the risk-free rate puzzlei: if people dislike consumption swings, why is the risk-free ratei so low?

Try it: Weil's two independent knobs

2.0
1.00
0.12%
Observed: 6.18%
0% Predicted equity premium 8%
Predicted EP
0.12%
needed: ~6.18%
Predicted risk-free rate
7.0%
observed: ~0.8%

Notice: moving the IES slider barely changes the equity premium -- it just makes the risk-free rate better or worse. The extra knob does not help.

What Kocherlakota did

He wrote a survey. He showed the puzzles are robust and come from only three assumptions. Any fix must relax at least one.

Main equations

Kocherlakota uses the same model as Mehra-Prescott. The key conditions are:

Equity premium condition:

E[(Ct+1/Ct)−α (Rt+1s − Rt+1b)] = 0

Ct = per capita consumption at time t

α = coefficient of relative risk aversion

Rt+1s = gross return on stocks

Rt+1b = gross return on bonds

E = unconditional expectation

Risk-free rate condition:

β E[(Ct+1/Ct)−α Rt+1b] = 1

β = discount factor

If these hold with per capita consumption, the puzzles emerge. Any model that relaxes power utility, complete markets, or frictionless trading can avoid them.

Assumptions

Kocherlakota showed the puzzles come from only three things. Any fix must change at least one:

  1. Power utilityi with plausible αi and βi
  2. Complete marketsi
  3. Frictionlessi trading

He showed the puzzles do not depend on the exact consumption process, sampling error, or the assumption that dividends equal consumption. Only these three matter.

Survey of fixes

In the decade after Mehra and Prescott, economists tried every route. Kocherlakota surveyed them all. Click each approach to see what it tried and why it fell short.

Badges show whether the approach can explain the equity premium (EP) and the risk-free rate (RF).

The idea: Separate risk aversion from intertemporal substitution, giving two independent knobs instead of one.

On the risk-free rate: It works. By setting IES independently, the model can match the low observed risk-free rate.

Why it fails on the EP: The equity premium is governed almost exclusively by risk aversion, regardless of IES. You still need implausibly high risk aversion to match the data.

The idea: Your happiness depends on what you consumed yesterday. If you drove a BMW last year, going back to a cheaper car feels painful. This amplifies risk aversion.

On the risk-free rate: Helps a lot. Since people are "hooked" on their current level, they save more, pushing the rate down.

Why it fails on the EP: When consumption growth is hard to predict (which it is), the extra risk aversion from habits does not transfer into a higher equity premium in the formal model.

The idea: People care about how they compare to everyone else. If everyone's consumption falls, you feel extra pain beyond just your own loss.

Why it falls short: Can match the EP, but requires an unrealistically extreme concern about relative standing. Also needs habit-like effects to explain the low risk-free rate simultaneously.

The idea: Real people cannot insure against everything -- job loss, health shocks. Individual consumption is more volatile than the aggregate.

Why it fails: In a long-lived economy, people smooth income shocks by saving during good years and spending during bad years (dynamic self-insurance). This works well enough that the incomplete-markets equilibrium looks very similar to the complete-markets one.

The idea: If people cannot borrow against future income, they hold extra safe assets as a buffer. This pushes the risk-free rate down.

Why it fails on the EP: Constrained investors are constrained in both markets. Both returns get pushed down together, leaving the spread roughly unchanged.

The idea: Stocks are more expensive to trade than bonds. Part of the premium compensates for these costs, not for risk.

Why it falls short: For a buy-and-hold investor, the annualized cost shrinks the longer they hold. Only dramatically higher stock trading costs would explain the gap, and there is little evidence for that.

The idea: Investors fear a small probability of catastrophe -- a Great Depression, a war. Even if these did not happen in the sample, the fear could justify a high EP.

Why it is unconvincing: In a disaster, the model predicts real interest rates should spike -- but historically they did not. Also, the disaster probability is a free parameter: you can match any EP by picking the right probability, making the theory untestable.

The idea: We study the U.S. because it survived and thrived. Markets that collapsed are not in the data.

Why it fails: In catastrophic episodes, bonds also suffered -- governments defaulted, hyperinflation wiped out bondholders. So the spread between stocks and bonds should not be dramatically affected by looking only at survivors.

Three papers at a glance

Feature Mehra & Prescott (1985) Weil (1989) Kocherlakota (1996)
Type Original paper Extension Survey
Utility Power utility (CRRA) Kreps-Porteus / Epstein-Zin Same as M&P; surveys alternatives
Key parameters α (risk aversion = 1/IES) γ (risk aversion), ρ (1/IES), independent α, β (same framework)
Main finding Model produces max 0.35% premium with plausible α Separating γ and ρ does not help the EP; discovers risk-free rate puzzle Puzzle is robust; comes from 3 assumptions; no fix fully works
Explains EP? No No No (surveys why)
Explains RF puzzle? Not addressed Partially (with IES) Some fixes help
Markets Complete, frictionless Complete, frictionless Surveys relaxations
Data period 1889-1978 Same as M&P Same + extended samples

More detail

Test your understanding

1. Why is the equity premium a "puzzle"?

The puzzle is that consumption-based models predict a much smaller equity premium than the 6.18% observed. With plausible risk aversion, the model tops out at about 0.35%.

2. What is the root cause of the puzzle?

Consumption barely moves year to year. Since the model prices risk based on how assets co-move with consumption, smooth consumption means stocks look almost as safe as bonds. A small covariance times reasonable risk aversion gives a small premium.

3. What did Weil (1989) discover?

With i.i.d. consumption growth, the equity premium depends only on risk aversion (γ), not on intertemporal substitution. The new parameter helps the risk-free rate but not the EP. Worse, he uncovered the risk-free rate puzzle: why do people save so much at such a low rate?

4. According to Kocherlakota, the puzzle comes from which three assumptions?

Kocherlakota showed the puzzle survives regardless of the consumption process, sample period, or dividend assumptions. It comes from just three things: (1) power utility with reasonable α and β, (2) complete markets, and (3) frictionless trading. Any fix must relax at least one.

5. Why does "habit formation" fail to fully solve the puzzle?

Habit formation amplifies sensitivity to consumption drops, but the formal result shows that when consumption growth is unpredictable (which it is in reality), the amplified risk aversion does not produce a larger equity premium in the model. It does help the risk-free rate, though.

Timeline

1985
Mehra & Prescott
Stocks earn 6% more than bonds. Our best model says they should not.
1989
Weil
Separating risk aversion from time preferences does not help. And now we have a second puzzle: why is the risk-free rate so low?
1996
Kocherlakota
After a decade of attempts, the equity premium remains unexplained. No proposed fix avoids implausible assumptions.
The core tension

Aggregate consumption is too smooth relative to stock returns for any reasonably risk-averse investor to demand a 6% premium for holding stocks over bonds. Every modification to preferences, market structure, or statistical assumptions tried over the first decade of research failed to convincingly change this arithmetic.