The Kelly Criterion for Strategies: How to Size Positions and Allocate Capital

A strategy with positive expected value can still blow up your account if you get the bet size wrong. We walk through the Kelly criterion from deriving the formula to a portfolio of strategies: why full Kelly is dangerous, how fractional Kelly captures 75% of the growth at half the volatility, and what sizing recipe you should actually apply in algorithmic trading. Midway through the article there is an interactive calculator that shows how the Kelly fraction moves return and risk.

The question every strategy is obliged to answer

You have a strategy with a positive edge: over the long run it makes money. One detail remains — what fraction of capital to put into a single trade or to allocate to a single strategy.

This is not a secondary question but the central one. Positive expected value does not save you from ruin: bet too much, and a losing streak drives your account into a zone from which there is no statistical recovery (see The asymmetry of losses and gains). Bet too little, and you leave most of the potential growth on the table.

The Kelly criterion gives a precise answer: it is the fraction of capital that maximizes the long-term growth rate — the geometric one, not the arithmetic one. It is geometric growth that determines where your account ends up after a thousand trades, because returns multiply rather than add (see The multiplicative nature of returns).

Where the formula comes from: maximizing the logarithm of capital

The key idea of Kelly (1956) and later Thorp: what you should optimize is not the expected profit of a single trade but the expected logarithm of your final capital. The logarithm does not appear by accident — it is the only function whose maximization makes capital grow at the maximum geometric rate.

The binary case: a bet with two outcomes

Suppose that with probability $p$ the bet returns a net payout of $b$ per unit (the odds), and with probability $q = 1 - p$ we lose the stake itself. We bet a fraction $f$ of our capital. After one trade the capital is multiplied by $(1 + fb)$ on a win and by $(1 - f)$ on a loss.

Expected log growth:

$g(f) = p \ln(1 + fb) + q \ln(1 - f)$

Take the derivative with respect to $f$ and set it to zero:

$g'(f) = \frac{pb}{1 + fb} - \frac{q}{1 - f} = 0$

The solution is the Kelly formula:

$f^{*} = \frac{pb - q}{b} = p - \frac{q}{b}$

In words: the optimal fraction equals your edge divided by the odds. No edge ( $pb = q$ ) — no bet.

Example

A strategy wins 55% of trades with a 1:1 payout ratio ( $b = 1$ ):

$f^{*} = p - \frac{q}{b} = 0.55 - \frac{0.45}{1} = 0.10$

Full Kelly tells you to risk 10% of capital per trade. Remember this number — below we will see why almost no one should bet exactly that much.

The continuous case: returns instead of bets

In trading a trade rarely looks like a bet with two outcomes — there is a distribution of returns. For returns with mean $\mu$ and variance $\sigma^2$ per period, the expected log growth at leverage (fraction) $f$ is approximately:

$g(f) \approx f\mu - \tfrac{1}{2} f^{2}\sigma^{2}$

The maximum is reached at:

$\boxed{\,f^{*} = \dfrac{\mu}{\sigma^{2}}\,}$

This is the famous continuous form of Kelly (also known as the Merton fraction). And the growth rate at the optimum is tied to the Sharpe ratio $\mathrm{SR} = \mu / \sigma$ in a remarkably elegant way:

$g(f^{*}) = \frac{\mu^{2}}{2\sigma^{2}} = \frac{\mathrm{SR}^{2}}{2}$

A conclusion worth pinning to the wall: the maximum geometric growth rate of a portfolio equals half the square of its Sharpe. Double the Sharpe, and you quadruple the growth rate of capital.

Why full Kelly is too much

The formula gives the mathematical optimum for growth. But this optimum has a price the formula stays silent about: monstrous path volatility and drawdowns that break any real account and any real person.

The geometry of deviating from the optimum

Substitute the fraction $c \cdot f^{*}$ (where $c$ is the Kelly multiplier: $c = 1$ is full Kelly, $c = 0.5$ is half Kelly) into the growth formula. Relative to the maximum we get:

$\frac{g(c)}{g(f^{*})} = 2c - c^{2}$

$Return versus the Kelly fraction: a growth parabola peaking at the optimum and dropping off a cliff beyond it$

This parabola tells the entire story of risk management in a single line:

Multiplier $c$	Share of growth $2c - c^2$	Volatility	Comment
0.25 (quarter)	43.8%	25%	almost half the growth at a quarter of the risk
0.50 (half)	75.0%	50%	the practitioners' sweet spot
1.00 (full)	100.0%	100%	maximum growth, wild volatility
1.50	75.0%	150%	the same growth as half, but triple the risk
2.00 (double)	0.0%	200%	no growth, maximum risk
> 2.00	negative	—	ruin despite a positive edge

Three takeaways:

half Kelly captures 75% of the growth at half the volatility. On a risk/return basis this is far better than full Kelly.
The parabola is symmetric about $c = 1$ . Betting $1.5\times$ Kelly gives the same growth as $0.5\times$ , but is three times more volatile. Overshooting is punished more harshly than undershooting.
At $2\times$ Kelly growth goes to zero, and beyond that it turns negative. Sizing that is too aggressive kills capital even for a winning strategy.

Full Kelly drawdowns: the formula that sobers you up

Drawdown risk grows with bet size: calm curves versus deep plunges

For the continuous model, the probability that capital will ever fall to a fraction $\alpha$ of its starting value equals:

$P(\text{drawdown to } \alpha) = \alpha^{\frac{2 - c}{c}}$

Plug in the Kelly levels — and you get a table after which full Kelly stops looking attractive:

Multiplier $c$	P(ever −50%)	P(ever −75%)
1.00 (full)	50%	75%
0.50 (half)	12.5%	1.6%
0.25 (quarter)	0.78%	0.006%
2.00 (double)	100%	100%

Under full Kelly the probability of ever seeing a 50% drawdown equals 50%. This is not a tail scenario — it is a coin flip. half Kelly knocks it down to 12.5%, quarter Kelly to a fraction of a percent. And that is in an idealized Gaussian model; real fat tails make actual drawdowns even deeper.

The calculator: move the Kelly fraction — watch return and risk

Move the sliders. The top two set the strategy's edge (win probability and payout ratio), the bottom one sets the Kelly multiplier $c$ . Watch how the growth rate of capital and the probability of a deep drawdown change at the same time. Pay attention to the article's main plot: as you move from half to full Kelly, growth rises a little, while drawdown risk rises several-fold.

Kelly Calculator

Trade win probability55%

Payout ratio (win : loss)1.0

Kelly fraction (multiplier c)0.50x

Move the sliders.

10.0%

Full Kelly f*

5.0%

Your bet c·f*

75%

Growth vs maximum

x43

Capital over 1000 trades

0.50x

Volatility

13%

Drawdown risk ≥50%

Drawdown risk ≥75%

+0.375%

Log growth per trade

c = 0optimum (full Kelly)c = 2: growth goes to zero

Play with the Kelly fraction slider around the values 0.5 and 1.0: the growth versus maximum rises from 75% to 100%, but the risk of a 50% drawdown jumps from 13% to 50%. This is exactly why professionals live in the left half of the parabola.

Fractional Kelly as the industry standard

Serious managers almost never bet full Kelly. The typical range is from $1/4$ to $1/2$ Kelly. Beyond drawdowns there are four fundamental reasons to cut the fraction.

1. Parameter estimation error. The formula assumes you know the true $p$ , $b$ , $\mu$ , $\sigma$ . In reality you estimate them from a finite sample. And the growth function is asymmetric: overestimating the edge pushes you past the optimum, where growth falls faster than it rises under the same underestimation. If the true Kelly equals $f^{*}$ and you estimated it with error, it is safer to systematically bet less. Rule of thumb: when the estimate is uncertain, halve the fraction.

2. Non-stationarity. A strategy's edge is not a constant — market regimes change, the edge decays, competitors copy the idea. The Kelly computed on yesterday's data may turn out to be overstated tomorrow. A fractional multiplier is a cushion against edge decay.

3. Fat tails. The Gaussian formula $f^{*} = \mu/\sigma^2$ underestimates the risk of extreme moves. On real heavy-tailed distributions it systematically over-bets. Fractional Kelly partially compensates for this.

4. The cost of drawdowns in a business. For a market maker a drawdown is not just psychology. It is margin calls, forced position reductions at the worst moment, investor capital outflows, a rising cost of funding (see How funding rates kill leverage). A smooth equity curve has a standalone value that is absent from the Kelly formula.

Kelly for a portfolio of strategies

Matrix Kelly for a portfolio: the covariance matrix allocates capital across strategies

So far we have talked about a single strategy. But the real question is choosing Kelly for strategies in the plural: you have several strategies, and you need to allocate capital among them.

The naive approach — compute Kelly for each one separately and add them up — is catastrophically wrong, because it ignores correlations. Two strongly correlated strategies are essentially one doubled bet, and the total risk must be counted as for a single one.

The matrix form

For a vector of expected returns $\boldsymbol{\mu}$ and a covariance matrix $\boldsymbol{\Sigma}$ , the optimal vector of fractions across all strategies at once:

$\boxed{\,\mathbf{f}^{*} = \boldsymbol{\Sigma}^{-1} \boldsymbol{\mu}\,}$

The growth rate at the optimum generalizes to the square of the portfolio Sharpe:

$g(\mathbf{f}^{*}) = \tfrac{1}{2}\, \boldsymbol{\mu}^{\top} \boldsymbol{\Sigma}^{-1} \boldsymbol{\mu} = \tfrac{1}{2}\,\mathrm{SR}_{\text{portfolio}}^{2}$

Notice: $\boldsymbol{\Sigma}^{-1}\boldsymbol{\mu}$ is the direction of the maximum-Sharpe portfolio (the tangency portfolio). Kelly and mean-variance optimization are two sides of the same coin: Kelly simply fixes the leverage level at the one that maximizes geometric growth.

What the inverse covariance does

Correlated strategies share the risk budget. If two strategies are almost identical, the matrix will trim their combined fraction down to the level of one — automatically, with no manual hacks.
Uncorrelated strategies get a diversification premium. They can be held closer to their individual sizes, and the portfolio's total Sharpe will be higher than each one separately.
Negatively correlated strategies may get increased fractions — they hedge each other, and the matrix rewards that.

A warning about estimating $\boldsymbol{\Sigma}$

Inverting a covariance matrix is numerically unstable: on noisy estimates $\boldsymbol{\Sigma}^{-1}$ inflates small errors into wild weights. Shrinkage of the covariance toward the diagonal, leverage caps, and the same fractional multiplier are mandatory. Without them, matrix Kelly produces fractions that look beautiful in the backtest and lethal in production.

Adjustments for algorithmic trading and market making

The pure formula lives in a sterile world. In a real engine you need adjustments.

Commissions and slippage reduce the effective edge. Compute Kelly on returns after all costs, otherwise you will systematically over-bet.
Bet discreteness and lot sizing prevent you from betting exactly $f^{*}$ — round down, not up.
A non-stationary edge requires recomputation over time: estimates of $\mu$ and $\sigma$ on a rolling window with a half-life, not over the entire history.
A drawdown constraint. On top of Kelly, set a hard ceiling: maximum drawdown, maximum leverage, maximum fraction per single strategy. Formal drawdown-constrained versions of Kelly exist (Busseti, Boyd), but in practice a simple cap is enough.
Honesty in estimating the edge. The main mistake is feeding Kelly an edge measured in-sample. Take only an out-of-sample estimate, on walk-forward, after commissions. An overstated edge going into the formula means an overstated bet coming out.

The recipe: how to apply this

Estimate the edge honestly. Out-of-sample, on walk-forward, after commissions and slippage. This is the most important step — garbage in becomes ruin out.
Compute full Kelly. Binary $f^{*} = p - q/b$ for discrete outcomes or continuous $f^{*} = \mu/\sigma^2$ for returns.
Take a fraction. By default $1/4$ – $1/2$ of full Kelly. The less robust the edge, the smaller the multiplier.
For a portfolio, compute it in matrix form. $\mathbf{f}^{*} = \boldsymbol{\Sigma}^{-1}\boldsymbol{\mu}$ with covariance shrinkage, then the same fractional multiplier.
Set hard caps. Maximum per position, maximum leverage, a drawdown limit — on top of everything.
Recompute. As the estimates update, cut the fraction when uncertainty grows and the edge decays.

Code

import numpy as np

def kelly_binary(p, b):
    """p — win probability, b — net payout ratio per 1 unit of stake."""
    q = 1 - p
    return (b * p - q) / b          # = p - q/b

def kelly_continuous(mu, sigma):
    """mu, sigma — mean and standard deviation of period returns (in the same units)."""
    return mu / sigma ** 2

def kelly_portfolio(mu, cov, shrink=0.0):
    """Matrix Kelly for a portfolio of strategies.
    mu  — vector of expected returns;
    cov — covariance matrix of returns;
    shrink — shrinkage coefficient toward the diagonal (0..1) for stable inversion."""
    cov = np.asarray(cov, float)
    if shrink:
        cov = (1 - shrink) * cov + shrink * np.diag(np.diag(cov))
    return np.linalg.solve(cov, np.asarray(mu, float))

def sized(f_star, kelly_fraction=0.25, cap=0.2):
    """Fractional Kelly with a hard cap on the fraction."""
    return float(np.clip(f_star * kelly_fraction, -cap, cap))

f = kelly_binary(p=0.55, b=1.0)    # 0.10 — full Kelly
print(sized(f))                    # 0.025 — quarter Kelly, a safe size

Common mistakes

Kelly on an in-sample edge. The most expensive mistake. An overstated edge → over-betting → negative growth.
Ignoring correlations between strategies. The sum of individual Kellys is not the portfolio Kelly.
Full Kelly in production. The mathematical maximum of growth, but a 50% chance of a drawdown to half the account. It fits almost no one.
Kelly with an unstable edge. If the edge decays, the formula systematically overstates the bet.
Mixing up objectives. Kelly maximizes the geometric growth of capital — not Sharpe, not the probability of staying in the black, and not comfort. If a smooth curve matters more to you, deliberately bet fractional Kelly.

Bottom line

The Kelly criterion answers the central question of any strategy — how much to bet — with a formula that maximizes long-term geometric growth: $f^{*} = p - q/b$ for bets and $f^{*} = \mu/\sigma^2$ for returns, and for a portfolio — $\mathbf{f}^{*} = \boldsymbol{\Sigma}^{-1}\boldsymbol{\mu}$ .

But full Kelly is the mathematical optimum for growth, not for survival. Fractional Kelly ( $1/4$ – $1/2$ ) captures most of the growth at several times lower volatility and drawdowns. So the practical answer to the question of choosing Kelly for strategies sounds like this: compute full Kelly honestly — but bet a fraction of it, on top of hard risk limits.

Related material:

The Kelly Criterion for Strategies: How to Size Positions and Allocate Capital

The question every strategy is obliged to answer

Where the formula comes from: maximizing the logarithm of capital

The binary case: a bet with two outcomes

Example

The continuous case: returns instead of bets

Why full Kelly is too much

The geometry of deviating from the optimum

Full Kelly drawdowns: the formula that sobers you up

The calculator: move the Kelly fraction — watch return and risk

Fractional Kelly as the industry standard

Kelly for a portfolio of strategies

The matrix form

What the inverse covariance does

A warning about estimating $\boldsymbol{\Sigma}$

Adjustments for algorithmic trading and market making

The recipe: how to apply this

Code

Common mistakes

Bottom line

Auteurs

Lees meer

Loss-Profit Asymmetry: The Math That Kills Your Deposit

Markowitz Portfolio Theory for Crypto: From Zero to Hero

Inside Our House Algorithm: HRP + Long/Short + CVaR with Hull-White

The question every strategy is obliged to answer

Where the formula comes from: maximizing the logarithm of capital

The binary case: a bet with two outcomes

Example

The continuous case: returns instead of bets

Why full Kelly is too much

The geometry of deviating from the optimum

Full Kelly drawdowns: the formula that sobers you up

The calculator: move the Kelly fraction — watch return and risk

Fractional Kelly as the industry standard

Kelly for a portfolio of strategies

The matrix form

What the inverse covariance does

A warning about estimating Σ\boldsymbol{\Sigma}Σ

Adjustments for algorithmic trading and market making

The recipe: how to apply this

Code

Common mistakes

Bottom line

Auteurs

Lees meer

Loss-Profit Asymmetry: The Math That Kills Your Deposit

Markowitz Portfolio Theory for Crypto: From Zero to Hero

Inside Our House Algorithm: HRP + Long/Short + CVaR with Hull-White

Blijf de markt voor

Gelukt!

Sign In

A warning about estimating $\boldsymbol{\Sigma}$