Branching Processes

A branching process models a population in which each individual independently produces a random number of offspring drawn from a fixed offspring distribution. The classic version is the Galton–Watson process, and it answers a sharp question: starting from a single ancestor, does the lineage die out or grow without bound?

The Galton–Watson process

Label the generations $0, 1, 2, \dots$ , starting with one individual in generation $0$ . Each individual, independently, leaves a random number of offspring $X$ with probability mass function $p_k = P(X=k)$ . Let $Z_n$ be the population size in generation $n$ , so $Z_0 = 1$ and each individual in generation $n$ founds the next generation.

The offspring distribution has a mean, the expected number of children per individual, $m = \mathbb{E}[X] = \sum_{k\ge 0} k\,p_k.$ This single number controls the qualitative fate of the process.

Expected growth

Because individuals reproduce independently, expectations multiply across generations. Conditioning on the previous generation gives $\mathbb{E}[Z_{n}\mid Z_{n-1}] = m\,Z_{n-1}$ , and taking expectations again yields $\mathbb{E}[Z_n] = m^{\,n}.$ The expected population size grows or shrinks geometrically at rate $m$ .

In the epidemic reading, one individual is one infection and its “offspring” are the people it infects, so $m$ is exactly the basic reproduction number $R_0$ . The mean chain then grows like $R_0^{\,n}$ , the same quantity that the next-generation matrix computes for structured populations and that drives the early exponential phase of an SIR outbreak.

The probability generating function

The natural bookkeeping tool for offspring counts is the probability generating function (PGF) $G(s) = \mathbb{E}[s^{X}] = \sum_{k\ge 0} p_k s^{k}, \qquad 0\le s\le 1.$ It packages the whole offspring distribution into one function, much as the moment generating function does for continuous variables. Its derivative at $s=1$ recovers the mean, $G'(1) = m$ , and composing $G$ with itself tracks successive generations.

Extinction

Let $q$ be the probability of ultimate extinction — that the lineage eventually reaches size $0$ . Extinction of the whole process happens exactly when each of the first generation’s sub-lineages goes extinct, and those are independent copies of the original. This self-similarity gives the fixed-point equation $q = G(q).$ The extinction probability is the smallest solution of $s = G(s)$ in the interval $[0,1]$ .

Classifying by the mean $m$ :

Subcritical ( $m<1$ ): the only fixed point up to $1$ forces $q=1$ ; extinction is certain.
Critical ( $m=1$ ): still $q=1$ ; the process dies out with probability one (though it can drift large first).
Supercritical ( $m>1$ ): there is a fixed point $q<1$ , so the process survives with positive probability $1-q$ .

The gap between the critical case and genuine growth is why small populations can vanish by chance even when conditions favor increase — the same stochastic fragility seen in genetic drift.

Outbreaks: will an introduction take off?

Branching processes describe the early, stochastic phase of an outbreak, when the susceptible pool is still effectively unlimited. A single imported case starts one lineage of infections; either it fizzles out or it seeds a major outbreak.

A common and tractable choice is a Poisson offspring distribution with mean $R_0$ (see the Poisson distribution), whose PGF is $G(s) = e^{R_0(s-1)}$ . The extinction probability solves $s = e^{R_0(s-1)},$ and the probability of a major outbreak is $1-q$ . When $R_0 \le 1$ every introduction eventually dies out; only $R_0>1$ gives a genuine chance of takeoff.

Worked example

Take a Poisson offspring distribution with $R_0 = 2$ , so we solve $s = e^{2(s-1)}.$ One root is $s=1$ , but we want the smallest root in $[0,1]$ . Iterating $s_{k+1} = e^{2(s_k-1)}$ from $s_0 = 0$ gives $s_1 = e^{-2}\approx 0.135$ , then $0.176$ , $0.194$ , $0.200$ , converging to $q \approx 0.203.$ So the probability of extinction from a single introduction is about $0.203$ , and the probability of a major outbreak is $1-q \approx 0.797.$ Even with $R_0=2$ , roughly one in five introductions fizzles out purely by chance.

In code

R

# Extinction probability by fixed-point iteration: s = G(s)
G <- function(s, R0 = 2) exp(R0 * (s - 1))   # Poisson offspring PGF
q <- 0
for (i in 1:100) q <- G(q)
q            # ~ 0.2032
1 - q        # ~ 0.7968  (major-outbreak probability)

# Simulate branching trees and estimate extinction empirically
set.seed(1)
sim_extinct <- function(R0 = 2, gens = 40) {
  z <- 1
  for (g in 1:gens) {
    z <- sum(rpois(z, R0))   # each individual has Poisson(R0) offspring
    if (z == 0) return(TRUE) # extinct
  }
  FALSE                      # still alive -> treat as survival
}
mean(replicate(10000, sim_extinct()))  # ~ 0.20, matches q

Python

import numpy as np

def G(s, R0=2.0):        # Poisson offspring PGF
    return np.exp(R0 * (s - 1))

q = 0.0
for _ in range(100):
    q = G(q)
print(q, 1 - q)          # ~ 0.2032  0.7968

rng = np.random.default_rng(1)
def sim_extinct(R0=2.0, gens=40):
    z = 1
    for _ in range(gens):
        z = rng.poisson(R0, size=z).sum()  # offspring of current generation
        if z == 0:
            return True
    return False
print(np.mean([sim_extinct() for _ in range(10000)]))  # ~ 0.20

Julia

using Distributions, Statistics, Random

G(s; R0=2.0) = exp(R0 * (s - 1))     # Poisson offspring PGF
q = 0.0
for _ in 1:100
    q = G(q)
end
println((q, 1 - q))                  # ~ (0.2032, 0.7968)

Random.seed!(1)
function sim_extinct(; R0=2.0, gens=40)
    z = 1
    for _ in 1:gens
        z = sum(rand(Poisson(R0), z))
        z == 0 && return true
    end
    false
end
println(mean(sim_extinct() for _ in 1:10000))  # ~ 0.20

Evolutionary emergence: the Antia–Bergstrom model

A single number $R_0$ hides a subtler question for a newly introduced pathogen: what if the strain that spills over is poorly adapted to the new host ( $R_0 < 1$ ) but can evolve as it spreads? Antia, Regoes, Koella & Bergstrom (2003) answered this with a multi-type branching process, and it is a beautiful application of everything above.

Model each case’s secondary infections as a binomial offspring distribution: an infected host contacts $n$ others and infects each independently with probability $p$ , so offspring $\sim \text{Binomial}(n, p)$ with mean $R = np$ and PGF $G(s) = (1 - p + p s)^{n}$ . The introduced wildtype is subcritical, $R_w = n p_w < 1$ , so on its own it always dies out. But at each transmission the pathogen mutates with small probability $\mu$ to an adapted mutant with $R_m = n p_m > 1$ . Emergence is the event that an adapted lineage establishes before the wildtype chain burns out.

Two branching-process facts combine to give the answer. First, a single mutant establishes with probability $\pi = 1 - q_m$ , where $q_m$ is the extinction probability of the mutant process — the smallest root of $q = (1 - p_m + p_m q)^{n}$ . Second, a subcritical wildtype outbreak seeded by one case produces, in expectation, $\dfrac{R_w}{1 - R_w}$ secondary infections in total (the geometric sum of $R_w^{\,k}$ ). Each of those transmissions throws off a mutant with probability $\mu$ , so the expected number of established mutant lineages is $\mu\,\dfrac{R_w}{1-R_w}\,\pi$ , and — treating rare mutant lineages as independent — the probability of emergence is

$P_{\text{emerge}} \;\approx\; 1 - \exp\!\left(-\,\mu\,\frac{R_w}{1-R_w}\,\pi\right).$

The headline result is in the factor $\dfrac{R_w}{1-R_w}$ : as $R_w \to 1^{-}$ it blows up, so a pathogen that almost spreads lingers long enough to give evolution many attempts at rescue. A spillover strain with $R_0$ just below $1$ is therefore far more dangerous than its subcritical label suggests — the central public-health message of the paper.

Worked example

Take $n = 10$ contacts, wildtype $p_w = 0.09$ (so $R_w = 0.9$ ), mutant $p_m = 0.15$ (so $R_m = 1.5$ ), and mutation probability $\mu = 10^{-3}$ . Solving $q = (0.85 + 0.15\,q)^{10}$ gives $q_m \approx 0.37$ , so a mutant establishes with probability $\pi \approx 0.63$ . The wildtype throws off $R_w/(1-R_w) = 0.9/0.1 = 9$ secondary infections on average, so $P_{\text{emerge}} \approx 1 - e^{-10^{-3}\cdot 9 \cdot 0.63} \approx 0.0057$ . Push the wildtype closer to threshold, $R_w = 0.99$ : now it throws off $99$ infections and $P_{\text{emerge}} \approx 1 - e^{-0.0624} \approx 0.061$ — a tenfold jump in emergence risk from the same mutation rate.

n <- 10; pw <- 0.09; pm <- 0.15; mu <- 1e-3
Rw <- n * pw; Rm <- n * pm                 # 0.9 (subcritical), 1.5 (supercritical)

# Mutant establishment probability: smallest root of q = (1 - pm + pm q)^n
q <- 0; for (i in 1:1000) q <- (1 - pm + pm * q)^n
pi_est <- 1 - q                            # ~0.63

# Rare-mutation emergence probability
P_emerge <- 1 - exp(-mu * Rw / (1 - Rw) * pi_est)   # ~0.0057
c(pi_est = pi_est, P_emerge = P_emerge)

# Monte Carlo check: two-type binomial branching process
set.seed(1)
establishes <- function(p, gens = 60) {        # does one mutant lineage survive?
  z <- 1
  for (g in 1:gens) { z <- sum(rbinom(z, n, p)); if (z == 0) return(FALSE) }
  TRUE
}
emerge_once <- function() {
  z <- 1
  for (g in 1:300) {
    if (z == 0) return(FALSE)                   # wildtype burned out, no emergence
    kids <- sum(rbinom(z, n, pw))               # wildtype secondary infections
    n_mut <- rbinom(1, kids, mu)                # how many mutated
    if (n_mut > 0 && any(replicate(n_mut, establishes(pm)))) return(TRUE)
    z <- kids - n_mut
  }
  FALSE
}
mean(replicate(20000, emerge_once()))           # ~0.006, matches the formula

import numpy as np
n, pw, pm, mu = 10, 0.09, 0.15, 1e-3
Rw = n * pw
q = 0.0
for _ in range(1000):
    q = (1 - pm + pm * q) ** n        # mutant extinction prob
pi_est = 1 - q                        # ~0.63
P_emerge = 1 - np.exp(-mu * Rw / (1 - Rw) * pi_est)   # ~0.0057
print(pi_est, P_emerge)               # the R Monte-Carlo check translates directly

0.6284894704844379 0.005640437894373074

n, pw, pm, mu = 10, 0.09, 0.15, 1e-3
Rw = n * pw
q = 0.0
for _ in 1:1000
    q = (1 - pm + pm * q)^n           # mutant extinction prob
end
pi_est = 1 - q                        # ~0.63
P_emerge = 1 - exp(-mu * Rw / (1 - Rw) * pi_est)   # ~0.0057
println((pi_est, P_emerge))

Why it matters

Branching processes turn a vague worry — “could this spread?” — into a precise probability, separating the deterministic message of $R_0$ from the luck of small numbers. They explain why an outbreak with $R_0>1$ can still fail to establish, why small populations wink out despite favorable growth, and how genealogies and family names go extinct. The same machinery underlies nuclear chain reactions, PCR amplification, surname survival, and the founding dynamics of new mutations.

Branching Processes

The Galton–Watson process

Expected growth

The probability generating function

Extinction

Outbreaks: will an introduction take off?

Worked example

In code

R

Python

Julia

Evolutionary emergence: the Antia–Bergstrom model

Worked example

Why it matters

Related