Sum of independent but differently distributed variables
It's well known that a binomial random variable can be approximated by a Poisson random variable, and under what circumstances the approximation is particularly good. See, for example, this post.
A binomial random variable is the sum of iid (independent, identically distributed) Bernoulli random variables. But what if the Bernoulli random variables don't have the same distribution. That is, suppose you're counting the number of heads seen in flipping n coins, where each coin has a potentially different probability of coming up heads. Will a Poisson approximation still work?
This post will cite three theorems on the error in approximating a sum of n independent Bernoulli random variables, each with a different probability of success pi. I'll state each theorem and very briefly discuss its advantages. The theorems can be found in [1].
SetupFor i = 1, 2, 3, ..., n let Xi be Bernoulli random variables with
Prob(Xi = 1) = pi
and let X with no subscript be their sum:
X = X1 + X2 + X3 + ... + Xn
We want to approximate the distribution of X with a Poisson distribution with parameter . We will measure the error in the Poisson approximation by the maximum difference between the mass density function for X and the mass density function for a Poisson() random variable.
Sum of psWe consider two ways to choose . The first is
= p1 + p2 + p3 + ... + pn.
For this choice we have two different theorems that give upper bounds on the approximation error. One says that the error is bounded by the sum of the squares of the ps
p1^2 + p2^2 + p3^2 + ... + pn^2
and the other says it is bounded by 9 times the maximum of the ps
9 max(p1, p2, p3, ..., pn).
The sum of squares bound will be smaller when n is small and the maximum bound will be smaller when n is large.
Sum of transformed psThe second way to choose is
= 1 + 2 + 3 + ... + n
where
i = -log(1 - pi).
In this case the bound on the error is one half the sum of the squared 's:
(1^2 + 2^2 + 3^2 + ... + n^2)/2.
When pi is small, i pi. In this case the error bound for the transformed Poisson approximation will be about half that of the one above.
Related posts- Normal approximation to binomial
- Camp-Paulson approximation to binomial
- Relative error in normal approximations
[1] R. J. Serfling. Some Elementary Results on Poisson Approximation in a Sequence of Bernoulli Trials. SIAM Review, Vol. 20, No. 3 (July, 1978), pp. 567-579.
The post Sum of independent but differently distributed variables first appeared on John D. Cook.