PlanetMath (more info)
 Math for the people, by the people.
Encyclopedia | Requests | Forums | Docs | Wiki | Random | RSS  
Login
create new user
name:
pass:
forget your password?
Main Menu
Owner confidence rating: Very high Entry average rating: Very high
exponential family (Definition)

A probability (density) function $ f_X(x\mid\theta)$ given a parameter $ \theta$ is said to belong to the (one parameter) exponential family of distributions if it can be written in one of the following two equivalent forms:

  1. $ a(x)b(\theta)\operatorname{exp}\big[ c(x)d(\theta)\big ]$
  2. $ \operatorname{exp}\big[ a(x)+b(\theta)+c(x)d(\theta) \big]$
where $ a,b,c,d$ are known functions. If $ c(x)=x$, then the distribution is said to be in canonical form. When the distribution is in canonical form, the function $ d(\theta)$ is called a natural parameter. Other parameters present in the distribution that are not of any interest, or that are already calculated in advance, are called nuisance parameters.

Examples:

  • The normal distribution, $ N(\mu,\sigma^2)$, treating $ \sigma^2$ as a nuisance parameter, belongs to the exponential family. To see this, take the natural logarithm of $ N(\mu,\sigma^2)$ to get
    $\displaystyle -\frac{1}{2}\operatorname{ln}(2\pi\sigma^2)-\frac{1}{2\sigma^2}(x-\mu)^2$
    Rearrange the above expression and we have
    $\displaystyle \frac{x\mu}{\sigma^2}-\frac{\mu^2}{2\sigma^2}-\frac{1}{2}\Big[\frac{x^2}{\sigma^2}+\operatorname{ln}(2\pi\sigma^2)\Big]$
    Set $ c(x)=x$, $ d(\mu)=\mu/\sigma^2$, $ b(\mu)=-\mu^2/(2\sigma^2)$, and $ a(x)=-1/2\big[x^2/\sigma^2+\operatorname{ln}(2\pi\sigma^2)\big]$. Then we see that $ N(\mu,\sigma^2)$ does indeed belong to the exponential family. Furthermore, it is in canonical form. The natural parameter is $ d(\mu)=\mu/\sigma^2$.
  • Similarly, the Poisson, binomial, Gamma, and inverse Gaussian distributions all belong to the exponential family and they are all in canonical form.
  • Lognormal and Weibull distributions also belong to the exponential family but they are not in canonical form.

Remarks

  • If the p.d.f of a random variable $ X$ belongs to an exponential family, and it is expressed in the second of the two above forms, then
    $\displaystyle \operatorname{E}[c(X)]=-\frac{b'(\theta)}{d'(\theta)},$ (1)

    and
    $\displaystyle \operatorname{Var}[c(X)]=\frac{d''(\theta)b'(\theta)-d'(\theta)b''(\theta)}{d'(\theta)^3},$ (2)

    provided that functions $ b$ and $ d$ are appropriately conditioned.
  • Given a member from the exponential family of distributions, we have $ \operatorname{E}[U]=0$ and $ I=-\operatorname{E}[U']$, where $ U$ is the score function and $ I$ the Fisher information. To see this, first observe that the log-likelihood function from a member of the exponential family of distributions is given by
    $\displaystyle \ell(\theta\mid x)=a(x)+b(\theta)+c(x)d(\theta),$
    and hence the score function is
    $\displaystyle U(\theta)=b'(\theta)+c(X)d'(\theta).$
    From (1), $ \operatorname{E}[U]=0$. Next, we obtain the Fisher information $ I$. By definition, we have
    $\displaystyle I$ $\displaystyle =$ $\displaystyle \operatorname{E}[U^2]-\operatorname{E}[U]^2$  
      $\displaystyle =$ $\displaystyle \operatorname{E}[U^2]$  
      $\displaystyle =$ $\displaystyle d'(\theta)^2\operatorname{Var}[c(X)]$  
      $\displaystyle =$ $\displaystyle \frac{d''(\theta)b'(\theta)-d'(\theta)b''(\theta)}{d'(\theta)}$  

    On the other hand,
    $\displaystyle \frac{\partial U}{\partial\theta}=b''(\theta)+c(X)d''(\theta)$
    so
    $\displaystyle \operatorname{E}\Big[\frac{\partial U}{\partial\theta}\Big]$ $\displaystyle =$ $\displaystyle b''(\theta)+\operatorname{E}[c(X)]d''(\theta)$  
      $\displaystyle =$ $\displaystyle b''(\theta)-\frac{b'(\theta)}{d'(\theta)}d''(\theta)$  
      $\displaystyle =$ $\displaystyle \frac{b''(\theta)d'(\theta)-b'(\theta)d''(\theta)}{d'(\theta)}$  
      $\displaystyle =$ $\displaystyle -I$  

  • For example, for a Poisson distribution
    $\displaystyle f_X(x\mid\theta) = \frac{\theta^x e^{-\theta}}{x!},$
    the natural parameter $ d(\theta)$ is $ \operatorname{ln}\theta$ and $ b(\theta)=-\theta$. $ c(x)=x$ since Poisson is in canonical form. Then
    $\displaystyle U(\theta)=-1+\frac{X}{\theta}$ and $\displaystyle I=-\operatorname{E}\Big[\frac{-X}{\theta^2}\Big]=\frac{1}{\theta}$
    as expected.



"exponential family" is owned by CWoo.
(view preamble)

View style:

Also defines:  canonical exponential family, nuisance parameter, natural parameter
Log in to rate this entry.
(view current ratings)

Cross-references: Poisson distribution, log-likelihood function, Fisher information, score function, random variable, Weibull distributions, inverse, binomial, expression, natural logarithm, normal distribution, canonical, equivalent, distributions, parameter, function, density
There are 4 references to this entry.

This is version 4 of exponential family, born on 2004-07-27, modified 2006-09-12.
Object id is 6039, canonical name is ExponentialFamily.
Accessed 17711 times total.

Classification:
AMS MSC62J12 (Statistics :: Linear inference, regression :: Generalized linear models)

Pending Errata and Addenda
None.
[ View all 2 ]
Discussion
Style: Expand: Order:
forum policy
From (1), E(U)=0? by yaroslavvb on 2004-08-11 15:36:06
I'm confused by the E(U)=0 derivation, in particular, which is the equation (1)?

[ reply | up ]

Interact
post | correct | update request | add derivation | add example | add (any)