PlanetMath (more info)
 Math for the people, by the people.
Encyclopedia | Requests | Forums | Docs | Wiki | Random | RSS  
Login
create new user
name:
pass:
forget your password?
Main Menu
Owner confidence rating: Very high Entry average rating: Very high
factorization criterion (Theorem)

Let $ \boldsymbol{X}=(X_1,\ldots,X_n)$ be a random vector whose coordinates are observations, and whose probability (density) function is, $ f(\boldsymbol{x}\mid\theta)$ where $ \theta$ is an unknown parameter. Then a statistic $ T(\boldsymbol{X})$ for $ \theta$ is a sufficient statistic iff $ f$ can be expressed as a product of (or factored into) two functions $ g,h$, $ f=gh$ where $ g$ is a function of $ T(\boldsymbol{X})$ and $ \theta$, and $ h$ is a function of $ \boldsymbol{x}$. In symbol, we have

$\displaystyle f(\boldsymbol{x}\mid\theta)=g(T(\boldsymbol{X}),\theta)h(\boldsymbol{x}).$

Applications.

  1. In view of the above statement, let's show that the sample mean $ \overline{X}$ of $ n$ independent observations from a normal distribution $ N(\mu,\sigma^2)$ is a sufficient statistic for the unknown mean $ \mu$. Since the $ X_i$'s are independent random variables, then the probability density function $ f(\boldsymbol{x}\mid\mu)$, being the joint probability density function of each of the $ X_i$, is the product of the individual density functions $ f(x\mid\mu)$:
    $\displaystyle f(\boldsymbol{x}\mid\mu)$ $\displaystyle =$ $\displaystyle \prod_{i=1}^n f(x\mid\mu)= \prod_{i=1}^n \frac{1}{\sqrt{2\pi\sigma^2}}\exp\Big[-\frac{(x_i-\mu)^2}{2\sigma^2}\Big]$ (1)
      $\displaystyle =$ $\displaystyle \frac{1}{\sqrt{(2\pi)^n\sigma^{2n}}}\exp\Big [\sum_{i=1}^{n}-\frac{(x_i-\mu)^2}{2\sigma^2}\Big]$ (2)
      $\displaystyle =$ $\displaystyle \frac{1}{\sqrt{(2\pi)^n\sigma^{2n}}}\exp\Big [\frac{-1}{2\sigma^2... ...ig] \exp\Big[\frac{\mu}{\sigma^2}\sum_{i=1}^n x_i-\frac{n\mu^2}{2\sigma^2}\Big]$ (3)
      $\displaystyle =$ $\displaystyle h(\boldsymbol{x}) \exp\Big[\frac{n\mu}{\sigma^2}T(\boldsymbol{x})-\frac{n\mu^2}{2\sigma^2}\Big]$ (4)
      $\displaystyle =$ $\displaystyle h(\boldsymbol{x}) g(T(\boldsymbol{x}),\mu)$ (5)

    where $ g$ is the last exponential expression and $ h$ is the rest of the expression in $ (3)$. By the factorization criterion, $ T(\boldsymbol{X})=\overline{X}$ is a sufficient statistic.
  2. Similarly, the above shows that the sample variance $ s^2$ is not a sufficient statistic for $ \sigma^2$ if $ \mu$ is unknown.
  3. But, if $ \mu$ is a known constant, then the statistic
    $\displaystyle T(X_1,\ldots,X_n)=\frac{1}{n-1}\sum_{i=1}^{n}(X_i-\mu)^2$
    is sufficient for $ \sigma^2$ by observing in $ (2)$ above, and letting $ h(\boldsymbol{x})=1$ and $ g(T,\sigma^2)$ be all of expression $ (2)$.



"factorization criterion" is owned by CWoo.
(view preamble)

View style:

Other names:  factorization theorem, Fisher-Neyman factorization theorem
Log in to rate this entry.
(view current ratings)

Cross-references: sufficient, sample variance, expression, exponential, density functions, probability density function, random variables, mean, normal distribution, independent, sample mean, product, iff, sufficient statistic, statistic, parameter, function, density, observations, coordinates, random vector
There is 1 reference to this entry.

This is version 1 of factorization criterion, born on 2005-02-16.
Object id is 6761, canonical name is FactorizationCriterion.
Accessed 4047 times total.

Classification:
AMS MSC62B05 (Statistics :: Sufficiency and information :: Sufficient statistics and fields)

Pending Errata and Addenda
None.
Discussion
Style: Expand: Order:
forum policy

No messages.

Interact
post | correct | update request | prove | add result | add corollary | add example | add (any)