PlanetMath (more info)
 Math for the people, by the people.
Encyclopedia | Requests | Forums | Docs | Wiki | Random | RSS  
Login
create new user
name:
pass:
forget your password?
Main Menu
Owner confidence rating: Very high Entry average rating: Very high
factorization criterion (Theorem)

Let $ \boldsymbol{X}=(X_1,\ldots,X_n)$ be a random vector whose coordinates are observations, and whose probability (density) function is, $ f(\boldsymbol{x}\mid\theta)$ where $ \theta$ is an unknown parameter. Then a statistic $ T(\boldsymbol{X})$ for $ \theta$ is a sufficient statistic iff $ f$ can be expressed as a product of (or factored into) two functions $ g,h$, $ f=gh$ where $ g$ is a function of $ T(\boldsymbol{X})$ and $ \theta$, and $ h$ is a function of $ \boldsymbol{x}$. In symbol, we have

$\displaystyle f(\boldsymbol{x}\mid\theta)=g(T(\boldsymbol{X}),\theta)h(\boldsymbol{x}).$

Applications.

  1. In view of the above statement, let's show that the sample mean $ \overline{X}$ of $ n$ independent observations from a normal distribution $ N(\mu,\sigma^2)$ is a sufficient statistic for the unknown mean $ \mu$. Since the $ X_i$'s are independent random variables, then the probability density function $ f(\boldsymbol{x}\mid\mu)$, being the joint probability density function of each of the $ X_i$, is the product of the individual density functions $ f(x\mid\mu)$:
    $\displaystyle f(\boldsymbol{x}\mid\mu)$ $\displaystyle =$ $\displaystyle \prod_{i=1}^n f(x\mid\mu)= \prod_{i=1}^n \frac{1}{\sqrt{2\pi\sigma^2}}\exp\Big[-\frac{(x_i-\mu)^2}{2\sigma^2}\Big]$ (1)
      $\displaystyle =$ $\displaystyle \frac{1}{\sqrt{(2\pi)^n\sigma^{2n}}}\exp\Big [\sum_{i=1}^{n}-\frac{(x_i-\mu)^2}{2\sigma^2}\Big]$ (2)
      $\displaystyle =$ $\displaystyle \frac{1}{\sqrt{(2\pi)^n\sigma^{2n}}}\exp\Big [\frac{-1}{2\sigma^2... ...ig] \exp\Big[\frac{\mu}{\sigma^2}\sum_{i=1}^n x_i-\frac{n\mu^2}{2\sigma^2}\Big]$ (3)
      $\displaystyle =$ $\displaystyle h(\boldsymbol{x}) \exp\Big[\frac{n\mu}{\sigma^2}T(\boldsymbol{x})-\frac{n\mu^2}{2\sigma^2}\Big]$ (4)
      $\displaystyle =$ $\displaystyle h(\boldsymbol{x}) g(T(\boldsymbol{x}),\mu)$ (5)

    where $ g$ is the last exponential expression and $ h$ is the rest of the expression in $ (3)$. By the factorization criterion, $ T(\boldsymbol{X})=\overline{X}$ is a sufficient statistic.
  2. Similarly, the above shows that the sample variance $ s^2$ is not a sufficient statistic for $ \sigma^2$ if $ \mu$ is unknown.
  3. But, if $ \mu$ is a known constant, then the statistic
    $\displaystyle T(X_1,\ldots,X_n)=\frac{1}{n-1}\sum_{i=1}^{n}(X_i-\mu)^2$
    is sufficient for $ \sigma^2$ by observing in $ (2)$ above, and letting $ h(\boldsymbol{x})=1$ and $ g(T,\sigma^2)$ be all of expression $ (2)$.



"factorization criterion" is owned by CWoo.
(view preamble)

View style:

Other names:  factorization theorem, Fisher-Neyman factorization theorem
Log in to rate this entry.
(view current ratings)

Cross-references: sufficient, sample variance, expression, exponential, probability density function, random variables, mean, normal distribution, independent, sample mean, applications, product, iff, sufficient statistic, statistic, parameter, function, density, observations, coordinates, random vector
There is 1 reference to this entry.

This is version 1 of factorization criterion, born on 2005-02-16.
Object id is 6761, canonical name is FactorizationCriterion.
Accessed 4213 times total.

Classification:
AMS MSC62B05 (Statistics :: Sufficiency and information :: Sufficient statistics and fields)

Pending Errata and Addenda
None.
Discussion
Style: Expand: Order:
forum policy

No messages.

Interact
post | correct | update request | prove | add result | add corollary | add example | add (any)