Login
This is a place holder for potential sponsor logos.
stratified sampling
In sampling surveys, it is sometimes a good idea to break up the population into subdivisions before any sampling were to take place. For example, from a population $U$ of automobile insurance policies, the claim frequencies (loosely speaking, this is the ratio of the number of claims to the number of policies in the $U$ ) are found in the following table:
| male drivers | female drivers | all drivers |
| $10\ \hline \end{tabular} \end{center} Suppose that we would like to take a sample from $ U$ so that when the sample units are divided up into male drivers and female drivers, the respective sample claim frequencies are more or less $ 10 is taken directly from the population, we may get the total claim frequency (for all drivers) to be more or less $9\ sample is broken down into two groups by gender, we may no longer guarantee that the claim frequencies by gender match (more or less) those calculated from the population. To insure that the sample taken preserves claim frequencies by gender, we would take a \emph{stratified sampling}. \\\\ Formally, in \emph{stratified sampling}, the following steps are taken, in order, from a population $ U$ of $ N$ units: \begin{enumerate} \item Decide what subdivisions are to be analyzed from within $ U$ and what information (or statistics) within the subdivisions should be ``preserved''. For example, if we want to analyze our data by gender, then we would have two subdivisions to study. If there is more than one categorical variable, then we would look at all the \emph{possible} combinations of the these variables. \item Make sure all the possible combinations are mutually exclusive events; \item Divide $ U$ into $ k$ subdivisions, or \emph{strata}, $ U_i$, where $ k$ is the total number of possible combinations described above. From the first two steps, we have $ $U=U_1\cup U_2\cup\ldots U_k \mbox{ such that }U_i\cap U_j=\varnothing,$ $ for all $ i&ne#neq;j$ and $ 1&le#leq;i,j&le#leq;k$. In addition, if we let $ N_i=U_i -16-JSM $N=\sum_{i=1}^{k}N_i.$ $ \item Draw a sample $ S_i$ from each stratum $ U_i$. \end{enumerate} {Remarks}. \begin{itemize} \item When each $ S_i$ is a simple random sample within each $ U_i$, then we call this procedure a \emph{stratified random sampling}. \item Each stratum corresponds to a number $ $W_i:=\frac{N}{N_i},$ $ called a \emph{stratum weight}. \item Suppose each sample $ S_i$ contains $ n_i$ units ($ S_i = n_i$) and that $ n=&sum#sum;_i=1^kn_i$. We call the stratified sampling \emph{proportional} if, for each $ i$, $ $\frac{n}{n_i}=W_i.$ $ \end{itemize} \end{document} $ |
stratified sampling is owned by Chi Woo.
None.
