![]() |
The
jackknife procedure consists of three steps:
The random groups must be formed so that each random group has essentially the same sampling design as the parent sample. This ensures that the random group estimator of variance has acceptable statistical properties. In complex surveys this is typically done by using the primary sampling units (PSUs) as the random groups. 2. Constructing Replicate Weights To produce replicate weights, each of the A random groups should be removed, in turn, from the file, and the remaining random groups weighted. The records belonging to the removed random group should be assigned a replicate weight of 0. For records from the remaining A-1 random groups, the base weight assigned when the full sample was weighted will be used as the starting point for computing the replicate weights. The base weights are first adjusted by multiplying them by an adjustment factor {A/A-1} to account for the fact that one random groups has been removed. Then, all additional weight adjustments used in the full-sample weighting process should be repeated. Removing each of the A random groups in turn means that these weighting procedures will have to be repeated A times to produce the A sets of replicate weights. 3. Computing Estimates of Variance Let the population parameter o be estimated by ô, an estimator based on data from the full sample. The aim is to estimate the variance of ô, using the jackknife estimator to obtain V(ô). Assume that A such groups have been constructed. Then, for each group (a = 1,... ,A), ô(a) is calculated based only on the data that remain after omitting the ath group. For a = 1,..., A we define ôa = Aô - (A-1)ô(a) The jackknife estimator of ô is the average value of the previously-calculated estimates ô = (1/A)Sôa and the jackknife variance estimator which calculates the variability between the subsample estimates, is defined as V [1/(A(A-1))]S(ôa-ô)2 where the summation is done over a = 1 to A. The Jackknife was originally used for bias reduction. In most survey applications, the jackknife is used only to capture the sampling variance, and a different formula is used. The replicate estimate ô(a) is estimated directly using the appropriate replicate weights (rather than as a pseudo-value):
|