Probabilities not bounded away from zero

We have a population or cohort of $N$ people divided into $H$ sampling strata, with a sample of size $n_{h}$ taken from the population $N_{h}$ in stratum $h$ . Let $π_{i j}$ be the sampling probability for person $i$ in stratum $h$ . When we do asymptotics we usually assume $π_{i h}$ are bounded away from zero. That’s not ideal for, say, case-control studies of rare diseases, where we might want asymptotic approximations based on the case incidence being small (ie, converging to zero).

In the situations where I’m interested in $π_{i h}$ being small, it’s usually small for a whole stratum. Since sampling is independent between strata, there should be a central limit theorem separately for each stratum, and we should be able to add up the limiting Normal approximations for the stratum totals to get a Normal limit for the population total estimate and the population mean estimate.

To formalise this, suppose $n_{h} \to \infty$ for every stratum (so that asymptotics makes sense), and that $π_{i h} N_{h} / n_{h}$ is bounded above and below, so that within each stratum the sampling probability has a finite (relative) range. As a simple example, we might have a case stratum with $π_{i} \approx 1$ and a control stratum with very small $π_{i}$ .

[Update: As Stas Kolenikov points out, I’m assuming the same strata are small large along the infinite sequence, so I need something like $n_{h_{1}} / (n_{h_{1}} + n_{h_{2}}) \to c_{h_{1}, h_{2}} \in [0, 1]$ for each pair of strata. This isn’t a meaningful loss of generality since (a) the infinite sequence is an analytic fiction and we might as well set it up for our maximum convenience; and (b) even without assuming anything, every subsequence will have a subsubsequence along which the condition holds]

By standard results, $n_{h}^{1 / 2} ({\bar{X}}_{. h} - μ_{h}) \overset{d}{\to} N (0, σ_{h}^{2})$ for each stratum $h$ , and by the Skorohod representation theorem we can find an $H$ -variate normal vector $⟨ Z_{h} ⟩_{h = 1}^{H}$ with
$n_{h}^{1 / 2} ({\bar{X}}_{. h} - μ_{h}) \overset{p}{\to} Z_{h}$
(possibly on a different probability space), to get
${\bar{X}}_{. h} = μ_{h} + n_{h}^{- 1 / 2} Z_{h} + o_{p} (n_{h}^{- 1 / 2})$
The $Z_{h}$ will be independent, with mean zero; write $σ_{h}^{2}$ for the variances.

[Update: Note that $σ_{h}^{2}$ is just $v a r [Z_{h}]$ , nothing more fundamental. Under stratified random sampling, $σ_{h}^{2}$ will be $v a r [X]$ in stratum $h$ multiplied by the ‘finite population correction” $(N_{h} - n_{h}) / N_{h}$ , but under other sampling schemes it will be something else]

Now,

{\bar{X}}_{. .} = \frac{1}{N} \sum_{h = 1}^{H} N_{h} {\bar{X}}_{. h}

giving

\begin{aligned} {\bar{X}}_{. .} & = \sum_{h = 1}^{H} \frac{N_{h}}{N} μ_{h} + \frac{N_{h} n_{h}^{- 1 / 2}}{N} Z_{h} + o_{p} (\frac{N_{h} n_{h}^{- 1 / 2}}{N}) \\ = μ + (\sum_{h = 1}^{H} \frac{N_{h} n_{h}^{- 1 / 2}}{N} Z_{h}) + o_{p} (\sum_{h = 1}^{H} \frac{N_{h}}{N \sqrt{n_{h}}}) \end{aligned}

First, suppose $ N_h/N$ converges to a non-zero constant for each $h$ . Let $n_{*} = min_{h} n_{h}$ and define $H = {h : lim n_{*} / n_{h} > 0}$
$\begin{array}{rcl} {\bar{X}}_{. .} & = & μ + (\sum_{h = 1}^{H} \frac{N_{h} n_{h}^{- 1 / 2}}{N} Z_{h}) + o_{p} (\frac{max_{h} N_{h}}{N min_{h} \sqrt{n_{h}}}) \\ = & μ + (\sum_{h \in H} \frac{N_{h} n_{*}^{- 1 / 2}}{N} Z_{h}) + \sum_{h \notin H} o_{p} (n_{*}^{- 1 / 2}) + o_{p} (\frac{max_{h} N_{h}}{N \sqrt{n_{*}}}) \\ = & μ + n_{*}^{- 1 / 2} Z + o_{p} (n_{*}^{- 1 / 2}) \end{array}$

where $Z \sim N (0, σ^{2})$ with
$σ^{2} = lim_{n_{*} \to \infty} \sum_{h \in H} \frac{N_{h}^{2} n_{*} σ_{h}^{2}}{N^{2} n_{h}}$

Alternatively, for case–control sampling we may have $N_{h} / N \to 0$ in the case stratum, but we would have $n_{h}$ all of the same order, and so of the same order as their total, $n$ . The limiting distribution is dominated by the largest strata: define $H^{'} = {h : lim N_{h} / N > 0}$ (which is non-empty as $H$ is finite)

$\begin{array}{rcl} {\bar{X}}_{. .} & = & μ + (\sum_{h = 1}^{H} \frac{N_{h} n_{h}^{- 1 / 2}}{N} Z_{h}) + o_{p} (\sum_{h = 1}^{H} \frac{N_{h}}{N \sqrt{n_{h}}}) \\ = & μ + (\sum_{h \in H^{'}} \frac{N_{h} n^{- 1 / 2}}{N} Z_{h}) + \sum_{h \notin H^{'}} o_{p} (n^{- 1 / 2}) + o_{p} (n^{- 1 / 2}) \\ = & μ + n^{- 1 / 2} Z + o_{p} (n^{- 1 / 2}) \end{array}$
where $Z \sim N (0, σ^{2})$ with
$σ^{2} = lim_{n \to \infty} \sum_{h \in H} \frac{N_{h}^{2} n σ_{h}^{2}}{N^{2} n_{h}}$

Weaker conditions on $N_{h}$ and $n_{h}$ are clearly possible: it is only necessary to identify which terms dominate the limiting distribution of ${\bar{X}}_{. .}$ , since the limiting distribution of estimated stratum totals is always independent $H$ -variate Normal under appropriate scaling.