Yes

This question comes up from time to time on social media or StackExchange or email, often from reasonable people, so extra emphasis might be useful.

There are two parts to the answer:

Yes

and

If you think about it, what else could it be using?

Write $\hat{β}$ for the svyglm estimator. Theory says the estimator solves the weighted score equations

$U (β) = \sum_{i = 1}^{N} \frac{R_{i}}{π_{i}} U_{i} (β) = 0$

where $N$ is the population size, $R_{i}$ is the sampling indicator, and $U_{i} = \partial_{β} ℓ_{i} (β)$ is the score. Doing a Taylor series expansion on this gives

$U (β_{0}) = U (\hat{β}) + \frac{\partial U}{\partial β} (β_{0} - \hat{β}) + \text{remainder}$

where $β_{0}$ is the target value of $β$. Because $U (\hat{β}) = 0$, this gives

$\hat{β} - β_{0} \approx - {[\frac{\partial U}{\partial β}]}^{- 1} U (β_{0}) .$

The large-sample variance approximation is then the ‘sandwich’

$\widehat{var} [\hat{β}] = {[\frac{\partial U}{\partial β}]}^{- 1} \widehat{var} [U (β_{0})] {[\frac{\partial U}{\partial β}]}^{- 1} .$
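To make the recipe concrete, here is a minimal sketch in plain Python (not the survey package's code). It assumes a one-parameter Poisson log-linear model, made-up data and inclusion probabilities, and independent sampling of records with small sampling fractions, so the middle term is just a sum of squared weighted scores. A real complex design would need a design-based estimate of that term instead. All names here are hypothetical.

```python
import math

# Hypothetical toy sample: covariate x, count outcome y, and inclusion
# probability pi for each sampled record (values invented for illustration).
x = [0.2, 0.5, 0.9, 1.3, 1.8, 2.1]
y = [1, 1, 3, 4, 5, 9]
pi = [0.10, 0.25, 0.10, 0.50, 0.25, 0.50]
w = [1.0 / p for p in pi]  # sampling weights 1 / pi_i


def score_i(beta, xi, yi):
    """Score U_i(beta) for a one-parameter Poisson log-linear model."""
    return xi * (yi - math.exp(beta * xi))


def weighted_score(beta):
    """U(beta): sum over sampled records of U_i(beta) / pi_i."""
    return sum(wi * score_i(beta, xi, yi) for wi, xi, yi in zip(w, x, y))


def score_derivative(beta):
    """dU/dbeta: the outer ('bread') term before inversion."""
    return sum(-wi * xi * xi * math.exp(beta * xi) for wi, xi in zip(w, x))


# Solve U(beta) = 0 by Newton's method.
beta_hat = 0.0
for _ in range(50):
    step = weighted_score(beta_hat) / score_derivative(beta_hat)
    beta_hat -= step
    if abs(step) < 1e-12:
        break

# Middle term under the ASSUMED independent-sampling simplification:
# a sum of squared weighted score contributions.
bread = score_derivative(beta_hat)
meat = sum((wi * score_i(beta_hat, xi, yi)) ** 2 for wi, xi, yi in zip(w, x, y))

# Sandwich: bread^{-1} * meat * bread^{-1} (scalars here).
sandwich_var = meat / bread ** 2
print(beta_hat, math.sqrt(sandwich_var))
```

With a vector parameter the same structure holds, with the bread as a matrix and the squares replaced by outer products.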

This is all similar to, e.g., Huber’s or White’s derivation of the sandwich estimator. The only difference is that the middle term¹ has to be estimated differently because of the survey design. That is, the svyglm variance estimator generalises the familiar sandwich estimators to allow for non-trivial sampling.

The middle term is the variance of an estimated population total, and is estimated the same way as for any other population total. This is literally true: all the population-total variance estimates go through the function svyrecvar.²

The middle term is $\sum_{i, j} \frac{R_{i j}}{π_{i j}} \frac{R_{i} U_{i}}{π_{i}} \frac{R_{j} U_{j}}{π_{j}}$ where $R_{i j}$ indicates that both $i$ and $j$ are sampled and $π_{i j}$ are the pairwise sampling probabilities. If you had independent sampling of individual records, so $cov [R_{i}, R_{j}] = 0$ for $i \neq j$, the middle term would reduce to $\sum_{i} \frac{R_{i}}{π_{i}} {[\frac{R_{i} U_{i}}{π_{i}}]}^{\otimes 2}$ and the whole thing simplifies to a standard sandwich estimator.
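The collapse from a double sum to a single sum can be checked numerically. The sketch below is an illustration in plain Python, not the package's code. It uses the textbook Horvitz–Thompson variance estimator with the covariance factor $(π_{i j} - π_{i} π_{j})$ written out explicitly, which is an assumption about what the schematic expression above abbreviates and may differ from it in detail. The scores are scalars and all values are invented.

```python
# Hypothetical sampled records: inclusion probabilities and scalar scores.
pi = [0.10, 0.25, 0.50, 0.25]
u = [1.5, -0.7, 2.0, 0.4]
n = len(pi)


def pairwise_prob(i, j):
    """pi_ij under independent sampling: pi_i on the diagonal, pi_i * pi_j off it."""
    return pi[i] if i == j else pi[i] * pi[j]


# Double sum over sampled pairs (R_i = R_j = 1 for every record listed here).
double_sum = sum(
    (pairwise_prob(i, j) - pi[i] * pi[j]) / pairwise_prob(i, j)
    * (u[i] / pi[i]) * (u[j] / pi[j])
    for i in range(n)
    for j in range(n)
)

# Only the diagonal survives when the off-diagonal covariances are zero.
single_sum = sum((1 - pi[i]) * (u[i] / pi[i]) ** 2 for i in range(n))

print(double_sum, single_sum)
```

In this form the surviving diagonal terms carry a factor $(1 - π_{i})$, which is close to 1 when sampling fractions are small. Under cluster or stratified sampling the off-diagonal terms do not vanish, and that is exactly what the design-based estimate has to account for.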

Model-based standard error estimates are based on simplifying the sandwich estimator by making stronger assumptions about the structure of the middle term. We can’t do this with survey data: we don’t necessarily assume anything about how the finite population was generated, so no simplifications are available.
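As one illustration of such a stronger assumption (a standard textbook identity, not anything specific to the survey package): suppose the records are independent draws from a correctly specified model, and write $S (β) = \sum_{i} U_{i} (β)$ for the unweighted score, a symbol introduced here only for this example. The second Bartlett identity then gives

$var [S (β_{0})] = - E [\frac{\partial S}{\partial β}]$

so the middle term can be replaced by minus the outer term, and the sandwich collapses to the usual inverse-information formula

$\hat{var} [\hat{β}] = {[- \frac{\partial S}{\partial β}]}^{- 1} .$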

So, yes, all the models in the survey and svyVGAM packages use model-robust standard errors.

¹ meat? cheese? falafel? avocado?
² or analogous functions such as ppsvar or twophasevar for other categories of design

Does svyglm use robust standard errors?
