glm_weightit() is used to fit generalized linear models with a
covariance matrix that accounts for estimation of weights, if supplied.
lm_weightit() is a wrapper for glm_weightit() with the Gaussian family
and identity link (i.e., a linear model). By
default, these functions use M-estimation to construct a robust covariance
matrix using the estimating equations for the weighting model and the outcome
model when available.
Usage
glm_weightit(
formula,
data,
family = gaussian,
weightit = NULL,
vcov = NULL,
cluster,
R = 500L,
offset,
start = NULL,
etastart,
mustart,
control = list(...),
x = FALSE,
y = TRUE,
contrasts = NULL,
fwb.args = list(),
br = FALSE,
...
)
lm_weightit(
formula,
data,
weightit = NULL,
vcov = NULL,
cluster,
R = 500L,
offset,
x = FALSE,
y = TRUE,
contrasts = NULL,
fwb.args = list(),
...
)Arguments
- formula
an object of class
formula(or one that can be coerced to that class): a symbolic description of the model to be fitted.- data
a data frame containing the variables in the model. If not found in data, the variables are taken from
environment(formula), typically the environment from which the function is called.- family
a description of the error distribution and link function to be used in the model. This can be a character string naming a family function, a family function or the result of a call to a family function. See family for details of family functions.
- weightit
a
weightitorweightitMSMobject; the output of a call toweightit()orweightitMSM(). If not supplied, an unweighted model will be fit.- vcov
string; the method used to compute the variance of the estimated parameters. Allowable options include
"asympt", which uses the asymptotically correct M-estimation-based method that accounts for estimation of the weights when available;"const", which uses the usual maximum likelihood estimates (only available whenweightitis not supplied);"HC0", which computes the robust sandwich variance treating weights (if supplied) as fixed;"BS", which uses the traditional bootstrap (including re-estimation of the weights, if supplied);"FWB", which uses the fractional weighted bootstrap as implemented infwb::fwb()(including re-estimation of the weights, if supplied); and"none"to omit calculation of a variance matrix. IfNULL(the default), will use"asympt"ifweightitis supplied and M-estimation is available and"HC0"otherwise. See thevcov_typecomponent of the outcome object to see which was used.- cluster
optional; for computing a cluster-robust variance matrix, a variable indicating the clustering of observations, a list (or data frame) thereof, or a one-sided formula specifying which variable(s) from the fitted model should be used. Note the cluster-robust variance matrix uses a correction for small samples, as is done in
sandwich::vcovCL()by default. Cluster-robust variance calculations are available only whenvcovis"asympt","HC0","BS", or"FWB".- R
the number of bootstrap replications when
vcovis"BS"or"FWB". Default is 500. Ignored otherwise.- offset
optional; a numeric vector containing the model offset. See
offset(). An offset can also be present in the model formula.- start
optional starting values for the coefficients.
- etastart, mustart
optional starting values for the linear predictor and vector of means. Passed to
glm().- control
a list of parameters for controlling the fitting process.
- x, y
logical values indicating whether the response vector and model matrix used in the fitting process should be returned as components of the returned value.
- contrasts
an optional list defining contrasts for factor variables. See
model.matrix().- fwb.args
an optional list of further arguments to supply to
fwb::fwb()whenvcov = "FWB".- br
logical; whether to use bias-reduced regression as implemented bybrglm2::brglmFit()(including Firth logistic regression). IfTRUE, arguments passed tocontrolor ... will be passed tobrglm2::brglmControl().- ...
arguments to be used to form the default control argument if it is not supplied directly.
Value
A glm_weightit object, which inherits from glm.
Unless vcov = "none", the vcov component contains the covariance matrix
adjusted for the estimation of the weights if requested and a compatible
weightit object was supplied. The vcov_type component contains the type
of variance matrix requested. If cluster is supplied, it will be stored in
the "cluster" attribute of the output object, even if not used.
The model component of the output object (also the model.frame() output)
will include two extra columns when weightit is supplied: (weights)
containing the weights used in the model (the product of the estimated
weights and the sampling weights, if any) and (s.weights) containing the
sampling weights, which will be 1 if s.weights is not supplied in the
original weightit() call.
Details
glm_weightit() is essentially a wrapper for glm() that
optionally computes a coefficient variance matrix that can be adjusted to
account for estimation of the weights if a weightit or weightitMSM object
is supplied to the weightit argument.
When no argument is supplied to
weightit or there is no "Mparts" attribute in the supplied object, the
default variance matrix returned will be the "HC0" sandwich variance matrix,
which is robust to misspecification of the outcome family (including
heteroscedasticity). Otherwise, the default variance matrix uses M-estimation
to additionally adjust for estimation of the weights. When possible, this
often yields smaller (and more accurate) standard errors. See the individual
methods pages to see whether and when an "Mparts" attribute is included in
the supplied object. To request that a variance matrix be computed that
doesn't account for estimation of the weights even when a compatible
weightit object is supplied, set vcov = "HC0", which treats the weights
as fixed.
Bootstrapping can also be used to compute the coefficient variance matrix;
when vcov = "BS" or vcov = "FWB", which implement the traditional
resampling-based and fractional weighted bootstrap, respectively, the entire
process of estimating the weights and fitting the outcome model is repeated
in bootstrap samples (if a weightit object is supplied). This accounts for
estimation of the weights and can be used with any weighting method. It is
important to set a seed using set.seed() to ensure replicability of the
results. The fractional weighted bootstrap is more reliable but requires the
weighting method to accept sampling weights (which most do, and you'll get an
error if it doesn't). Setting vcov = "FWB" and supplying fwb.args = list(wtype = "multinom")
also performs the resampling-based bootstrap but
with the additional features fwb provides (e.g., a progress bar and
parallelization).
Functions in the sandwich package can be to compute standard errors
after fitting, regardless of how vcov was specified, though these will
ignore estimation of the weights, if any. When no adjustment is done for
estimation of the weights (i.e., because no weightit argument was supplied
or there was no "Mparts" component in the supplied object), the default
variance matrix produced by glm_weightit() should align with that from
sandwich::vcovHC(. type = "HC0") or sandwich::vcovCL(., type = "HC0", cluster = cluster)
when cluster is supplied. Not all types are available
for all models.
See also
lm()andglm()for fitting (generalized) linear models without adjusting standard errors for estimation of the weights.glm_weightit()for fitting generalized linear models that adjust for estimation of the weights.multinom_weightit()for fitting multinomial regression models that adjust for estimation of the weights.ordinal_weightit()for fitting ordinal regression models that adjust for estimation of the weights.coxph_weightit()for fitting Cox proportional hazards models that adjust for estimation of the weights.
Examples
data("lalonde", package = "cobalt")
# Logistic regression ATT weights
w.out <- weightit(treat ~ age + educ + married + re74,
data = lalonde, method = "glm",
estimand = "ATT")
# Linear regression outcome model that adjusts
# for estimation of weights
fit1 <- lm_weightit(re78 ~ treat, data = lalonde,
weightit = w.out)
summary(fit1)
#>
#> Call:
#> lm_weightit(formula = re78 ~ treat, data = lalonde, weightit = w.out)
#>
#> Coefficients:
#> Estimate Std. Error z value Pr(>|z|)
#> (Intercept) 5515.9 376.5 14.649 <1e-06 ***
#> treat 833.2 669.1 1.245 0.213
#> Standard error: HC0 robust (adjusted for estimation of weights)
#>
# Linear regression outcome model that treats weights
# as fixed
fit2 <- lm_weightit(re78 ~ treat, data = lalonde,
weightit = w.out,
vcov = "HC0")
summary(fit2)
#>
#> Call:
#> lm_weightit(formula = re78 ~ treat, data = lalonde, weightit = w.out,
#> vcov = "HC0")
#>
#> Coefficients:
#> Estimate Std. Error z value Pr(>|z|)
#> (Intercept) 5515.9 353.5 15.604 <1e-06 ***
#> treat 833.2 676.6 1.232 0.218
#> Standard error: HC0 robust
#>
# Can also just call summary() with `vcov` option
summary(fit1, vcov = "HC0")
#>
#> Call:
#> lm_weightit(formula = re78 ~ treat, data = lalonde, weightit = w.out)
#>
#> Coefficients:
#> Estimate Std. Error z value Pr(>|z|)
#> (Intercept) 5515.9 353.5 15.604 <1e-06 ***
#> treat 833.2 676.6 1.232 0.218
#> Standard error: HC0 robust
#>
# Linear regression outcome model that bootstraps
# estimation of weights and outcome model fitting
# using fractional weighted bootstrap with "Mammen"
# weights
set.seed(123)
fit3 <- lm_weightit(re78 ~ treat, data = lalonde,
weightit = w.out,
vcov = "FWB",
R = 50, #should use way more
fwb.args = list(wtype = "mammen"))
summary(fit3)
#>
#> Call:
#> lm_weightit(formula = re78 ~ treat, data = lalonde, weightit = w.out,
#> vcov = "FWB", R = 50, fwb.args = list(wtype = "mammen"))
#>
#> Coefficients:
#> Estimate Std. Error z value Pr(>|z|)
#> (Intercept) 5515.9 361.5 15.257 <1e-06 ***
#> treat 833.2 644.4 1.293 0.196
#> Standard error: fractional weighted bootstrap
#>
