Package 'spldv' reference manual

Title:	Spatial Models for Limited Dependent Variables
Description:	The current version of this package estimates spatial autoregressive models for binary dependent variables using GMM estimators <doi:10.18637/jss.v107.i08> and RIS estimator <doi:10.1007/978-3-662-05617-2_8>. It supports one-step (Pinkse and Slade, 1998) <doi:10.1016/S0304-4076(97)00097-3> and two-step GMM estimator along with the linearized GMM estimator proposed by Klier and McMillen (2008) <doi:10.1198/073500107000000188>. It also allows for either Probit or Logit model and compute the average marginal effects. The RIS estimator allows to estimate the SAR and SEM model. All these models are presented in Sarrias and Piras (2023) <doi:10.1016/j.jocm.2023.100432>.
Authors:	Mauricio Sarrias [aut, cre] , Gianfranco Piras [aut] , Daniel McMillen [ctb]
Maintainer:	Mauricio Sarrias <[email protected]>
License:	GPL (>= 2)
Version:	0.1.4
Built:	2025-02-16 05:17:23 UTC
Source:	https://github.com/gpiras/spldv

Get Model Summaries for use with "mtable" for objects of class bingmm

Description

A generic function to collect coefficients and summary statistics from a bingmm object. It is used in mtable

Usage

## S3 method for class 'bingmm'
getSummary(obj, alpha = 0.05, ...)
## S3 method for class 'bingmm'
getSummary(obj, alpha = 0.05, ...)

Arguments

`obj`	a `bingmm` object,
`alpha`	level of the confidence intervals,
`...`	further arguments,

Details

For more details see package memisc.

Value

A list with an array with coefficient estimates and a vector containing the model summary statistics.

Get Model Summaries for use with "mtable" for objects of class binlgmm

Description

A generic function to collect coefficients and summary statistics from a binlgmm object. It is used in mtable

Usage

## S3 method for class 'binlgmm'
getSummary(obj, alpha = 0.05, ...)
## S3 method for class 'binlgmm'
getSummary(obj, alpha = 0.05, ...)

Arguments

`obj`	a `binlgmm` object,
`alpha`	level of the confidence intervals,
`...`	further arguments,

Details

For more details see package memisc.

Value

A list with an array with coefficient estimates and a vector containing the model summary statistics.

Get Model Summaries for use with "mtable" for objects of class binris

Description

A generic function to collect coefficients and summary statistics from a binris object. It is used in mtable

Usage

## S3 method for class 'binris'
getSummary(obj, alpha = 0.05, ...)
## S3 method for class 'binris'
getSummary(obj, alpha = 0.05, ...)

Arguments

`obj`	a `binris` object,
`alpha`	level of the confidence intervals,
`...`	further arguments,

Details

For more details see package memisc.

Value

A list with an array with coefficient estimates and a vector containing the model summary statistics.

Estimation of the average marginal effects for SARB models.

Description

Obtain the average marginal effects from bingmm, binlgmm or binris class model.

Usage

## S3 method for class 'bingmm'
impacts(
  obj,
  vcov = NULL,
  vce = c("robust", "efficient", "ml"),
  het = TRUE,
  atmeans = FALSE,
  type = c("mc", "delta"),
  R = 100,
  approximation = FALSE,
  pw = 5,
  tol = 1e-06,
  empirical = FALSE,
  ...
)

## S3 method for class 'binlgmm'
impacts(
  obj,
  vcov = NULL,
  het = TRUE,
  atmeans = FALSE,
  type = c("mc", "delta"),
  R = 100,
  approximation = FALSE,
  pw = 5,
  tol = 1e-06,
  empirical = FALSE,
  ...
)

## S3 method for class 'binris'
impacts(
  obj,
  vcov = NULL,
  het = TRUE,
  atmeans = FALSE,
  type = c("mc", "delta"),
  R = 100,
  approximation = FALSE,
  pw = 5,
  tol = 1e-06,
  empirical = FALSE,
  ...
)

## S3 method for class 'impacts.bingmm'
print(x, ...)

## S3 method for class 'impacts.bingmm'
summary(object, ...)

## S3 method for class 'summary.impacts.bingmm'
print(x, digits = max(3, getOption("digits") - 3), ...)
## S3 method for class 'bingmm'
impacts(
  obj,
  vcov = NULL,
  vce = c("robust", "efficient", "ml"),
  het = TRUE,
  atmeans = FALSE,
  type = c("mc", "delta"),
  R = 100,
  approximation = FALSE,
  pw = 5,
  tol = 1e-06,
  empirical = FALSE,
  ...
)

## S3 method for class 'binlgmm'
impacts(
  obj,
  vcov = NULL,
  het = TRUE,
  atmeans = FALSE,
  type = c("mc", "delta"),
  R = 100,
  approximation = FALSE,
  pw = 5,
  tol = 1e-06,
  empirical = FALSE,
  ...
)

## S3 method for class 'binris'
impacts(
  obj,
  vcov = NULL,
  het = TRUE,
  atmeans = FALSE,
  type = c("mc", "delta"),
  R = 100,
  approximation = FALSE,
  pw = 5,
  tol = 1e-06,
  empirical = FALSE,
  ...
)

## S3 method for class 'impacts.bingmm'
print(x, ...)

## S3 method for class 'impacts.bingmm'
summary(object, ...)

## S3 method for class 'summary.impacts.bingmm'
print(x, digits = max(3, getOption("digits") - 3), ...)

Arguments

`obj`	an object of class `bingmm`, `binlgmm` or `binris`.
`vcov`	an estimate of the asymptotic variance-covariance matrix of the parameters for a `bingmm` or `binlgmm` object.
`vce`	string indicating what kind of variance-covariance matrix of the estimate should be computed when using `effect.bingmm`. For the one-step GMM estimator, the options are `"robust"` and `"ml"`. For the two-step GMM estimator, the options are `"robust"`, `"efficient"` and `"ml"`. The option `"vce = ml"` is an exploratory method that evaluates the VC of the RIS estimator using the GMM estimates.
`het`	logical. If `TRUE` (the default), then the heteroskedasticity is taken into account when computing the average marginal effects.
`atmeans`	logical. If `FALSE` (the default), then the average marginal effects are computed at the unit level.
`type`	string indicating which method is used to compute the standard errors of the average marginal effects. If `"mc"`, then the Monte Carlo approximation is used. If `"delta"`, then the Delta Method is used.
`R`	numerical. Indicates the number of draws used in the Monte Carlo approximation if `type = "mc"`.
`approximation`	logical. If `TRUE` then $(I - \lambda W)^{-1}$ is approximated as $I + \lambda W + \lambda^2 W^2 + \lambda^3 W^3 + ... +\lambda^q W^q$ . The default is `FALSE`.
`pw`	numeric. The power used for the approximation $I + \lambda W + \lambda^2 W^2 + \lambda^3 W^3 + ... +\lambda^q W^q$ . The default is 5.
`tol`	Argument passed to `mvrnorm`: tolerance (relative to largest variance) for numerical lack of positive-definiteness in the coefficient covariance matrix.
`empirical`	logical. Argument passed to `mvrnorm` (default `FALSE`): if `TRUE`, the coefficients and their covariance matrix specify the empirical not population mean and covariance matrix
`...`	further arguments. Ignored.
`x`	an object of class `impacts.bingmm`.
`object`	an object of class `impacts.bingmm` for `summary` methods.
`digits`	the number of digits.

Details

Let the model be:

$y^*= X\beta + WX\gamma + \lambda W y^* + \epsilon = Z\delta + \lambda Wy^{*} + \epsilon$

where $y = 1$ if $y^*>0$ and 0 otherwise; $\epsilon \sim N(0, 1)$ if link = "probit" or $\epsilon \sim L(0, \pi^2/3)$ if link = "logit". The RIS estimator assumes that $\epsilon \sim N(0, 1)$ .

The marginal effects respect to variable $x_r$ can be computed as

$diag(f(a))D^{-1}_{\lambda}A^{-1}_{\lambda}\left(I_n\beta_r + W\gamma_r\right) = C_r(\theta)$

where $f()$ is the pdf, which depends on the assumption of the error terms; $diag$ is the operator that creates a $n \times n$ diagonal matrix; $A_{\lambda}= (I -\lambda W)$ ; and $D_{\lambda}$ is a diagonal matrix whose elements represent the square root of the diagonal elements of the variance-covariance matrix of $u = A_{\lambda}^{-1}\epsilon$ .

We implement these three summary measures: (1) The average total effects, $ATE_r = n^{-1}i_n'C_{r}i_n$ , (2) The average direct effects, $ADE_r = n^{-1}tr(C_{r})$ , and (3) the average indirect effects, $ATE_r - ADE_r$ .

The standard errors of the average total, direct and indirect effects can be estimated using either Monte Carlo (MC) approximation, which takes into account the sampling distribution of $\theta$ , or Delta Method.

Value

An object of class impacts.bingmm.

Author(s)

Mauricio Sarrias and Gianfranco Piras.

Examples


# Data set
data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# Two-step (Probit) GMM estimator
ts <- sbinaryGMM(CRIMED ~ INC + HOVAL| HOVAL,
                link = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"), 
                data = COL.OLD, 
                type = "twostep")
                
# Marginal effects using Delta Method
summary(impacts(ts, type = "delta"))

# Marginal effects using MC with 100 draws
summary(impacts(ts, type = "mc", R = 100))

# Marginal effects using efficient VC matrix
summary(impacts(ts, type = "delta", vce = "efficient"))

# Marginal effects using efficient VC matrix and ignoring the heteroskedasticity
summary(impacts(ts, type = "delta", vce = "efficient", het = FALSE))

# Marginal effects using  RIS estimator
ris_sar <- sbinaryRis(CRIMED ~ INC + HOVAL, data = COL.OLD,
                      R = 50, 
                      listw = spdep::nb2listw(COL.nb, style = "W"))
summary(impacts(ris_sar, method = "delta"))
summary(impacts(ris_sar, method = "mc", R = 100))

# Data set
data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# Two-step (Probit) GMM estimator
ts <- sbinaryGMM(CRIMED ~ INC + HOVAL| HOVAL,
                link = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"), 
                data = COL.OLD, 
                type = "twostep")
                
# Marginal effects using Delta Method
summary(impacts(ts, type = "delta"))

# Marginal effects using MC with 100 draws
summary(impacts(ts, type = "mc", R = 100))

# Marginal effects using efficient VC matrix
summary(impacts(ts, type = "delta", vce = "efficient"))

# Marginal effects using efficient VC matrix and ignoring the heteroskedasticity
summary(impacts(ts, type = "delta", vce = "efficient", het = FALSE))

# Marginal effects using  RIS estimator
ris_sar <- sbinaryRis(CRIMED ~ INC + HOVAL, data = COL.OLD,
                      R = 50, 
                      listw = spdep::nb2listw(COL.nb, style = "W"))
summary(impacts(ris_sar, method = "delta"))
summary(impacts(ris_sar, method = "mc", R = 100))

Make instruments for spatial models

Description

Make instruments for spatial models

Usage

make.instruments(listw, x, q)
make.instruments(listw, x, q)

Arguments

`listw`	object. An object of class `listw`, `matrix`, or `Matrix`.
`x`	variable(s) to be lagged
`q`	number of lags

Author(s)

Mauricio Sarrias and Gianfranco Piras.

Estimation of SAR for binary dependent models using GMM

Description

Estimation of SAR model for binary dependent variables (either Probit or Logit), using one- or two-step GMM estimator. The type of model supported has the following structure:

$y^*= X\beta + WX\gamma + \lambda W y^* + \epsilon = Z\delta + \lambda Wy^{*} + \epsilon$

where $y = 1$ if $y^*>0$ and 0 otherwise; $\epsilon \sim N(0, 1)$ if link = "probit" or $\epsilon \sim L(0, \pi^2/3)$ if link = "logit".

Usage

sbinaryGMM(
  formula,
  data,
  listw = NULL,
  nins = 2,
  link = c("probit", "logit"),
  winitial = c("optimal", "identity"),
  s.matrix = c("robust", "iid"),
  type = c("onestep", "twostep"),
  gradient = TRUE,
  start = NULL,
  cons.opt = FALSE,
  approximation = FALSE,
  verbose = TRUE,
  print.init = FALSE,
  pw = 5,
  tol.solve = .Machine$double.eps,
  ...
)

## S3 method for class 'bingmm'
coef(object, ...)

## S3 method for class 'bingmm'
vcov(
  object,
  vce = c("robust", "efficient", "ml"),
  method = "bhhh",
  R = 1000,
  tol.solve = .Machine$double.eps,
  ...
)

## S3 method for class 'bingmm'
print(x, digits = max(3, getOption("digits") - 3), ...)

## S3 method for class 'bingmm'
summary(
  object,
  vce = c("robust", "efficient", "ml"),
  method = "bhhh",
  R = 1000,
  tol.solve = .Machine$double.eps,
  ...
)

## S3 method for class 'summary.bingmm'
print(x, digits = max(5, getOption("digits") - 3), ...)
sbinaryGMM(
  formula,
  data,
  listw = NULL,
  nins = 2,
  link = c("probit", "logit"),
  winitial = c("optimal", "identity"),
  s.matrix = c("robust", "iid"),
  type = c("onestep", "twostep"),
  gradient = TRUE,
  start = NULL,
  cons.opt = FALSE,
  approximation = FALSE,
  verbose = TRUE,
  print.init = FALSE,
  pw = 5,
  tol.solve = .Machine$double.eps,
  ...
)

## S3 method for class 'bingmm'
coef(object, ...)

## S3 method for class 'bingmm'
vcov(
  object,
  vce = c("robust", "efficient", "ml"),
  method = "bhhh",
  R = 1000,
  tol.solve = .Machine$double.eps,
  ...
)

## S3 method for class 'bingmm'
print(x, digits = max(3, getOption("digits") - 3), ...)

## S3 method for class 'bingmm'
summary(
  object,
  vce = c("robust", "efficient", "ml"),
  method = "bhhh",
  R = 1000,
  tol.solve = .Machine$double.eps,
  ...
)

## S3 method for class 'summary.bingmm'
print(x, digits = max(5, getOption("digits") - 3), ...)

Arguments

`formula`	a symbolic description of the model of the form `y ~ x \| wx` where `y` is the binary dependent variable, `x` are the independent variables. The variables after `\|` are those variables that enter spatially lagged: $WX$ . The variables in the second part of `formula` must also appear in the first part. This rules out situations in which one of the regressors can be specified only in lagged form.
`data`	the data of class `data.frame`.
`listw`	object. An object of class `listw`, `matrix`, or `Matrix`.
`nins`	numerical. Order of instrumental-variable approximation; as default `nins = 2`, such that $H = (Z, WZ, W^2Z)$ are used as instruments.
`link`	string. The assumption of the distribution of the error term; it can be either `link = "probit"` (the default) or `link = "logit"`.
`winitial`	string. A string indicating the initial moment-weighting matrix $\Psi$ ; it can be either `winitial = "optimal"` (the default) or `winitial = "identity"`.
`s.matrix`	string. Only valid of `type = "twostep"` is used. This is a string indicating the type of variance-covariance matrix $\hat{S}$ to be used in the second-step procedure; it can be `s.matrix = "robust"` (the default) or `s.matrix = "iid"`.
`type`	string. A string indicating whether the one-step (`type = "onestep"`), or two-step GMM (`type = "twostep"`) should be computed.
`gradient`	logical. Only for testing procedures. Should the analytic gradient be used in the GMM optimization procedure? `TRUE` as default. If `FALSE`, then the numerical gradient is used.
`start`	if not `NULL`, the user must provide a vector of initial parameters for the optimization procedure. When `start = NULL`, `sbinaryGMM` uses the traditional Probit or Logit estimates as initial values for the parameters, and the correlation between $y$ and $Wy$ as initial value for $\lambda$ .
`cons.opt`	logical. Should a constrained optimization procedure for $\lambda$ be used? `FALSE` as default.
`approximation`	logical. If `TRUE` then $(I - \lambda W)^{-1}$ is approximated as $I + \lambda W + \lambda^2 W^2 + \lambda^3 W^3 + ... +\lambda^q W^q$ . The default is `FALSE`.
`verbose`	logical. If `TRUE`, the code reports messages and some values during optimization.
`print.init`	logical. If `TRUE` the initial parameters used in the optimization of the first step are printed.
`pw`	numeric. The power used for the approximation $I + \lambda W + \lambda^2 W^2 + \lambda^3 W^3 + ... +\lambda^q W^q$ . The default is 5.
`tol.solve`	Tolerance for `solve()`.
`...`	additional arguments passed to `maxLik`.
`vce`	string. A string indicating what kind of standard errors should be computed when using `summary`. For the one-step GMM estimator, the options are `"robust"` and `"ml"`. For the two-step GMM estimator, the options are `"robust"`, `"efficient"` and `"ml"`. The option `"vce = ml"` is an exploratory method that evaluates the VC of the RIS estimator using the GMM estimates.
`method`	string. Only valid if `vce = "ml"`. It indicates the algorithm used to compute the Hessian matrix of the RIS estimator. The defult is `"bhhh"`.
`R`	numeric. Only valid if `vce = "ml"`. It indicates the number of draws used to compute the simulated probability in the RIS estimator.
`x`, `object`	an object of class `bingmm`
`digits`	the number of digits

Details

The data generating process is:

$y^*= X\beta + WX\gamma + \lambda W y^* + \epsilon = Z\delta + \lambda Wy^{*} + \epsilon$

where $y = 1$ if $y^*>0$ and 0 otherwise; $\epsilon \sim N(0, 1)$ if link = "probit" or $\epsilon \sim L(0, \pi^2/3)$ if link = "logit".. The general GMM estimator minimizes

$J(\theta) = g'(\theta)\hat{\Psi} g(\theta)$

where $\theta = (\beta, \gamma, \lambda)$ and

$g = n^{-1}H'v$

where $v$ is the generalized residuals. Let $Z = (X, WX)$ , then the instrument matrix $H$ contains the linearly independent columns of $H = (Z, WZ, ..., W^qZ)$ . The one-step GMM estimator minimizes $J(\theta)$ setting either $\hat{\Psi} = I_p$ if winitial = "identity" or $\hat{\Psi} = (H'H/n)^{-1}$ if winitial = "optimal". The two-step GMM estimator uses an additional step to achieve higher efficiency by computing the variance-covariance matrix of the moments $\hat{S}$ to weight the sample moments. This matrix is computed using the residuals or generalized residuals from the first-step, which are consistent. This matrix is computed as $\hat{S} = n^{-1}\sum_{i = 1}^n h_i(f^2/(F(1 - F)))h_i'$ if s.matrix = "robust" or $\hat{S} = n^{-1}\sum_{i = 1}^n \hat{v}_ih_ih_i'$ , where $\hat{v}$ are the first-step generalized residuals.

Value

An object of class “bingmm”, a list with elements:

`coefficients`	the estimated coefficients,
`call`	the matched call,
`callF`	the full matched call,
`X`	the X matrix, which contains also WX if the second part of the `formula` is used,
`H`	the H matrix of instruments used,
`y`	the dependent variable,
`listw`	the spatial weight matrix,
`link`	the string indicating the distribution of the error term,
`Psi`	the moment-weighting matrix used in the last round,
`type`	type of model that was fitted,
`s.matrix`	the type of S matrix used in the second round,
`winitial`	the moment-weighting matrix used for the first step procedure
`opt`	object of class `maxLik`,
`approximation`	a logical value indicating whether approximation was used to compute the inverse matrix,
`pw`	the powers for the approximation,
`formula`	the formula.

Author(s)

Mauricio Sarrias and Gianfranco Piras.

References

Pinkse, J., & Slade, M. E. (1998). Contracting in space: An application of spatial statistics to discrete-choice models. Journal of Econometrics, 85(1), 125-154.

Fleming, M. M. (2004). Techniques for estimating spatially dependent discrete choice models. In Advances in spatial econometrics (pp. 145-168). Springer, Berlin, Heidelberg.

Klier, T., & McMillen, D. P. (2008). Clustering of auto supplier plants in the United States: generalized method of moments spatial logit for large samples. Journal of Business & Economic Statistics, 26(4), 460-471.

LeSage, J. P., Kelley Pace, R., Lam, N., Campanella, R., & Liu, X. (2011). New Orleans business recovery in the aftermath of Hurricane Katrina. Journal of the Royal Statistical Society: Series A (Statistics in Society), 174(4), 1007-1027.

Piras, G., & Sarrias, M. (2023). One or Two-Step? Evaluating GMM Efficiency for Spatial Binary Probit Models. Journal of choice modelling, 48, 100432.

Piras, G,. & Sarrias, M. (2023). GMM Estimators for Binary Spatial Models in R. Journal of Statistical Software, 107(8), 1-33.

Examples


# Data set
data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# Two-step (Probit) GMM estimator
ts <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                link = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"), 
                data = COL.OLD, 
                type = "twostep",
                verbose = TRUE)
                
# Robust standard errors
summary(ts)
# Efficient standard errors
summary(ts, vce = "efficient")

# One-step (Probit) GMM estimator 
os <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                link = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"), 
                data = COL.OLD, 
                type = "onestep",
                verbose = TRUE)
summary(os)

# One-step (Logit) GMM estimator with identity matrix as initial weight matrix
os_l <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                  link = "logit", 
                  listw = spdep::nb2listw(COL.nb, style = "W"), 
                  data = COL.OLD, 
                  type = "onestep",
                  winitial = "identity", 
                  verbose = TRUE)
summary(os_l)

# Two-step (Probit) GMM estimator with WX
ts_wx <- sbinaryGMM(CRIMED ~ INC + HOVAL| INC + HOVAL,
                   link = "probit", 
                   listw = spdep::nb2listw(COL.nb, style = "W"), 
                   data = COL.OLD, 
                   type = "twostep",
                   verbose = FALSE)
summary(ts_wx)

# Constrained two-step (Probit) GMM estimator 
ts_c <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                  link = "probit", 
                  listw = spdep::nb2listw(COL.nb, style = "W"), 
                  data = COL.OLD, 
                  type = "twostep",
                  verbose = TRUE, 
                  cons.opt = TRUE)
summary(ts_c)

# Data set
data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# Two-step (Probit) GMM estimator
ts <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                link = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"), 
                data = COL.OLD, 
                type = "twostep",
                verbose = TRUE)
                
# Robust standard errors
summary(ts)
# Efficient standard errors
summary(ts, vce = "efficient")

# One-step (Probit) GMM estimator 
os <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                link = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"), 
                data = COL.OLD, 
                type = "onestep",
                verbose = TRUE)
summary(os)

# One-step (Logit) GMM estimator with identity matrix as initial weight matrix
os_l <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                  link = "logit", 
                  listw = spdep::nb2listw(COL.nb, style = "W"), 
                  data = COL.OLD, 
                  type = "onestep",
                  winitial = "identity", 
                  verbose = TRUE)
summary(os_l)

# Two-step (Probit) GMM estimator with WX
ts_wx <- sbinaryGMM(CRIMED ~ INC + HOVAL| INC + HOVAL,
                   link = "probit", 
                   listw = spdep::nb2listw(COL.nb, style = "W"), 
                   data = COL.OLD, 
                   type = "twostep",
                   verbose = FALSE)
summary(ts_wx)

# Constrained two-step (Probit) GMM estimator 
ts_c <- sbinaryGMM(CRIMED ~ INC + HOVAL,
                  link = "probit", 
                  listw = spdep::nb2listw(COL.nb, style = "W"), 
                  data = COL.OLD, 
                  type = "twostep",
                  verbose = TRUE, 
                  cons.opt = TRUE)
summary(ts_c)

Estimation of SAR for binary models using Linearized GMM.

Description

Estimation of SAR model for binary dependent variables (either Probit or Logit), using Linearized GMM estimator suggested by Klier and McMillen (2008). The model is:

$y^*= X\beta + WX\gamma + \lambda W y^* + \epsilon = Z\delta + \lambda Wy^{*} + \epsilon$

where $y = 1$ if $y^*>0$ and 0 otherwise; $\epsilon \sim N(0, 1)$ if link = "probit" or $\epsilon \sim L(0, \pi^2/3)$ link = "logit".

Usage

sbinaryLGMM(
  formula,
  data,
  listw = NULL,
  nins = 2,
  link = c("logit", "probit"),
  ...
)

## S3 method for class 'binlgmm'
coef(object, ...)

## S3 method for class 'binlgmm'
vcov(object, ...)

## S3 method for class 'binlgmm'
print(x, digits = max(3, getOption("digits") - 3), ...)

## S3 method for class 'binlgmm'
summary(object, ...)

## S3 method for class 'summary.binlgmm'
print(x, digits = max(3, getOption("digits") - 2), ...)
sbinaryLGMM(
  formula,
  data,
  listw = NULL,
  nins = 2,
  link = c("logit", "probit"),
  ...
)

## S3 method for class 'binlgmm'
coef(object, ...)

## S3 method for class 'binlgmm'
vcov(object, ...)

## S3 method for class 'binlgmm'
print(x, digits = max(3, getOption("digits") - 3), ...)

## S3 method for class 'binlgmm'
summary(object, ...)

## S3 method for class 'summary.binlgmm'
print(x, digits = max(3, getOption("digits") - 2), ...)

Arguments

`formula`	a symbolic description of the model of the form `y ~ x \| wx` where `y` is the binary dependent variable, `x` are the independent variables. The variables after `\|` are those variables that enter spatially lagged: $WX$ . The variables in the second part of `formula` must also appear in the first part.
`data`	the data of class `data.frame`.
`listw`	object. An object of class `listw`, `matrix`, or `Matrix`.
`nins`	numerical. Order of instrumental-variable approximation; as default `nins = 2`, such that $H = (Z, WZ, W^2Z)$ are used as instruments.
`link`	string. The assumption of the distribution of the error term; it can be either `link = "probit"` (the default) or `link = "logit"`.
`...`	additional arguments.
`x`, `object`	an object of class `binlgmm`.
`digits`	the number of digits

Details

The steps for the linearized spatial Probit/Logit model are the following:

1. Estimate the model by standard Probit/Logit model, in which spatial autocorrelation and heteroskedasticity are ignored. The estimated values are $\beta_0$ . Calculate the generalized residuals assuming that $\lambda = 0$ and the gradient terms $G_{\beta}$ and $G_{\lambda}$ .

2. The second step is a two-stage least squares estimator of the linearized model. Thus regress $G_{\beta}$ and $G_{\lambda}$ on $H = (Z, WZ, W^2Z, ...., W^qZ)$ and obtain the predicted values $\hat{G}$ . Then regress $u_0 + G_{\beta}'\hat{\beta}_0$ on $\hat{G}$ . The coefficients are the estimated values of $\beta$ and $\lambda$ .

The variance-covariance matrix can be computed using the traditional White-corrected coefficient covariance matrix from the last two-stage least squares estimator of the linearlized model.

Value

An object of class “bingmm”, a list with elements:

`coefficients`	the estimated coefficients,
`call`	the matched call,
`X`	the X matrix, which contains also WX if the second part of the `formula` is used,
`H`	the H matrix of instruments used,
`y`	the dependent variable,
`listw`	the spatial weight matrix,
`link`	the string indicating the distribution of the error term,
`fit`	an object of `lm` representing the T2SLS,
`formula`	the formula.

Author(s)

Mauricio Sarrias and Gianfranco Piras.

References

Piras, G., & Sarrias, M. (2023). One or Two-Step? Evaluating GMM Efficiency for Spatial Binary Probit Models. Journal of choice modelling, 48, 100432.

Piras, G,. & Sarrias, M. (2023). GMM Estimators for Binary Spatial Models in R. Journal of Statistical Software, 107(8), 1-33.

Examples

# Data set
data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# LGMM for probit using q = 3 for instruments
lgmm <- sbinaryLGMM(CRIMED ~ INC + HOVAL | INC,
                link  = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"),
                nins  = 3, 
                data  = COL.OLD)
summary(lgmm)
# Data set
data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# LGMM for probit using q = 3 for instruments
lgmm <- sbinaryLGMM(CRIMED ~ INC + HOVAL | INC,
                link  = "probit", 
                listw = spdep::nb2listw(COL.nb, style = "W"),
                nins  = 3, 
                data  = COL.OLD)
summary(lgmm)

Estimation of spatial probit model for binary outcomes using RIS (GHK) simulator

Description

Estimation of spatial probit model using RIS-normal (a.k.a GHK) simulator. The models can be the SAR or SEM probit model model. The SAR probit model has the following structure:

$y^*= X\beta + WX\gamma + \lambda W y^* + \epsilon = Z\delta + \lambda Wy^{*} + \epsilon = A_{\lambda}^{-1}Z\delta + u,$

where $y = 1$ if $y^*>0$ and 0 otherwise, $Z = (X, WX)$ , $\delta = (\beta', \gamma')'$ , $u = A_{\lambda}^{-1}\epsilon$ with $A_{\lambda} = (I - \lambda W)$ , and $\epsilon \sim N(0, I)$ . The SEM probit model has the following structure:

$y^* = X\beta + WX\gamma + u = Z\delta + u$

where $y = 1$ if $y^*>0$ and 0 otherwise, $Z = (X, WX)$ , $\delta = (\beta', \gamma')'$ , $u = \rho W u + \epsilon$ such that $u = A_{\rho}^{-1}\epsilon$ , and $\epsilon \sim N(0, I)$ ,

Usage

sbinaryRis(
  formula,
  data,
  subset,
  na.action,
  listw = NULL,
  R = 20,
  model = c("SAR", "SEM"),
  varcov = c("invsigma", "sigma"),
  approximation = TRUE,
  pw = 5,
  start = NULL,
  Qneg = FALSE,
  print.init = FALSE,
  ...
)

## S3 method for class 'binris'
terms(x, ...)

## S3 method for class 'binris'
estfun(x, ...)

## S3 method for class 'binris'
bread(x, ...)

## S3 method for class 'binris'
df.residual(object, ...)

## S3 method for class 'binris'
vcov(object, ...)

## S3 method for class 'binris'
coef(object, ...)

## S3 method for class 'binris'
logLik(object, ...)

## S3 method for class 'binris'
print(x, ...)

## S3 method for class 'binris'
summary(object, eigentol = 1e-12, ...)

## S3 method for class 'summary.binris'
print(x, digits = max(3, getOption("digits") - 2), ...)
sbinaryRis(
  formula,
  data,
  subset,
  na.action,
  listw = NULL,
  R = 20,
  model = c("SAR", "SEM"),
  varcov = c("invsigma", "sigma"),
  approximation = TRUE,
  pw = 5,
  start = NULL,
  Qneg = FALSE,
  print.init = FALSE,
  ...
)

## S3 method for class 'binris'
terms(x, ...)

## S3 method for class 'binris'
estfun(x, ...)

## S3 method for class 'binris'
bread(x, ...)

## S3 method for class 'binris'
df.residual(object, ...)

## S3 method for class 'binris'
vcov(object, ...)

## S3 method for class 'binris'
coef(object, ...)

## S3 method for class 'binris'
logLik(object, ...)

## S3 method for class 'binris'
print(x, ...)

## S3 method for class 'binris'
summary(object, eigentol = 1e-12, ...)

## S3 method for class 'summary.binris'
print(x, digits = max(3, getOption("digits") - 2), ...)

Arguments

`formula`	a symbolic description of the model of the form `y ~ x \| wx` where `y` is the binary dependent variable, `x` are the independent variables. The variables after `\|` are those variables that enter spatially lagged: $WX$ . The variables in the second part of `formula` must also appear in the first part. This rules out situations in which one of the regressors can be specified only in lagged form.
`data`	the data of class `data.frame`.
`subset`	an optional vector specifying a subset of observations to be used in the fitting process.
`na.action`	a function which indicates what should happen when the data contains `NA`s.
`listw`	object. An object of class `listw`, `matrix`, or `Matrix`.
`R`	numerical. The number of draws used in RIS (GHK) simulator.
`model`	string. A string indicating which model to estimate. It can be `"SAR"` for the spatial autoregressive spatial model or `"SEM"` for the spatial error model.
`varcov`	string. A string indicating over which variance-covariance matrix to apply the Chokesly factorization.
`approximation`	logical. If `TRUE` (the default) then $(I - \lambda W)^{-1}$ or $(I - \rho W)^{-1}$ is approximated as $I + \lambda W + \lambda^2 W^2 + \lambda^3 W^3 + ... +\lambda^q W^q$ . If `FALSE`, then the inverse is computed without approximations.
`pw`	numeric. The power used for the approximation $I + \lambda W + \lambda^2 W^2 + \lambda^3 W^3 + ... +\lambda^q W^q$ . The default is 5.
`start`	if not `NULL`, the user must provide a vector of initial parameters for the optimization procedure. When `start = NULL`, `sbinaryRis` uses the traditional Probit estimates as initial values for the parameters, and the correlation between $y$ and $Wy$ as initial value for $\lambda$ or $\rho$ .
`Qneg`	logical. Whether to construct the diagonal elements of $Q$ . If `Qneg = FALSE`, then $q_{ii} = 2y_{i} - 1$ . If `Qneg = TRUE`, then $q_{ii} = 1- 2y_{i}$ .
`print.init`	logical. If `TRUE` the initial parameters used in the optimization of the first step are printed.
`...`	additional arguments passed to `maxLik`.
`x`, `object`	an object of class `bingmm`.
`eigentol`	the standard errors are only calculated if the ratio of the smallest and largest eigenvalue of the Hessian matrix is less than `eigentol`. Otherwise the Hessian is treated as singular.
`digits`	the number of digits

Details

The models are estimated by simulating the probabilities using the RIS-normal (GHK) simulator. The aim is to evaluate the multivariate density function $P(\upsilon = Q u < s)$ , where $Q$ is a diagonal matrix with entries $q_{ii} = 2y_i - 1$ and the $n\times 1$ vector $s$ depends on whether the model is SAR or SEM. If model = "SAR", then $s = QA_{\lambda}^{-1}Z\delta$ where $A_{\lambda} = (I - \lambda W)$ ; if model = "SEM", then $s = QZ\delta$ .

Let $\Sigma_{\upsilon} = QVar(u)Q'$ be the variance-covariance model of the transformed model. If model = "SAR" $\Sigma_{\upsilon} = Q\Sigma_{\lambda}Q'$ , where $\Sigma_{\lambda} = (A_{\lambda}'A_{\lambda})^{-1}$ . If model = "SEM", then $\Sigma_{\upsilon} = Q\Sigma_{\rho}Q'$ , where $\Sigma_{\rho} = (A_{\rho}'A_{\rho})^{-1}$ .

Since $\Sigma_{\upsilon}$ is positive definite, there exists a Cholesky decomposition such that $C'C = \Sigma_{\upsilon}^{-1}$ , where $C$ is the upper triangular Cholesky matrix and $\Sigma_{\upsilon}^{-1}$ is the precision matrix. Let $B = C^{-1}$ . Then the random vector $Qu$ can be replaced by $Qu = C^{-1}\eta = B\eta$ , where $\eta$ is a vector of standard normal variables. Then, the upper limit of the integration becomes $Qu = B \eta < s$ , which can be written as $\eta < B^{-1}s = \nu$ .

The RIS simulator is implemented by drawing a large number $R$ of random vector $\eta$ and computing $\eta_{rj}$ recursively for $j = 1, \ldots, n$ . The parameters are estimated using the simulated maximum likelihood (SML) function:

$\ln \tilde{L}(\theta) = \ln \left(\frac{1}{R}\sum_{r = 1}^R \tilde{p}_r\right),$

where:

$\tilde{p}_r = \prod_{j = 1}^n \Phi(\hat{\eta}_{jr}),$

and $\Phi(\cdot)$ is the univariate CDF of the standard normal density.

By default, sbinaryRis compute the SML using the Cholesky transformation on $\Sigma_{\upsilon}^{-1}$ , varcov = "invsigma". The transformation can also be applied to $\Sigma_{\upsilon}$ using varcov = "sigma", which is slower than the previous option.

This estimator can take several minutes for large datasets. Thus, by default the inverse matrices $A_{\lambda}^{-1}$ and $A_{\rho}^{-1}$ are approximated using the Leontief expansion.

Value

An object of class "binris", a list with elements:

`varcov`	the matrix over which the Chokesly factorization is applied,
`model`	type of model that was fitted,
`Qneg`	matrix Q used,
`approximation`	a logical value indicating whether approximation was used to compute the inverse matrix,
`pw`	the powers for the approximation,
`call`	the matched call,
`X`	the X matrix, which contains also WX if the second part of the `formula` is used,
`y`	the dependent variable,
`listw`	the spatial weight matrix,
`formula`	the formula,
`R`	number of draws,
`mf`	model frame.

Author(s)

Mauricio Sarrias and Gianfranco Piras.

References

Beron, K. J., & Vijverberg, W. P. (2004). Probit in a spatial context: a Monte Carlo analysis. In Advances in spatial econometrics: methodology, tools and applications (pp. 169-195). Berlin, Heidelberg: Springer Berlin Heidelberg.

Fleming, M. M. (2004). Techniques for estimating spatially dependent discrete choice models. In Advances in spatial econometrics (pp. 145-168). Springer, Berlin, Heidelberg.

Pace, R. K., & LeSage, J. P. (2016). Fast simulated maximum likelihood estimation of the spatial probit model capable of handling large samples. In Spatial Econometrics: Qualitative and Limited Dependent Variables (pp. 3-34). Emerald Group Publishing Limited.

Piras, G., & Sarrias, M. (2023). One or Two-Step? Evaluating GMM Efficiency for Spatial Binary Probit Models. Journal of choice modelling, 48, 100432.

Examples


data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# Estimate SAR probit model using RIS simulator using Sigma_v^{-1}
ris_sar <- sbinaryRis(CRIMED ~ INC + HOVAL, 
                     data = COL.OLD,
                     R = 50, 
                     listw = spdep::nb2listw(COL.nb, style = "W"), 
                     print.level = 2)
summary(ris_sar)

# Estimate SAR probit model using RIS simulator using Sigma_v
ris_sar_2  <- sbinaryRis(CRIMED ~ INC + HOVAL, 
                        data = COL.OLD,
                        R = 50, 
                        listw = spdep::nb2listw(COL.nb, style = "W"), 
                        varcov = "sigma", 
                        print.level = 2)
summary(ris_sar_2)

# Estimate SDM probit model using RIS simulator
ris_sdm <- sbinaryRis(CRIMED ~ INC + HOVAL | INC + HOVAL, 
                     data = COL.OLD,
                     R = 50, 
                     listw = spdep::nb2listw(COL.nb, style = "W"), 
                     print.level = 2)
summary(ris_sdm)

# Estimate SEM probit model using RIS simulator
ris_sem <- sbinaryRis(CRIMED ~ INC + HOVAL | INC, 
                     data = COL.OLD,
                     R = 50, 
                     listw = spdep::nb2listw(COL.nb, style = "W"), 
                     model = "SEM")
summary(ris_sem)


data(oldcol, package = "spdep")

# Create dependent (dummy) variable
COL.OLD$CRIMED <- as.numeric(COL.OLD$CRIME > 35)

# Estimate SAR probit model using RIS simulator using Sigma_v^{-1}
ris_sar <- sbinaryRis(CRIMED ~ INC + HOVAL, 
                     data = COL.OLD,
                     R = 50, 
                     listw = spdep::nb2listw(COL.nb, style = "W"), 
                     print.level = 2)
summary(ris_sar)

# Estimate SAR probit model using RIS simulator using Sigma_v
ris_sar_2  <- sbinaryRis(CRIMED ~ INC + HOVAL, 
                        data = COL.OLD,
                        R = 50, 
                        listw = spdep::nb2listw(COL.nb, style = "W"), 
                        varcov = "sigma", 
                        print.level = 2)
summary(ris_sar_2)

# Estimate SDM probit model using RIS simulator
ris_sdm <- sbinaryRis(CRIMED ~ INC + HOVAL | INC + HOVAL, 
                     data = COL.OLD,
                     R = 50, 
                     listw = spdep::nb2listw(COL.nb, style = "W"), 
                     print.level = 2)
summary(ris_sdm)

# Estimate SEM probit model using RIS simulator
ris_sem <- sbinaryRis(CRIMED ~ INC + HOVAL | INC, 
                     data = COL.OLD,
                     R = 50, 
                     listw = spdep::nb2listw(COL.nb, style = "W"), 
                     model = "SEM")
summary(ris_sem)

Package 'spldv'

Help Index

Get Model Summaries for use with "mtable" for objects of class bingmm

Description

Usage

Arguments

Details

Value

Get Model Summaries for use with "mtable" for objects of class binlgmm

Description

Usage

Arguments

Details

Value

Get Model Summaries for use with "mtable" for objects of class binris

Description

Usage

Arguments

Details

Value

Estimation of the average marginal effects for SARB models.

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Examples

Make instruments for spatial models

Description

Usage

Arguments

Author(s)

Estimation of SAR for binary dependent models using GMM

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Estimation of SAR for binary models using Linearized GMM.

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Estimation of spatial probit model for binary outcomes using RIS (GHK) simulator

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples