First-difference estimator

In statistics and econometrics, the first-difference (FD) estimator is an estimator used to address the problem of omitted variables with panel data. It is consistent under the assumptions of the fixed effects model. In certain situations it can be more efficient than the standard fixed effects (or "within") estimator, for example when the error terms follows a random walk.^[1] The estimator requires data on a dependent variable, $y_{i t}$ , and independent variables, $x_{i t}$ , for a set of individual units $i = 1, \dots, N$ and time periods $t = 1, \dots, T$ . The estimator is obtained by running a pooled ordinary least squares (OLS) estimation for a regression of $Δ y_{i t}$ on $Δ x_{i t}$ .

Derivation

The FD estimator avoids bias due to some unobserved, time-invariant variable $c_{i}$ , using the repeated observations over time:

y_{i t} = x_{i t} β + c_{i} + u_{i t}, t = 1, . . . T,

y_{i t - 1} = x_{i t - 1} β + c_{i} + u_{i t - 1}, t = 2, . . . T .

Differencing the equations, gives:

Δ y_{i t} = y_{i t} - y_{i t - 1} = Δ x_{i t} β + Δ u_{i t}, t = 2, . . . T,

which removes the unobserved $c_{i}$ and eliminates the first time period.^[2]^[3] The FD estimator ${\hat{β}}_{F D}$ is then obtained by using the differenced terms for $x$ and $u$ in OLS:

{\hat{β}}_{F D} = (Δ X^{'} Δ X)^{- 1} Δ X^{'} Δ y = β + (Δ X^{'} Δ X)^{- 1} Δ X^{'} Δ u

where $X, y,$ and $u$ , are notation for matrices of relevant variables. Note that the rank condition must be met for $Δ X^{'} Δ X$ to be invertible ( $rank [Δ X^{'} Δ X] = k$ ), where $k$ is the number of regressors. Let

Δ X_{i} = [Δ X_{i 2}, Δ X_{i 3}, . . ., Δ X_{i T}]

,

and, analogously,

Δ u_{i} = [Δ u_{i 2}, Δ u_{i 3}, . . ., Δ u_{i T}]

.

If the error term is strictly exogenous, i.e. $E [u_{i t} | x_{i 1}, x_{i 2}, . ., x_{i T}] = 0$ , by the central limit theorem, the law of large numbers, and the Slutsky's theorem, the estimator is distributed normally with asymptotic variance of

\hat{Avar} ({\hat{β}}_{F D}) = E [Δ {X_{i}}^{'} Δ X_{i}]^{- 1} E [Δ {X_{i}}^{'} Δ u_{i} Δ {u_{i}}^{'} Δ X_{i}] E [Δ {X_{i}}^{'} Δ X_{i}]^{- 1}

.

Under the assumption of homoskedasticity and no serial correlation, $Var (Δ u | X) = σ_{Δ u}^{2}$ , the asymptotic variance can be estimated as

\hat{Avar} ({\hat{β}}_{F D}) = {\hat{σ}}_{Δ u}^{2} (Δ X^{'} Δ X)^{- 1},

where ${\hat{σ}}_{u}^{2}$ , a consistent estimator of $σ_{u}^{2}$ , is given by

{\hat{σ}}_{Δ u}^{2} = [n (T - 1) - K]^{- 1} \sum_{i = 1}^{n} \sum_{t = 2}^{T} {\hat{Δ u_{i t}}}^{2}

and

\hat{Δ u_{i t}} = Δ y_{i t} - {\hat{β}}_{F D} Δ x_{i t}

.^[4]

Properties

To be unbiased, the fixed effects estimator (FE) requires strict exogeneity, defined as

E [u_{i t} | x_{i 1}, x_{i 2}, . ., x_{i T}] = 0

.

The first difference estimator (FD) is also unbiased under this assumption. If strict exogeneity is violated, but the weaker assumption

E [(u_{i t} - u_{i t - 1}) (x_{i t} - x_{i t - 1})] = 0

holds, then the FD estimator is consistent. Note that this assumption is less restrictive than the assumption of strict exogeneity which is required for consistency using the FE estimator when $T$ is fixed. If $T \to \infty$ , then both FE and FD are consistent under the weaker assumption of contemporaneous exogeneity. The Hausman test can be used to test the assumptions underlying the consistency of the FE and FD estimators.^[5]

Relation to fixed effects estimator

For $T = 2$ , the FD and fixed effects estimators are numerically equivalent.^[6] Under the assumption of homoscedasticity and no serial correlation in $u_{i t}$ , the FE estimator is more efficient than the FD estimator. This is because the FD estimator induces no serial correlation when differencing the errors. If $u_{i t}$ follows a random walk, however, the FD estimator is more efficient as $Δ u_{i t}$ are serially uncorrelated.^[7]

Notes

↑ Wooldridge 2001, p. 284.
↑ Wooldridge 2013, p. 461.
↑ Wooldridge 2001, p. 279.
↑ Wooldridge 2001, p. 281.
↑ Wooldridge 2001, p. 285.
↑ Wooldridge 2001, p. 284.
↑ Wooldridge 2001, p. 284.

References

Wooldridge, Jeffrey M. (2001). Econometric Analysis of Cross Section and Panel Data. MIT Press. pp. 279–291. ISBN 978-0-262-23219-7. Retrieved 30 August 2024.
Wooldridge, Jeffrey M. (2013). Introductory Econometrics: A Modern Approach (PDF) (5th ed.). South-Western Cengage Learning. ISBN 978-1-111-53104-1. Retrieved 30 August 2024.

[1] Wooldridge 2001, p. 284.

[2] Wooldridge 2013, p. 461.

[3] Wooldridge 2001, p. 279.

[4] Wooldridge 2001, p. 281.

[5] Wooldridge 2001, p. 285.

[6] Wooldridge 2001, p. 284.

[7] Wooldridge 2001, p. 284.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

First-difference estimator

Contents

Derivation

Properties

Relation to fixed effects estimator

See also

Notes

References

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools

In other projects

In other languages