Open Journal of Statistics
Vol.3 No.6(2013), Article ID:41537,5 pages DOI:10.4236/ojs.2013.36049

Estimating the Parameters Geographically Weighted Regression (GWR) with Measurement Error

Ida Mariati Hutabarat1,2, Asep Saefuddin2, Anik Djuraidah2, I Wayan Mangku3

1Departement of Mathematics, Cenderawasih University, Jayapura, Indonesia

2Departement of Statistics, Bogor Agricultural University, Bogor, Indonesia

3Departement of Mathematics, Bogor Agricultural University, Bogor, Indonesia

Email: ida_mariati@yahoo.com, asaefuddin@gmail.com, anikdjuraidah@gmail.com, wayan.mangku@gmail.com

Copyright © 2013 Ida Mariati Hutabarat et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. In accordance of the Creative Commons Attribution License all Copyrights © 2013 are reserved for SCIRP and the owner of the intellectual property Ida Mariati Hutabarat et al. All Copyright © 2013 are guarded by law and by SCIRP as a guardian.

Received October 18, 2013; revised November 18, 2013; accepted November 25, 2013

Keywords: Geographical Weighted Regression; Measurement Error; Instrumental Variable; Weighted Least Squares

ABSTRACT

Geographically weighted regression models with the measurement error are a modeling method that combines the global regression models with the measurement error and the weighted regression model. The assumptions used in this model are a normally distributed error with that the expectation value is zero and the variance is constant. The purpose of this study is to estimate the parameters of the model and find the properties of these estimators. Estimation is done by using the Weighted Least Squares (WLS) which gives different weighting to each location. The variance of the measurement error is known. Estimators obtained are. The properties of the estimator are unbiased and have a minimum variance.

1. Introduction

The measurement error is the error appeared when a recorded value isn’t exactly equal to the true value in terms of a measurement process, so that the true value of the explanatory variables is represented by a value that is obtained through a measurement process that isn’t necessarily correspond to the true value.

[1] says that the measurement errors affect the slope of the regression curve. [2] say that the measurement errors can lead to bias in the regression estimator and also lead to a model that is not built right or not representative of the population. The presence of the measurement error causes biased and inconsistent parameter estimates and leads to erroneous conclusions [3]. In addressing these issues, we use the measurement error models.

There are many researches that have been discussed about parametric regression models with the measurement error, including [2,4]. Nonparametric regression models with the measurement errors have been developed by [5]. They discussed a nonparametric regression function estimator which is constructed to reflect the fact that there are errors in variables. The result of their study shows that the convergence rates of all possible estimators have a lower bound possessed by the kernel estimators. In addition, [6] have conducted a study to estimate the parameters in the measurement error model with the modified spline method.

Some researches have been done on the nonlinear regression model with the measurement errors such as [7] in the logistic regression model of the development of heart disease. In their study, they introduced a bias-adjusted estimator. [8] conducted a study of the measurement error in the generalized linear model (GLM). The results of the computational methods offered an informative plot, called the measurement error trace which graphically illustrates the effect of the measurement error on the estimated parameters. [9] estimated the parameters of the proportional hazard models.

According to [10], spatial data are prone to the measurement error in the covariates, so that the research for spatial regression models with the measurement errors begins to develop, because in practice there are variables that can’t be measured directly or can’t be measured precisely in accordance with the true value as well as the spatial effect. [10] used a model of the conditional auto-regressive (CAR) in the study of spatial linear mixed model. The results of their research show that the naive estimators of the regression coefficients are attenuated while the naive estimators of the variance components are inflated, if the measurement error is ignored.

Beside CAR, there are several ways to analyze the spatial data. One of them is the geographically weighted regression model [11]. The geographically weighted regression model (GWR) is a development of the classical linear regression model. In the linear regression model only valid parameter estimators are produced globally, whereas in the GWR model it produced the model parameter estimators that are local to each observation location.

Based on the problems and the development of the research before, in this case, we are interested in examining the GWR model with the measurement error. So that, the purpose of this study is to determine the parameter estimators β and to examine the statistical properties of the resulting on the GWR model with the measurement error.

2. Model

GWR Model is a regression model of global development of the basic idea which is taken from the nonparametric regression [12]. This Model is a locally linear regression that generates a local model parameter estimates to each point or the region where the data is collected.

GWR Model can be written as follows [11]:

(2.1)

with states the point of coordinates (latitude, longitude) region is the value of a random variable and xi is the value of a fixed variable which is known and does not contain errors. This means that xi can be observed directly.

If xi can’t be observed directly, it will be observed variables zi. In this case, there has been a measurement error of the xi. Measurement error is referred to as an error on the variables xi. Measurement error models are:

(2.2)

where is a random variables. The observed random variables called indicator variables, and unobserved variables referred to as latent variables.

So that from Equations (2.1) and (2.2) the regression model becomes:

3. Estimation

The first step from this model approach is forming a weighting matrix for each observation (location). Weighting matrix is used to estimate the parameters in the location. Suppose the weight for each location

is, , then the location parameter allegedly by adding the element weighting in Equation (2.1).

(3.1)

Suppose is the element of the i-th row of the matrix 𝑿. Then the value of y at the location of the observation can be written as follows:

We let then

, (3.2)

If can’t be directly observed or experienced measurement error, then the Equation (3.1) becomes:

In Equation (3.2) it is assumed that has mean 0 and constant variance.

Means, so that

where

From Equation (3.3) with

,

obtained

so

(3.4)

If Equation (3.4) derived to and the result equated to zero then obtain parameter estimators

The parameter estimation of geographically weighted regression models with the measurement error for each location is

(3.5)

4. Statistical Properties

After obtaining the estimators it will look for the properties of the estimator in Equation (3.5). From Equation (2.1), weighted regression with measurement errors obtained:

which can also be written as

(4.1)

where

In matrix notation, the regression Equation (4.1) is

(4.2)

Estimating the parameters obtained from the weighted

From the above Equation we can see that

so that the estimator is a biased estimator for.

[13] found that the instrumental variable method provided unbiased estimates in linear models. To prove the unbiasedness, we use a linear model with covariates. Suppose is the matrix containing the instrumental variables and, i.e., where “1” indicates the vector of ones which is needed for the intercept. According to the definition of instrumental variables [2], V and are independent. Therefore,

To obtain the parameter estimates, we create a “quasi” normal Equation by pre-multiplying both sides of Equation (4.2) with, and hence, we obtain

(4.3)

Since, Equation (4.3) above becomes

Therefore, the unbiased estimates obtained by the instrumental variable technique are

Despite the fact that it is sometimes difficult to find variables serving purely as instrumental variables [14], this technique is a good option to overcome the problems of measurement error [13]. In addition, [15] proposed an instrumental variable technique as a method to estimate the reliability coefficient of covariates that are difficult to measure.

To prove whether is an efficient estimator is

with

should be as small as possible so that efficient estimator.

5. Summary

In this paper, we assume that the variance of the measurement error is known. Assumptions used in this model are the errors normally distributed with that expected value is zero and the variance is constant. Geographically weighted regression models with the measurement error use the instrumental variable method. gives the unbiased estimation in the linear model, with terms. and are independent. The properties of estimators of geographically weighted regression models with the measurement error are an efficient estimator if should be as small as possible.

REFERENCES

  1. W. A. Fuller “Measurement Error Models,” John Wiley and Sons Inc., New York, 1987. http://dx.doi.org/10.1002/9780470316665
  2. R. J. Carroll, D. Ruppert and L. A. Stefanski “Measurement Error in Nonlinear Models,” Chapman and Hall, New York, 1995. http://dx.doi.org/10.1007/978-1-4899-4477-1
  3. X. H. Chen, H. Hong and D. Nekipelov, “Nonlinear Models of Measurement Errors,” Journal of Economic Literature, Vol. 49, No. 4, 2011, pp. 901-937. http://dx.doi.org/10.1257/jel.49.4.901
  4. W. A. Fuller and M. A. Hidiroglou, “Regression Estimation after Correction for Attenuation,” Journal of the American Statistical Association, Vol. 73, No. 361, 1978, pp. 99-104. http://dx.doi.org/10.1080/01621459.1978.10480011
  5. J. Fan and Y. K. Truong, “Nonparametric Regression with Errors in Variables,” Annals of Statistics, Vol. 21, No. 4, 1993, pp. 1900-1925. http://dx.doi.org/10.1214/aos/1176349402
  6. R. J. Carroll, J. D. Maca and D. Ruppert, “Nonparametric Regression in the Presence of Measurement Error,” Biometrika, Vol. 86, No. 3, 1999, pp. 541-554. http://dx.doi.org/10.1093/biomet/86.3.541
  7. L. A. Stefanski and R. J. Carrol, “Covariate Measurement Error in Logistic Regression,” The Annals of Statistics, Vol. 13, No. 4, 1985, pp. 1335-135. http://dx.doi.org/10.1214/aos/1176349741
  8. L. A. Stefanski, “Measurement Errors in Generalized Linear Model Explanatory Variables,” 3rd International Workshop on Statistical Modelling, Vienna, 4-8 July 1988, pp. 1-40.
  9. T. Nakamura, “Proportional Hazards Model with Covariates Subject to Measurement Error,” Biometrics, Vol. 48, No. 3, 1992, pp. 829-839. http://dx.doi.org/10.2307/2532348
  10. Y. Li, H. C. Tang and X. H. Lin, “Spatial Linear Mixed Models with Covariate Measurement Errors,” Statistica Sinica, Vol. 19, No. 3, 2009, pp. 1077-1093.
  11. A. S. Fotheringham, C. Brunsdon and M. E. Charlton, “Geographically Weighted Regression, the Analysis of Spatially Varying Relationships,” John Wiley and Sons, Ltd., Hoboken, 2002.
  12. C. L. Mei, “Geographically Weighted Regression Technique for Spatial Data Analysis,” School of Science Xi’an Jiaotong University, Xi’an, 2005.
  13. A. Madansky, “The Fitting of Straight Lines When Both Variables Are Subject to Error,” Journal of the American Statistician Association, Vol. 54, No. 285, 1959, pp. 173- 205. http://dx.doi.org/10.1080/01621459.1959.10501505
  14. L. M. Bartels, “Instrumental and Quasi-Instrumental Variables,” American Journal of Political Science, Vol. 35, No. 3, 1991, pp. 777-800. http://dx.doi.org/10.2307/2111566
  15. R. J. Carroll and L. A. Stefanski, “Measurement Error, Instrumental Variables and Corrections for Attenuation with Application to Meta-Analyses,” Statistics in Medicine, Vol. 13, No. 12, 1994, pp. 1265-1282. http://dx.doi.org/10.1002/sim.4780131208