In the case of the multivariate random design nonparametric regression, an interpolation method is proposed to overcome the problem of unbounded finite sample variance for the local linear estimator (LLE) using a global bandwidth. This interpolation method simply uses the Nadaraya–Watson estimator with the product “Gaussian” kernel to construct pseudodata on equally spaced partition points of the support of the design density. Then the LLE using the “Epanechnikov” kernel is applied to smooth these equally spaced pseudodata. Our proposed estimator for the multivariate regression function has advantages in both the finite sample and the asymptotic cases. In the finite sample case, it always produces “smooth” regression function estimates, adapts “automatically and smoothly” to regions with sparse design, and has bounded conditional (and unconditional) bias and variance. On the other hand, in the asymptotic case, it has the same mean square error as the LLE. Empirical studies demonstrate that our suggested estimator is competitive with alternatives, in the sense of yielding both smaller sample mean integrated square error and smoother estimates.
Journal of Statistical Planning and Inference 116(1), pp.91-111