Locally Weighted Regression
  • 𝑋 - input vector
  • πœƒ - weight vector

𝑓(𝑋) = = πœƒΒ·π‘‹

Error Function E(Xq) WeΒ Could Choose From

  1. minimize squared error over the k nearest neighbors:
    1. E1(Xq) = (1/2) Ξ£[f(x) - fΜ‚(x)]Β² for each k-nearest neighbors x of Xq
  2. minimize squared error over all training examples, while weighting the error of each training example by some function K inverse growing with respect to distance from query input Xq:
    1. E2(Xq) = (1/2) Ξ£[(f(x) - fΜ‚(x))Β² * K(d(Xq, x))] for each x in all training examples
  3. combine 1 and 2:
    1. E3(Xq) = (1/2) Ξ£[(f(x) - fΜ‚(x))Β² * K(d(Xq, x))] for each k-nearest neighbors x of Xq

Gradient Descent

  1. Ξ”πœƒj = Ξ· * Ξ£[(f(X) - fΜ‚(X)) * Xj] for each k-nearest neighbors X of Xq
  2. Ξ”πœƒj = Ξ· * Ξ£[K(d(Xq, X)) * (f(X) - fΜ‚(X)) * Xj] for each X in all training examples
  3. Ξ”πœƒj = Ξ· * Ξ£[K(d(Xq, X)) * (f(X) - fΜ‚(X)) * Xj] for each k-nearest neighbors X of Xq