Locally Weighted Regression
- πΒ - input vector
- π - weight vector
π(π) = = πΒ·π
Error Function E(Xq) WeΒ Could Choose From
- minimize squared error over the k nearest neighbors:
- E1(Xq) = (1/2) Ξ£[f(x) - fΜ(x)]Β² for each k-nearest neighbors x of Xq
- minimize squared error over all training examples, while weighting the error of each training example by some function K inverse growing with respect to distance from query input Xq:
- E2(Xq) = (1/2) Ξ£[(f(x) - fΜ(x))Β² * K(d(Xq, x))] for each x in all training examples
- combine 1 and 2:
- E3(Xq) = (1/2) Ξ£[(f(x) - fΜ(x))Β² * K(d(Xq, x))] for each k-nearest neighbors x of Xq
Gradient Descent
- Ξπj = Ξ· * Ξ£[(f(X) - fΜ(X)) * Xj] for each k-nearest neighbors X of Xq
- Ξπj = Ξ· * Ξ£[K(d(Xq, X)) * (f(X) - fΜ(X)) * Xj] for each X in all training examples
- Ξπj = Ξ· * Ξ£[K(d(Xq, X)) * (f(X) - fΜ(X)) * Xj] for each k-nearest neighbors X of Xq