k-Nearest Neighbors (k-NN)

is a non-probabilistic, non-parametric regression/instance-based, supervised learning approach where the response of a data point is determined by the nature of its 𝑘 neighbors from the training set. It can be used in both classification and regression settings
is a type of Kernel Density Estimation with a uniform kernel with variable bandwidth to encompass 𝑘 nearest neighbors
no real training stage
in test time, given a test input 𝑥, we find the 𝑘 nearest neighbors to 𝑥 in the training set. then return the average of the corresponding 𝑦 values in the training set
has a very high capacity

k-NN - Definition of Nearest

Nearest is based on either: distance measure or similarity measure

k-NN - Bias Variance Tradeoff

higher the parameter 𝑘 → higher bias & lower variance (lower capacity)
lower the parameter 𝑘 → higher variance & lower bias (higher capacity)

k-NN - Weakness

one weakness is that it cannot learn that one feature is more discriminative than another (e.g. imagine we have a regression task with 𝒙 ∊ ℝ¹⁰⁰ drawn from an isotropic Gaussian distribution, but only a single variable 𝑥₁ is relevant to the output. Suppose further that this feature simply encodes the output directly, i.e. that 𝑦 = 𝑥₁ in all cases. Nearest neighbor regression will not be able to detect this simple pattern. The nearest neighbor of most points 𝒙 will be determined by the large number of features 𝑥₂ through 𝑥₁₀₀, not by the lone feature 𝑥₁. Thus the output on small training sets will essentially be random)

k-NN - Types

training (fast):

for each training example (𝑥, 𝑦) add to the list of training examples

prediction (slow):

given query instance 𝑥_𝑞
𝐾 = find 𝑘 instances from training examples that are “nearest” to 𝑥_𝑞 (nearest based on distance measure or similarity measure)
replace from below
return class 𝑣

TYPE	replace step 3 of prediction:
Discrete	𝑣 = 𝑎𝑟𝑔𝑚𝑎𝑥_𝑦∊𝑌𝛴[𝛿(𝑦, 𝑓(𝑥_𝑖)] # for each training example 𝑥_𝑖 in 𝐾
Continuous	𝑣 = [𝛴𝑓(𝑥_𝑖)] / 𝑘
Discrete Distance-Weighted	𝑣 = 𝑎𝑟𝑔𝑚𝑎𝑥_𝑦∊𝑌𝛴[𝑤_𝑖 * 𝛿(𝑦, 𝑓(𝑥_𝑖)]
Continuous Distance-Weighted	𝑣 = [𝛴𝑤_𝑖 * 𝑓(𝑥_𝑖)] / [𝛴𝑤_𝑖]

where 𝑤_𝑖is some distance such as:

𝑤_𝑖= 1 / (some other distance measure)
- 𝑤_𝑖= 1 / 𝑒𝑢𝑐𝑙𝑖𝑑𝑒𝑎𝑛-𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒(𝑥_𝑞, 𝑥_𝑖)²
- 𝑤_𝑖= 1 / 𝑚𝑎𝑛ℎ𝑎𝑡𝑡𝑎𝑛-𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒(𝑥_𝑞, 𝑥_𝑖)²
𝑤_𝑖= (some other similarity measure)

Calculating Nearest Neighbor Efficiently

Random Project Trees
Nearest-Neighbor Descent (NN-Descent)

Resources

Interactive k-NN demo - http://vision.stanford.edu/teaching/cs231n-demos/knn/

／var／log marcus chiu

Explorer

k-Nearest Neighbors (k-NN) Regression

k-Nearest Neighbors (k-NN)

k-NN - Definition of Nearest

k-NN - Bias Variance Tradeoff

k-NN - Weakness

k-NN - Types

Calculating Nearest Neighbor Efficiently

Resources

／var／logmarcus chiu

Explorer

k-Nearest Neighbors (k-NN) Regression

k-Nearest Neighbors (k-NN)

k-NN - Definition of Nearest

k-NN - Bias Variance Tradeoff

k-NN - Weakness

k-NN - Types

Calculating Nearest Neighbor Efficiently

Resources

／var／log marcus chiu