Principal Component Analysis (PCA)

- is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly [[Correlation|correlated variables]] into a set of values of linearly uncorrelated/independent variables called principal components
- helpful when most of the variation of the data is due to variations of a few principal components
- depends on the:
	- eigen-decomposition of a [[Positive Semi-Definite Matrix|positive semi-definite matrices]]
	- [[Singular Value Decomposition／Factorization (SVD) - Reduced SVD|singular value decomposition]] of rectangular matrices
- learns a linear projection that aligns the direction of greatest variance with axes of the new space
- can be viewed as an [[Unsupervised Learning|unsupervised learning algorithm]] that learns a new "representation" of data:
	- that has lower dimensionality than the original input data
	- whose elements have no linear correlation with each other

PCA is a linear technique for dimensionality reduction which performs a linear mapping of the data to a lower-dimensional space in such a way that the variance of the data in the low-dimensional representation is maximized. In practice, the covariance or correlationmatrix of the data-variables is constructed and the eigenvectors on this matrix are computed. The eigenvectors that correspond to the largest eigenvalues can now be used to reconstruct a large fraction of the variance of the original data

PCA - Introduction

let 𝑋 be a 𝑝✕𝑛 matrix of 𝑛 observations:

𝑋 = [𝑋₁, …, 𝑋_𝑛]

where each 𝑋_𝑖is a 𝑝✕1 vector

sample mean 𝑀 be a 𝑝✕1 vector defined as:

𝑀 = (1/𝑛) (𝑋₁ + … + 𝑋_𝑛)

translate the 𝑛 observations as so:

𝑋_𝑖ˆ = 𝑋_𝑖- 𝑀

assign 𝑋 to be the mean-deviation form (having sample mean = 0)

𝑋 = [𝑋₁ˆ, …, 𝑋_𝑛ˆ]

let 𝑆 be a 𝑝✕𝑝 sample covariance matrix

𝑆 = (1/(𝑛-1)) 𝑋𝑋^𝑇

find the eigenvalues and eigenvectors of 𝑆

eigenvalues {𝜆₁, …, 𝜆_𝑝}
eigenvectors {𝑣₁, …, 𝑣_𝑝}

normalize the eigenvectors to get the principal components:

𝑢_𝑖= 𝑣_𝑖/ ||𝑣_𝑖||
principal components = {𝑢₁, …, 𝑢_𝑝}

let 𝑃 be the change of variable/basis matrix that contains the principal components as columns

𝑃 = [𝑢₁, …, 𝑢_𝑝]

𝑃 is used to transform vector 𝑋_𝑖with basis defined by the observations axis to a vector 𝑌_𝑖with basis {𝑢₁, …, 𝑢_𝑝}

𝑋_𝑖= 𝑃𝑌_𝑖
𝑋= 𝑃𝑌
𝑌_𝑖= 𝑃^𝑇𝑋_𝑖
𝑌= 𝑃^𝑇𝑋

for any orthogonal 𝑃 the covariance matrix of 𝑌 = [𝑌₁, …, 𝑌_𝑝] is:

𝑆 = (1/(𝑛-1)) 𝑋𝑋^𝑇
𝑆 = (1/(𝑛-1)) (𝑃𝑌)(𝑃𝑌)^𝑇
𝑆 = (1/(𝑛-1)) 𝑃𝑌𝑌^𝑇𝑃^𝑇
𝑃^𝑇𝑆𝑃 = (1/(𝑛-1)) 𝑌𝑌^𝑇

thus, covariance matrix of 𝑌 = 𝑃^𝑇𝑆𝑃

PCA - Reducing the Dimension of Multivariate Data

an orthogonal change of variable/basis does not change the total-variance of the data (because left-multiplication by 𝑃 does not change lengths of vectors nor angles between them)
this means if 𝑆 = 𝑃𝐷𝑃^𝑇 then:
- {total-variance of observation 𝑥₁, …, 𝑥_𝑝} = {total-variance of 𝑦₁, …, 𝑦_𝑝} = 𝑡𝑟𝑎𝑐𝑒(𝑆) = 𝑡𝑟𝑎𝑐𝑒(𝐷) = 𝜆₁ + … + 𝜆_𝑝
the variance of 𝑦_𝑖= 𝜆_𝑖
the quotient 𝜆_𝑖/𝑡𝑟𝑎𝑐𝑒(𝐷) measures the fraction of total variance explained or captured by 𝑦_𝑖

PCA - Example

Click here to expand...

3 measurements made on each of the 4 individuals:

sample mean vector

translate the observations

mean-deviation matrix

sample covariance matrix (which is positive semi-definite)

𝑆 ‘s eigenvalues and unit eigenvectors

the 3 principal components are the 3 unit eigenvectors

𝑦₁ = -0.074𝑥₁ - 0.303𝑥₂ + 0.950𝑥₃

𝑦₂ = -0.819𝑥₁ - 0.525𝑥₂ - 0.231𝑥₃

𝑦₃ = -0.569𝑥₁ + 0.796𝑥₂ + 0.209𝑥₃

the sample covariance matrix of the transformed data using variable/basis {𝑦₁, 𝑦₂, 𝑦₃} is

compare trace between 𝑆 and 𝐷 (they should be equal)

𝑡𝑟𝑎𝑐𝑒(𝑆) = 10 + 8 + 32 = 50

𝑡𝑟𝑎𝑐𝑒(𝐷) = 34.55 + 13.84 + 1.601 = 49.991 = 50 because of round off errors

𝑡𝑟𝑎𝑐𝑒(𝑆) = 𝑡𝑟𝑎𝑐𝑒(𝐷)

the percentages of “total-variance” explained/captured by each “principal-component” are:

𝑦₁= 34.55 / 50 = 69.1%

𝑦₂= 13.84 / 50 = 27.68%

𝑦₃= 1.601 / 50 = 3.202%

／var／log marcus chiu

Explorer

Principal Component Analysis (PCA)

Principal Component Analysis (PCA)

PCA - Introduction

PCA - Reducing the Dimension of Multivariate Data

PCA - Example

PCA - Subpages

Resources

／var／logmarcus chiu

Explorer

Principal Component Analysis (PCA)

Principal Component Analysis (PCA)

PCA - Introduction

PCA - Reducing the Dimension of Multivariate Data

PCA - Example

PCA - Subpages

Resources

／var／log marcus chiu