Questions

How PCA works step by step?

October 4, 2020 by Author

Table of Contents

1 How PCA works step by step?
2 How does PCA standardize data?
3 Does PCA need normalization?
4 What is normalization and standardization?

How PCA works step by step?

How do you do a PCA?

Standardize the range of continuous initial variables.
Compute the covariance matrix to identify correlations.
Compute the eigenvectors and eigenvalues of the covariance matrix to identify the principal components.
Create a feature vector to decide which principal components to keep.

How does PCA standardize data?

There are 6ish steps to PCA:

Standardize data.
Construct covariance matrix.
Extract eigenvectors and eigenvalues from the covariance matrix.
Sort the eigenvalues (and their eigenvectors!) in decreasing order.
Select a number of components to care about (and keep)
Transform your dataset.

How does normalization affect PCA?

Normalization is important in PCA since it is a variance maximizing exercise. It projects your original data onto directions which maximize the variance. The first plot below shows the amount of total variance explained in the different principal components wher we have not normalized the data.

How do you solve PCA?

Mathematics Behind PCA

Take the whole dataset consisting of d+1 dimensions and ignore the labels such that our new dataset becomes d dimensional.
Compute the mean for every dimension of the whole dataset.
Compute the covariance matrix of the whole dataset.
Compute eigenvectors and the corresponding eigenvalues.

Does PCA need normalization?

Yes, it is necessary to normalize data before performing PCA. The PCA calculates a new projection of your data set. And the new axis are based on the standard deviation of your variables.

What is normalization and standardization?

Normalization typically means rescales the values into a range of [0,1]. Standardization typically means rescales data to have a mean of 0 and a standard deviation of 1 (unit variance).

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.