Difference between revisions of "Principal Component Analysis (PCA)"

From
Jump to: navigation, search
m
m
Line 20: Line 20:
 
[https://www.bing.com/news/search?q=Principal+Component+Analysis+PCA&qft=interval%3d%228%22 ...Bing News]
 
[https://www.bing.com/news/search?q=Principal+Component+Analysis+PCA&qft=interval%3d%228%22 ...Bing News]
  
 +
* [[Backpropagation]] ... [[Activation Functions]] ... [[Loss]] ... [[Boosting]] ... [[Gradient Descent Optimization & Challenges|Gradient Descent]] ... [[Algorithm Administration#Hyperparameter|Hyperparameter]] ... [[Manifold Hypothesis]] ... [[Principal Component Analysis (PCA)|PCA]]
 
* [[AI Solver]] ... [[Algorithms]] ... [[Algorithm Administration|Administration]] ... [[Model Search]] ... [[Discriminative vs. Generative]] ... [[Optimizer]] ... [[Train, Validate, and Test]]
 
* [[AI Solver]] ... [[Algorithms]] ... [[Algorithm Administration|Administration]] ... [[Model Search]] ... [[Discriminative vs. Generative]] ... [[Optimizer]] ... [[Train, Validate, and Test]]
 
* [[...find outliers]]
 
* [[...find outliers]]
* [[Manifold Hypothesis]]
 
 
* [[Supervised|Supervised Learning]] ... [[Semi-Supervised]] ... [[Self-Supervised]] ... [[Unsupervised]]
 
* [[Supervised|Supervised Learning]] ... [[Semi-Supervised]] ... [[Self-Supervised]] ... [[Unsupervised]]
 
* [[Embedding]]:  [[Agents#AI-Powered Search|Search]] ... [[Clustering]] ... [[Recommendation]] ... [[Anomaly Detection]] ... [[Classification]] ... [[Dimensional Reduction]] ... [[...find outliers]]
 
* [[Embedding]]:  [[Agents#AI-Powered Search|Search]] ... [[Clustering]] ... [[Recommendation]] ... [[Anomaly Detection]] ... [[Classification]] ... [[Dimensional Reduction]] ... [[...find outliers]]

Revision as of 00:15, 11 July 2023

YouTube ... Quora ...Google search ...Google News ...Bing News

Principal Component Analysis (PCA) goal is to reduce the dimensionality of a data set consisting of a large number of interrelated variables, while retaining as much as possible of the variation present in the data set. This is accomplished by linearly transforming the data into a new coordinate system where (most of) the variation in the data can be described with fewer dimensions than the initial data. The new dimensions are called principal components, and they are uncorrelated and ordered by the amount of variance they explain. PCA can help you simplify large data tables, visualize multidimensional data, and identify hidden patterns in your data. This data reduction technique allows the simplifying multidimensional data sets to 2 or 3 dimensions for plotting purposes and visual variance analysis.



  1. Center (and standardize) data
  2. First principal component axis
    1. Across centroid of data cloud
    2. Distance of each point to that line is minimized, so that it crosses the maximum variation of the data cloud
  3. Second principal component axis
    1. Orthogonal to first principal component
    2. Along maximum variation in the data
  4. First PCA axis becomes x-axis and second PCA axis y-axis
  5. Continue process until the necessary number of principal components is obtained


principal-component-analysis-basics-scatter-plot-data-mining-1.png


NumXL