Difference between revisions of "Dimensional Reduction"

From
Jump to: navigation, search
m
m
Line 39: Line 39:
 
** [http://en.wikipedia.org/wiki/Projection_pursuit Projection Pursuit]
 
** [http://en.wikipedia.org/wiki/Projection_pursuit Projection Pursuit]
 
** [http://en.wikipedia.org/wiki/Sammon_mapping Sammon Mapping/Projection]
 
** [http://en.wikipedia.org/wiki/Sammon_mapping Sammon Mapping/Projection]
** [[Local Linear Embedding (LLE)]]
+
** [[Local Linear Embedding (LLE)]] creates an embedding of the dataset and tries to preserve the relationships between neighborhoods in the dataset.
 +
 
 +
LLE can be thought of as a series of local PCAs that are globally compared to find the best non-linear embedding.
 
** [[T-Distributed Stochastic Neighbor Embedding (t-SNE)]]  ...similar objects are modeled by nearby points
 
** [[T-Distributed Stochastic Neighbor Embedding (t-SNE)]]  ...similar objects are modeled by nearby points
 
** Singular Value Decomposition (SVD) is a linear dimensionality reduction technique.
 
** Singular Value Decomposition (SVD) is a linear dimensionality reduction technique.

Revision as of 06:03, 26 August 2021

Youtube search... ...Google search

To identify the most important Features to address:

  • reduce the amount of computing resources required
  • 2D & 3D intuition often fails in higher dimensions
  • distances tend to become relatively the 'same' as the number of dimensions increases

LLE can be thought of as a series of local PCAs that are globally compared to find the best non-linear embedding.

Some datasets may contain many variables that may cause very hard to handle. Especially nowadays data collecting in systems occur at very detailed level due to the existence of more than enough resources. In such cases, the data sets may contain thousands of variables and most of them can be unnecessary as well. In this case, it is almost impossible to identify the variables which have the most impact on our prediction. Dimensional Reduction Algorithms are used in this kind of situations. It utilizes other algorithms like Random Forest, Decision Tree to identify the most important variables. 10 Machine Learning Algorithms You need to Know | Sidath Asir @ Medium



Projection

Youtube search... ...Google search