Hierarchical Clustering; Agglomerative (HAC) & Divisive (HDC)

* [[Hierarchical Temporal Memory (HTM)]]
* [[K-Means]]
* [http://www.r-bloggers.com/how-to-perform-hierarchical-clustering-using-r/ How to Perform Hierarchical Clustering using R | Perceptive Analytics]
* [http://www.researchgate.net/publication/315966848_Exploreing_K-Means_with_Internal_Validity_Indexes_for_Data_Clustering_in_Traffic_Management_System Exploreing K-Means with Internal Validity Indexes for Data Clustering in Traffic Management System | S. Nawrin, S. Akhter and M. Rahatur]
  


Hierarchical clustering algorithms fall into two categories:

  1. Agglomerative; bottom-up
  2. Divisive; top-down

[Image: Agnes.png (http://i1.wp.com/r-posts.com/wp-content/uploads/2017/12/Agnes.png)]


Agglomerative Clustering - Bottom Up

Bottom-up algorithms treat each data point as a single cluster at the outset and then successively merge (or agglomerate) pairs of clusters until all clusters have been merged into a single cluster that contains all data points. Bottom-up hierarchical clustering is therefore called hierarchical agglomerative clustering, or HAC. This hierarchy of clusters is represented as a tree (or dendrogram): the root of the tree is the single cluster that gathers all the samples, and the leaves are the clusters with only one sample each. The 5 Clustering Algorithms Data Scientists Need to Know | Towards Data Science
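
As a rough illustration of that merge process, here is a minimal sketch in Python with SciPy; the toy data, the "ward" linkage choice, and the variable names are illustrative assumptions, not taken from the original article:

<pre>
# Minimal HAC sketch with SciPy: every point starts as its own cluster,
# then the two closest clusters are merged step by step.
import numpy as np
from scipy.cluster.hierarchy import linkage

# Toy 2-D data: two loose groups (made up for illustration).
X = np.array([[1.0, 1.1], [1.2, 0.9], [0.9, 1.0],
              [8.0, 8.2], [8.1, 7.9], [7.9, 8.0]])

# "ward" merges the pair of clusters that least increases within-cluster
# variance; "single", "complete", and "average" are common alternatives.
Z = linkage(X, method="ward")

# Each row of Z records one merge: (cluster i, cluster j, distance, new size),
# so Z encodes the whole dendrogram from the leaves up to the single root.
print(Z)

# With matplotlib installed, scipy.cluster.hierarchy.dendrogram(Z) draws the tree.
</pre>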

Hierarchical clustering does not require us to specify the number of clusters up front, and because we are building a tree we can even select whichever number of clusters looks best after the fact. Additionally, the algorithm is not sensitive to the choice of distance metric: all of them tend to work equally well, whereas with other clustering algorithms the choice of distance metric is critical. A particularly good use case for hierarchical clustering is when the underlying data has a hierarchical structure and you want to recover that hierarchy; other clustering algorithms can't do this. These advantages come at the cost of lower efficiency: hierarchical clustering has a time complexity of O(n³), unlike the linear complexity of K-Means and GMM.
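
Since the tree is built once, different cluster counts can be read off it afterwards. A sketch continuing from the hypothetical Z linkage matrix above (fcluster and its "maxclust" criterion are standard SciPy; the particular k values are arbitrary):

<pre>
# Cut the already-built tree at different levels to compare flat clusterings;
# unlike K-Means, no re-clustering is needed when k changes.
from scipy.cluster.hierarchy import fcluster

for k in (2, 3, 4):
    labels = fcluster(Z, t=k, criterion="maxclust")  # at most k clusters
    print(k, labels)

# Swapping the distance metric only changes how the tree is built, e.g.
#   linkage(X, method="average", metric="cityblock")
# (method="ward" assumes the default Euclidean metric.)
</pre>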

[Animated GIF illustrating agglomerative clustering: https://cdn-images-1.medium.com/max/640/1*ET8kCcPpr893vNZFs8j4xg.gif]

<youtube>OcoE7JlbXvY</youtube>


Divisive Clustering - Top Down