1 / 40

Spectral Clustering for Complex Data: A Comprehensive Guide

Explore the power of spectral clustering for complex cluster shapes, with insights on eigenvectors, eigenvalues, and algorithm selection. Learn to optimize clustering results and choose the right parameters for your dataset.

janew
Download Presentation

Spectral Clustering for Complex Data: A Comprehensive Guide

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Spectral Clustering Most slides are from Eyal David’s presentation

  2. Dataset exhibits complex cluster shapes • K-means performs very poorly in this space due bias toward dense spherical clusters. In the embedded space given by two leading eigenvectors, clusters are trivial to separate. Spectral Clustering Example – 2 Spirals

  3. Eigenvectors & Eigenvalues

  4. Why? • If we eventually use K-means, why not just apply K-means to the original data? • This method allows us to cluster non-convex regions

  5. Some Examples

  6. User’s Prerogative • Affinity matrix construction • Choice of scaling factor • Realistically, search over and pick value that gives the tightest clusters • Choice of k, the number of clusters • Choice of clustering method

  7. Largest eigenvalues of Cisi/Medline data λ1 • Choose k=2 λ2 How to select k? • Eigengap: the difference between two consecutive eigenvalues. • Most stable clustering is generally given by the value k that maximises the expression

  8. Recap – The bottom line

  9. Summary • Spectral clustering can help us in hard clustering problems • The technique is simple to understand • The solution comes from solving a simple algebra problem which is not hard to implement • Great care should be taken in choosing the “starting conditions”

  10. The End

More Related