


Presentation Transcript


  1. Bregman Divergences in Clustering and Dimensionality Reduction. COMS 6998-4: Learning and Empirical Inference. Irina Rish, IBM T.J. Watson Research Center. Slide credits: Srujana Merugu, Arindam Banerjee, Sameer Agarwal

  2. Outline
  • Intro to Bregman Divergences
  • Clustering with Bregman Divergences
    • k-means: quick overview
    • From Euclidean distance to Bregman divergences
    • Some rate-distortion theory
  • Dimensionality Reduction with Bregman Divergences
    • PCA: quick overview
    • Probabilistic interpretation of PCA; exponential family
    • From Euclidean distance to Bregman divergences
  • Conclusions

  3. Distance (distortion) measures in learning
  • Euclidean distance – most commonly used
    • Nearest neighbor, k-means clustering, least-squares regression, PCA, distance metric learning, etc.
  • But... is it always an appropriate type of distance? No!
    • Nominal attributes (e.g., binary)
    • Distances between distributions
  • Probabilistic interpretation:
    • Euclidean distance ↔ Gaussian data
    • Beyond Gaussian? Exponential family distributions ↔ Bregman divergences
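As a small illustration of the point above (my own example, not from the slides): two pairs of discrete distributions can be exactly the same distance apart in Euclidean terms, yet very different under KL divergence, because KL is sensitive to shifts near the boundary of the probability simplex.

```python
import numpy as np

def kl(p, q):
    """KL divergence between two discrete distributions (assumes strictly positive entries)."""
    return float(np.sum(p * np.log(p / q)))

# Both pairs differ by 0.1 in each coordinate, so their Euclidean distances are identical,
# but KL penalizes the pair near the simplex boundary roughly four times as much.
p1, q1 = np.array([0.5, 0.5]), np.array([0.6, 0.4])
p2, q2 = np.array([0.01, 0.99]), np.array([0.11, 0.89])

print(np.linalg.norm(p1 - q1), np.linalg.norm(p2 - q2))  # equal
print(kl(p1, q1), kl(p2, q2))                            # very different
```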

  4. Squared Euclidean distance is a Bregman divergence
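The slide's derivation does not survive in the transcript; the standard check, using the Bregman definition $d_\varphi(x,y) = \varphi(x) - \varphi(y) - \langle \nabla\varphi(y),\, x - y \rangle$ with $\varphi(x) = \|x\|^2$, goes:

```latex
d_\varphi(x, y) = \|x\|^2 - \|y\|^2 - \langle 2y,\, x - y \rangle
                = \|x\|^2 - 2\langle x, y \rangle + \|y\|^2
                = \|x - y\|^2 .
```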

  5. Relative entropy (i.e., KL-divergence) is another Bregman divergence
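Likewise (reconstructing the standard argument, since the slide image is missing): take $\varphi(p) = \sum_i p_i \log p_i$ (negative entropy) on the probability simplex, where $\sum_i p_i = \sum_i q_i = 1$, so the constant part of the gradient $\nabla\varphi(q) = (\log q_i + 1)_i$ drops out:

```latex
d_\varphi(p, q) = \sum_i p_i \log p_i - \sum_i q_i \log q_i
                - \sum_i (\log q_i + 1)(p_i - q_i)
                = \sum_i p_i \log \frac{p_i}{q_i}
                = \mathrm{KL}(p \,\|\, q).
```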

  6. Recall Bregman Divergences
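The definition being recalled, $d_\varphi(x, y) = \varphi(x) - \varphi(y) - \langle \nabla\varphi(y),\, x - y \rangle$ for a strictly convex, differentiable $\varphi$, is easy to verify numerically. A minimal sketch (function names are my own) recovering the two instances from slides 4 and 5:

```python
import numpy as np

def bregman(phi, grad_phi, x, y):
    """Bregman divergence d_phi(x, y) = phi(x) - phi(y) - <grad phi(y), x - y>."""
    return phi(x) - phi(y) - float(np.dot(grad_phi(y), x - y))

# phi(x) = ||x||^2  ->  squared Euclidean distance
x, y = np.array([1.0, 2.0]), np.array([3.0, 0.0])
d_euc = bregman(lambda v: float(np.dot(v, v)), lambda v: 2 * v, x, y)

# phi(p) = sum_i p_i log p_i (negative entropy)  ->  KL divergence on the simplex
p, q = np.array([0.3, 0.7]), np.array([0.5, 0.5])
d_kl = bregman(lambda v: float(np.sum(v * np.log(v))), lambda v: np.log(v) + 1, p, q)
```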

  7. Now, how about generalizing soft clustering algorithms using Bregman divergences?
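The slide's own algorithm is not in the transcript, but the idea can be sketched as an EM-style loop (function name and naive initialization are my own): responsibilities are proportional to $\pi_h \exp(-d_\varphi(x_i, \mu_h))$, and since the Bregman centroid is always the arithmetic mean (Banerjee et al.), the M-step is a responsibility-weighted average regardless of which divergence is plugged in.

```python
import numpy as np

def soft_bregman_cluster(X, k, d_phi, n_iter=50):
    """Soft clustering with an arbitrary Bregman divergence d_phi(x, mu)."""
    mu = X[:k].astype(float).copy()      # naive init: first k points as centers
    pi = np.full(k, 1.0 / k)             # uniform mixing weights
    for _ in range(n_iter):
        # E-step: r_ih proportional to pi_h * exp(-d_phi(x_i, mu_h))
        D = np.array([[d_phi(x, m) for m in mu] for x in X])
        R = pi * np.exp(-D)
        R /= R.sum(axis=1, keepdims=True)
        # M-step: weights and centroids (arithmetic mean for any Bregman divergence)
        pi = R.mean(axis=0)
        mu = (R.T @ X) / R.sum(axis=0)[:, None]
    return mu, R

# toy demo: two tight groups; squared Euclidean recovers soft k-means
X = np.array([[0.0, 0.0], [4.0, 4.0], [0.1, 0.0], [0.0, 0.1], [4.1, 4.0], [4.0, 4.1]])
mu, R = soft_bregman_cluster(X, 2, lambda x, m: float(np.sum((x - m) ** 2)))
```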

  8. (natural parameter)
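Only the caption fragment "(natural parameter)" survives from this slide. As standard background on what it refers to: a regular exponential-family density with natural parameter $\theta$ and log-partition function $\psi$ can be rewritten as a Bregman divergence to the expectation parameter $\mu = \nabla\psi(\theta)$, with $\varphi$ the Legendre conjugate of $\psi$:

```latex
p(x \mid \theta) = \exp\big(\langle x, \theta \rangle - \psi(\theta)\big)\, p_0(x)
                 = \exp\big(-d_\varphi(x, \mu)\big)\, b_\varphi(x).
```

This bijection is what lets the soft-clustering algorithms above trade exponential-family mixture models for Bregman divergences.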
