1.13k likes | 1.14k Views
This article explores the general applicability of the backwards approach to PCA and introduces a nested submanifold fitting method. It examines the use of constraints and angles vectors to fit data objects and discusses the concept of nested submanifold fits. The article also considers the selection of components for fitting and the application of the AUC criterion. Additionally, it explores the idea of fitting nested sub-torii in torus space and discusses the important modes of variation in curve registration.
E N D
An Interesting Question How generally applicable is Backwards approach to PCA? An Attractive Answer: James Damon, UNC Mathematics Key Idea: Express Backwards PCA as Nested Series of Constraints
General View of Backwards PCA Define Nested Spaces via Constraints E.g. SVD Now Define: ConstraintGives Nested Reduction of Dim’n
Vectors of Angles Vectors of Angles as Data Objects Slice space with hyperplanes???? (ala Principal Nested Spheres)
Vectors of Angles E.g. , Data w/ “Single Mode of Var’n” Best Fitting Planar Slice gives Bimodal Dist’n Special Thanks to Eduardo García-Portugués
Torus Space Try To Fit A Geodesic Challenge: Can Get Arbitrarily Close
Torus Space Fit Nested Sub-Manifold
PNS Main Idea Data Objects: Where is a dimensional manifold Consider a nested series of sub-manifolds: where for and Goal: Fit all of simultaneously to
General Background Call each a stratum, so is a manifold stratification To be fit to New Approach: Simultaneously fit Nested Submanifold (NS)
Projection Notation For let denote the telescoping projection onto I.e. for Note: This projection is fundamental to Backwards PCA methods
PNS Components For a given , represent a point by its Nested Submanifold components: where for In the sense that “” means the shortest geodesic arc between &
Nested Submanifold Fits Simultaneous Fit Criteria? Based on Stratum-Wise Sums of Squares For define Uses “lengths” of NS Components:
NS Components in NS Candidate 2 (Shifted to Sample Mean) Note: Both & Decrease
NS Components in NS based On PC1 Note: Yet is Constant (Pythagorean Thm)
NS Components in NS based On PC2 Note: is Constant (Pythagorean Thm)
NS Components in NS Candidate 1
NS Components in NS Candidate 2
NS Components in NS based On PC1
NS Components in NS based On PC2
Nested Submanifold Fits Simultaneously fit Simultaneous Fit Criterion? Above Suggests Want: Works for Euclidean PCA (?)
Nested Submanifold Fits Simultaneous Fit Criterion? Above Suggests Want: Important Predecessor Pennec(2016) AUC Criterion:
Pennec’s Area Under the Curve Based on Scree Plot 2 Component Index
Pennec’s Area Under the Curve Based on Scree Plot Cumulative 2 Component Index
Pennec’s Area Under the Curve Based on Scree Plot Cumulative Area = Component Index
Torus Space Fit Nested Sub-Manifold Choice of & in: ???
Torus Space Tiled embedding is complicated (maybe OK for low rank approx.) Instead Consider Nested Sub-Torii Work in Progress with Garcia, Wood, Le Key Factor: Important Modes of Variation
OODA Big Picture New Topic: Curve Registration Main Reference: Srivastava et al (2011)
Collaborators • AnujSrivastava(Florida State U.) • Wei Wu (Florida State U.) • Derek Tucker (Florida State U.) • Xiaosun Lu (U. N. C.) • Inge Koch (U. Adelaide) • Peter Hoffmann (U. Adelaide) • J. O. Ramsay (McGill U.) • Laura Sangalli (Milano Polytech.)
Context Functional Data Analysis Curves as Data Objects Toy Example:
Context Functional Data Analysis Curves as Data Objects Toy Example: How Can We Understand Variation?
Context Functional Data Analysis Curves as Data Objects Toy Example: How Can We Understand Variation?
Context Functional Data Analysis Curves as Data Objects Toy Example: How Can We Understand Variation?
Functional Data Analysis Insightful Decomposition
Functional Data Analysis Insightful Decomposition • Horiz’l • Var’n
Functional Data Analysis Insightful Decomposition Vertical Variation • Horiz’l • Var’n
Challenge • Fairly Large Literature • Many (Diverse) Past Attempts • Limited Success (in General) • Surprisingly Slippery (even mathematical formulation)
Challenge (Illustrated) Thanks to Wei Wu
Challenge (Illustrated) Thanks to Wei Wu
Functional Data Analysis Appropriate Mathematical Framework? Vertical Variation • Horiz’l • Var’n
Landmark Based Shape Analysis Approach: Identify objects that are: • Translations • Rotations • Scalings of each other Mathematics: Equivalence Relation Results in: Equivalence Classes Which become the Data Objects
Landmark Based Shape Analysis Equivalence Classes become Data Objects a.k.a. “Orbits” Mathematics: Called “Quotient Space” , , , , , ,
Curve Registration What are the Data Objects? Vertical Variation • Horiz’l • Var’n
Curve Registration What are the Data Objects? Consider “Time Warpings” (smooth) More Precisely: Diffeomorphisms
Curve Registration Diffeomorphisms • is 1 to 1 • is onto (thus is invertible) • Differentiable • is Differentiable
Time Warping Intuition Elastically Stretch & Compress Axis
Time Warping Intuition Elastically Stretch & Compress Axis (identity)
Time Warping Intuition Elastically Stretch & Compress Axis
Time Warping Intuition Elastically Stretch & Compress Axis
Time Warping Intuition Elastically Stretch & Compress Axis
Curve Registration Say curves and are equivalent, When so that
Curve Registration Toy Example: Starting Curve,