210 likes | 252 Views
Song Intersection by Approximate Nearest Neighbours. Michael Casey, Goldsmiths Malcolm Slaney, Yahoo! Inc. Overview. Large Databases: Everywhere! 8B web pages 50M audio files on web 2M songs Find duplicates with shingles Text-based LSH - Randomized projections Results Best features
E N D
Song Intersection by Approximate Nearest Neighbours Michael Casey, Goldsmiths Malcolm Slaney, Yahoo! Inc.
Overview • Large Databases: Everywhere! • 8B web pages • 50M audio files on web • 2M songs • Find duplicates with shingles • Text-based • LSH - Randomized projections • Results • Best features • 2018 song subset
The Need for Normalization • Recommendations • Apply one song’s rating to another • – > Better matches • Playlists • Find matches to user requests • Remove adult/child music • Search results • Don’t show duplicates
Specificity Spectrum Fingerprinting Remixes Cover songs Genre Look for specific exact matches Our work (nearestneighbor) Bag of Features model
Remix Examples Abba Gimme Gimme Madonna Hung Up Tracy Young Remix of Hung Up Tracy Young Remix 2 of Hung Up
How Remix Recognition Works • Algorithm • Matched filter best (ICASSP2005 result) • Nearest neighbor in 360–1200D space • Ill posed? • Efficient implementation • Audio shingles • Like web-duplicate search • Locality-sensitive hashing • Probabilistic guarantee
Remix Distance Matched filter (implemented as nearest neighbor) N-best matches
Hashing • Types of hashes • String : put casey vs cased in different bins • Locality sensitive : find nearest neighbors • High-dimensional and probabilistic • Two Nearest Neighbor implementations • Pair-wise distance computation • 1,000,000,000,000 comparisons in 2M song database • Hash bucket collisions • 1,000,000,000 hash projections
Random Projections • Random projections estimate distance • Multiple projections improve estimate
Locality Sensitive Hashing Distant Vector • Hash function is a random projection • No pair-wise computation • Collisions are nearest neighbors Distant Vector
Remix Nearest Neighbour Algorithm 1 • Extract database audio shingles • Eliminate shingles < song’s mean power • Compute remix distance for all pairs • Choose pairs with remix distance < r0
Remix Nearest Neighbour Algorithm Revisited • Extract database audio shingles • Eliminate shingles < song’s mean power • Hash remaining shingles, bin width=r0 • Collisions are near neighbour shingles
Method • Choose 20 Query Songs • Each has 3-10 Remixes • 306 Madonna Songs • 2018 Madonna+Miles
Conclusions • Remixes are hard, but well-posed • Brute force distances too expensive • LSH is 1-2 orders of magnitude faster • LSH Remix Recognition is Accurate
Conclusions • Remixes are hard, but well-posed • Brute force distances too expensive • LSH is 1-2 orders of magnitude faster • LSH Remix Recognition is Accurate
Conclusions • Remixes are hard, but well-posed • Brute force distances too expensive • LSH is 1-2 orders of magnitude faster • LSH Remix Recognition is Accurate
Conclusions • Remixes are hard, but well-posed • Brute force distances too expensive • LSH is 1-2 orders of magnitude faster • LSH Remix Recognition is Accurate