120 likes | 454 Views
SortMyTunes Martin McCrory School of Informatics, Indiana University Bloomington April 18, 2008 This presentation and any accompanying commentary is released to the world under the GNU General Public License on April 18, 2008.
E N D
SortMyTunes Martin McCrory School of Informatics, Indiana University Bloomington April 18, 2008 This presentation and any accompanying commentary is released to the world under the GNU General Public License on April 18, 2008.
SortMyTunes is a framework for metadata-based Music Classification. Why is this kind of classification important? Most music collections have at least some metadata component!
So, how does SortMyTunes work? • The user creates “pods,” or bins of music that are different from each other in some way. • SMT classifieds a portion of the rest of the database into these pods. • The user makes manual corrections, and then SMT iterates again.
Where does the metadata come from? Manual metadata creation Etc.
What are “tags”? • Tags form the basic building block of metadata for each track. • The use of tags as metadata replaces any need to retrieve data from actual audio.
Example: User creates a file representing the initial state of the “pods”: Alternative!, artist1,title1, Guitar, 2000s, Attractive, Acoustic Rap-tastic, artist2,title2, Rap, Doggy, Style, Annoying Classical and Rock, artist3, title3, Classical, Symphony, Strings, Long, Annoying, Boring Classical and Rock, artist4, title4, rock, pop
Example: Example: User inputs a file that represents the rest of the database: Jack Johnson, Where's My Girl, Guitar, 2000s, Attractive, Acoustic Beyonce, Dontcha, Rap, Doggy, Style, Annoying Beethoven, 5th String Quartet, Classical, Symphony, Strings, Long, Annoying, Boring John Mayer, Blibbity Blah, guitar, long, aCousTiC Brahms, Symphony, Classical, Symphony, Orchestra, Chicago Snoop Dogg, Thugz, Rap, Doggy, Style, R&B Indigo Girls, Go Girls!, Pop, Live, Concert, Loud, Rock Dave Matthews, Crash, Guitar, Acoustic, Live, Lame, Drama, Violin Stravinsky, Rite of Spring, Classical, Symphony, Loud, Intense Wesley Willis, Who Gives a Shit, Electronic, Repetitive, Dumb, Lame No Doubt, Gwen Stefani's Song, Pop, Rock, Woman, Vocals, Attractive Aphex Twin, Windowlicker, Ambient, Electronic, Techno, Bizarre, Wierd Aphex Twin, Orange, Orchestral, Orchestra, Ambient, Electronic, Techno, Irregular, Time, Signature Aphex Twin, Dijeridoo, Ambient, Dijeridoo, Instrument, Instrumental, Rhythmic Chopin, Piano Etude, Piano, Etude, Chopin, Classical, Harmony, Gay Kanye West, I'm So Awesome, Rap, Rock, R&B, Black, Rapper
Example: SMT sorts the collection into the pods that the user specified: * ---Alternative!--- * ************** Pod Tracks *************** * artist1 - title1 <Tags: Guitar, 2000s, Attractive, Acoustic, > * Jack Johnson - Where's My Girl <Tags: Guitar, 2000s, Attractive, Acoustic, > * John Mayer - Blibbity Blah <Tags: guitar, long, aCousTiC, > * Dave Matthews - Crash <Tags: Guitar, Acoustic, Live, Lame, Drama, Violin, > * Wesley Willis - Who Gives a Shit <Tags: Electronic, Repetitive, Dumb, Lame, > * Aphex Twin - Windowlicker <Tags: Ambient, Electronic, Techno, Bizarre, Wierd, > * Aphex Twin - Orange <Tags: Orchestral, Orchestra, Ambient, Electronic, Techno, Irregular, Time, Signature, > * Aphex Twin - Dijeridoo <Tags: Ambient, Dijeridoo, Instrument, Instrumental, Rhythmic, > * ********* Pod Tags (with count) ********* * (Acoustic|4), (Guitar|4), (Ambient|3), (Electronic|3), (Techno|2), (Lame|2), (Attractive|2), (2000s|2), (Rhythmic|1), (Instrumental|1), (Instrument|1), (Dijeridoo|1), (Signature|1), (Time|1), (Irregular|1), (Orchestra|1), (Orchestral|1), (Wierd|1), (Bizarre|1), (Dumb|1), (Repetitive|1)
Example: SMT sorts the collection into the pods that the user specified: * ---Rap-tastic--- * ************** Pod Tracks *************** * artist2 - title2 <Tags: Rap, Doggy, Style, Annoying, > * Beyonce - Dontcha <Tags: Rap, Doggy, Style, Annoying, > * Snoop Dogg - Thugz <Tags: Rap, Doggy, Style, R&B, > * Kanye West - I'm So Awesome <Tags: Rap, Rock, R&B, Black, Rapper, > * ********* Pod Tags (with count) ********* * (Rap|4), (Style|3), (Doggy|3), (R&B|2), (Annoying|2), (Rapper|1), (Black|1), (Rock|1)
Example: SMT sorts the collection into the pods that the user specified: * ---Classical and Rock--- * ************** Pod Tracks *************** * artist3 - title3 <Tags: Classical, Symphony, Strings, Long, Annoying, Boring, > * artist4 - title4 <Tags: rock, pop, > * Beethoven - 5th String Quartet <Tags: Classical, Symphony, Strings, Long, Annoying, Boring, > * Brahms - Symphony <Tags: Classical, Symphony, Orchestra, Chicago, > * Indigo Girls - Go Girls! <Tags: Pop, Live, Concert, Loud, Rock, > * Stravinsky - Rite of Spring <Tags: Classical, Symphony, Loud, Intense, > * No Doubt - Gwen Stefani's Song <Tags: Pop, Rock, Woman, Vocals, Attractive, > * Chopin - Piano Etude <Tags: Piano, Etude, Chopin, Classical, Harmony, Gay, > * ********* Pod Tags (with count) ********* * (Classical|5), (Symphony|4), (pop|3), (rock|3), (Loud|2), (Boring|2), (Annoying|2), (Long|2), (Strings|2), (Gay|1), (Harmony|1), (Chopin|1), (Etude|1), (Piano|1), (Attractive|1), (Vocals|1), (Woman|1), (Intense|1), (Concert|1)
Miscellanea • SMT uses the “k-means” classification algorithm. • SMT is not limited with regard to collection size. • Current efficiency of SMT is O(n^2).
Questions/Contact: Martin McCrory mccrory@indiana.edu