250 likes | 368 Views
ACE: A Framework for optimizing music classification. Cory McKay Rebecca Fiebrink Daniel McEnnis Beinan Li Ichiro Fujinaga Music Technology Area Faculty of Music McGill University. Goals. Highlight limitations of existing pattern recognition software when applied to MIR
E N D
ACE: A Framework for optimizing music classification Cory McKay Rebecca Fiebrink Daniel McEnnis Beinan Li Ichiro Fujinaga Music Technology Area Faculty of Music McGill University
Goals • Highlight limitations of existing pattern recognition software when applied to MIR • Present solutions to these limitations • Stress importance of standardized classification and feature extraction software • Ease of use, portability and extensibility • Present the ACE software framework • Uses meta-learning • Uses classification ensembles 2/25
Existing music classification systems • Systems often implemented with specific tasks in mind • Not extensible to general tasks • Often difficult to use for those not involved in project • Need standardized systems for a variety of MIR problems • No need to reimplement existing algorithms • More reliable code • More usable software • Facilitates comparison of methodologies • Important foundations • Marsyas (Tzanetakis & Cook 1999) • M2K (Downie 2004) 3/25
Existing general classification systems • Available general-purpose systems: • PRTools (van der Heijden et al. 2004 ) • Weka (Witten & Frank 2005) • Other meta-learning systems: • AST (Lindner and Studer 1999) • Metal (www.metal-kdd.org) 4/25
Problems with existing systems • Distribution problems • Proprietary software • Not open source • Limited licence • Music-specific systems are often limited • None use meta-learning • Classifier ensembles rarely used • Interfaces not oriented towards end users • General-purpose systems not designed to meet the particular needs of music 5/25
Special needs of music classification (1) • Assign multiple classes to individual recordings • A recording may belong to multiple genres, for example • Allow classification of sub-sections and of overall recordings • Audio features often windowed • Useful for segmentation problems • Maintain logical grouping of multi-dimensional features • Musical features often consist of vectors (e.g. MFCC’s) • This relatedness can provide classification opportunities 6/25
Special needs of music classification (2) • Maintain identifying meta-data about instances • Title, performer, composer, date, etc. • Take advantage of hierarchically structured taxonomies • Humans often organize music hierarchically • Can provide classification opportunities • Interface for any user 7/25
Standardized file formats • Existing formats such as Weka’s ARFF format cannot represent needed information • Important to enable classification systems to communicate with arbitrary feature extractors • Four XML file formats that meet the above needs are described in proceedings 8/25
The ACE framework • ACE (Autonomous Classification Engine) is a classification framework that can be applied to arbitrary types of music classification • Meets all requirements presented above • Java implementation makes ACE portable and easy to install 9/25
ACE and meta-learning • Many classification methodologies available • Each have different strengths and weaknesses • Uses meta-learning to experiment with a variety of approaches • Finds approaches well suited to each problem • Makes powerful pattern recognition tools available to non-experts • Useful for benchmarking new classifiers and features 10/25
Feature Extraction System Model Classifications Extracted Features Feature Settings Taxonomy Music Recordings Trained Classifiers Statistical Comparison of Classification Methodologies Experiment Coordinator Classification Methodology 1 Classification Methodology n ACE Dimensionality Reduction Dimensionality Reduction … Classifier Evaluator 11/25
Algorithms used by ACE • Uses Weka class libraries • Makes it easy to add or develop new algorithms • Candidate classifiers • Induction trees, naive Bayes, k-nearest neighbour, neural networks, support vector machines • Classifier parameters are also varied automatically • Dimensionality reduction • Feature selection using genetic algorithms, principal component analysis, exhaustive searches • Classifier ensembles • Bagging, boosting 12/25
Classifier ensembles • Multiple classifiers operating together to arrive at final classifications • e.g. AdaBoost (Freund and Shapire 1996) • Success rates in many MIR areas are behaving asymptotically (Aucouturier and Pachet 2004) • Classifier ensembles could provide some improvement 13/25
Musical evaluation experiments • Achieved a 95.6% success with a five-class beatbox recognition experiment (Sinyor et al. 2005) • Repeated Tindale’s percussion recognition experiment (2004) • ACE achieved 96.3% success, as compared to Tindale’s best rate of 94.9% • A reduction in error rate of 27.5% 14/25
General evaluation experiments • Applied ACE to six commonly used UCI datasets • Compared results to recently published algorithm (Kotsiantis and Pintelas 2004) 15/25
Results of UCI experiments (2) • ACE performed very well • Statistical uncertainty makes it difficult to say that ACE’s results are inherently superior • ACE can perform at least as well as a state of the art algorithm with no tweaking • ACE achieved these results using only one minute per learning scheme for training and testing 17/25
Results of UCI experiments (3) • Different classifiers performed better on different datasets • Supports ACE’s experimental meta-learning approach • Effectiveness of AdaBoost (chosen 2 times out of 6) demonstrates strength of classifier ensembles 18/25
Feature extraction • ACE not tied to any particular feature extraction system • Reads Weka ARFF as well as ACE XML files • Does include two powerful and extensible feature extractors are bundled with ACE • Write Weka ARFF as well as ACE XML 19/25
jAudio • Reads: • .mp3 • .wav • .aiff • .au • .snd 20/25
jSymbolic • Reads MIDI • Uses 111 Bodhidharma features 21/25
Graphical interface Includes an on-line manual Command-line interface Batch processing External calls Java API Open source Well documented Easy to extend ACE’s interface 22/25
Current status of ACE • In alpha release • Full release scheduled for January 2006 • Finalization of GUI • User constraints on training, classification and meta-learning times • Feature weighting • Expansion of candidate algorithms • Long-term • Distributed processing, unsupervised learning, blackboard systems, automatic cross-project optimization 23/25
Conclusions • Need standardized classification software able to deal with the special needs of music • Techniques such as meta-learning and classifier ensembles can lead to improved performance • ACE designed to address these issues 24/25
Web site: • coltrane.music.mcgill.ca/ACE • E-mail: • cory.mckay@mail.mcgill.ca