130 likes | 381 Views
jAudio. David Sears MUMT 621 22 October 2009. Background—Why jAudio ? Design Features Applications Conclusion. Outline. Competition Drawbacks Marsyas complicated user interface CLAM not intended for feature extraction Maate GUI geared towards visualization
E N D
jAudio David Sears MUMT 621 22 October 2009
Background—Why jAudio? • Design • Features • Applications • Conclusion Outline
Competition Drawbacks Marsyas complicated user interface CLAM not intended for feature extraction Maate GUI geared towards visualization Sonic Visualiser feature extraction isn’t transparent Background—Why jAudio? http://www.sonicvisualiser.org/images/sv-1.0-wide.png
written in Java • Easy to use GUI • Input for .wav, .aif, .snd formats • Output in XML and Weka formats Pre-processing options • Normalization • Downsampling • Window size • Window overlap Design
Dependencies • jAudio reorders the execution of feature calculations such that every feature’s calculation is executed only after all of its dependencies have been executed. The user doesn’t have to specify the order of feature extraction. Multidimensional Features • jAudio has the capacity to accept features that provide an arbitrary number of dimensions (MFCCs).
Metafeatures • Running Mean, Derivative, Derivative of Running Mean, Standard Deviation, Derivative of Standard Deviation Extensibility • It is possible to add new features to a plugin folder that is automatically searched by jAudio at runtime without recompiling the program..
How do changes in the acoustic stimulus relate to the psychophysiological responses of listeners? Applications
Features to Consider (Linear & Derivatives) • RMS Amplitude • Spectral Centroid • Spectral Flux • Compactness • Fraction of Low Energy Windows • Relative Difference Function • Spectral Rolloff
The metafeatures and extensibility make jAudio particularly user-friendly. • This system is geared towards feature extraction, so the method of extraction is transparent. However, the system does not provide more than a few tools for visualization of the extracted features. • jAudio doesn’t currently extract Psychoacoustic features—loudness, roughness, sharpness, fluctuation strength. Conclusions
McEnnis, D., C. McKay, and I. Fujinaga. 2006. jAudio: Additions and improvements. Proceedings of the International Conference on Music Information Retrieval. 385–6. McEnnis, D., C. McKay, and I. Fujinaga. 2006. Overview of OMEN. Proceedings of the International Conference on Music Information Retrieval. 7–12. McEnnis, D., C. McKay, I. Fujinaga, and P. Depalle. 2005. jAudio: A feature extraction library. Proceedings of the International Conference on Music Information Retrieval. 600–3. McKay, C. 2009. jAudio: Extracting features from audio. In The jMIR project: General-purpose standardized software for music information retrieval research. PhD diss. in progress, McGill University. References