220 likes | 362 Views
Song-level Multi-pitch Tracking by Heavily Constrained Clustering. Zhiyao Duan , Jinyu Han and Bryan Pardo EECS Dept., Northwestern Univ. Interactive Audio Lab, http://music.cs.northwestern.edu For presentation in ICASSP 2010, Dallas, Texas, USA. Multi-pitch Estimation & Tracking Task.
E N D
Song-level Multi-pitch Tracking by Heavily Constrained Clustering ZhiyaoDuan, Jinyu Han and Bryan Pardo EECS Dept., Northwestern Univ. Interactive Audio Lab, http://music.cs.northwestern.edu For presentation in ICASSP 2010, Dallas, Texas, USA.
Multi-pitch Estimation & Tracking Task • Given polyphonic music played by several monophonic harmonic instruments (Num known) • Estimate a pitch trajectory for each instrument Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Potential Applications • Automatic music transcription • Harmonic source separation • Other applications • Melody-based music search • Chord recognition • Source localization • Music education • …… Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
The 2-stage Standard Approach • Stage 1: Multi-pitch Estimation (MPE): estimate pitches in each single time frame • Z. Duan, B. Pardo and C. Zhang. , “Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-peak Regions”, IEEE Trans. Audio Speech Language Process., in press. • Stage 2: Multi-pitch Tracking (MPT): connect pitch estimates across frames into pitch trajectories Frequency … Time
State of the Art of MPT • What existing MPT methods do • Form short pitch trajectories within a note, (note-level) according to local time-frequency proximity of pitch estimates • Our contribution • Form long pitch trajectories through multiple notes (song-level) using a new constrained clustering algorithm Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Try Clustering by Timbre • Each trajectory is a cluster of pitch estimates • One cluster per instrument • Clustering principle: maintain timbre consistency in each cluster ? Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Timbre Feature of Pitch Estimates • Harmonic structure: relative amplitudes of first 50 harmonics Frequency Time
Minimize This Objective Function Number of Clusters Center of k-th cluster For all pitch estimates in k-th cluster A partition into K clusters The 50-d harmonic structure of i-th pitch estimate Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Objective Function Is Not Enough Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Add Pitch-locality Constraints • Must-link: pitch estimates close in both time and frequency should be in the same cluster • Cannot-link:simultaneous pitches should not be in the same cluster (only for monophonic instruments) Frequency Time
Properties of Our Problem • Objective: timbre consistency • Constraints: pitch locality • Previous constrained clustering algorithms do not apply due to the following properties: • Inconsistent constraints: pitch estimates sometimes erroneous may make constraints unsatisfiable • Heavily constrained: nearly every pitch estimate is involved in at least one constraint Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
The Proposed Clustering Algorithm : clustering in n-th iteration; : {all constraints satisfied by } ; 1. Start from an initial clustering , which satisfies , a subset of all constraints; n=1; 2. Find a new clustering that decreases the objective and also satisfies ; 3. = {all constraints satisfied by } ; 4. Repeat 2-4 until the objective (nearly) cannot be decreased; Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Initial Clustering • Trivial one • : a random partition • : constraints satisfied by , may be empty • A more informative one for MPT • : label pitches according to pitch order in each frame: highest, second-highest, third.., fourth… • : will contain all cannot-links Frequency Frequency … … Time Time Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Find A New Clustering • 1. Satisfy current constraints • 2. Decrease the objective function : satisfied cannot-link : unsatisfied cannot-link : satisfied must-link : unsatisfied cannot-link • Swap set: A connected subgraph between two clusters. • Traverse all swap sets until finding a new clustering that decreases the objective function 1 3 2 7 1 1 3 3 6 5 2 2 3 4 8 7 7 6 6 5 5 3 3 4 4 8 8 Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Algorithm Review : partition of points into clusters : feasible solution space under constraints Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Experiments • Data set • 10 J.S. Bach chorales (quartets, played by violin, clarinet, saxophone and bassoon) • Each instrument is recorded individually, then mixed • Ground-truth pitch trajectories • Use YIN on monophonic tracks before mixing • Input pitch estimates • Our previous work in [1] • Input accuracy: 70.0+-3.1% [1] ZhiyaoDuan, Bryan Pardo and Changshui Zhang, “Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-peak Regions”, IEEE Trans. Audio Speech Language Process., in press. Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Overall Multi-pitch Tracking Results Mean % of correct pitch estimates Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Among Correctly Estimated Pitches Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
An Example Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
An Example Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Conclusion • Formulate the song-level Multi-pitch Tracking problem as a constrained clustering problem • Objective: timbre consistency • Constraints: pitch locality • Existing constrained clustering algorithms do not apply due to problem properties • Propose a new constrained clustering algorithm • Experimental results are promising Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu
Thanks you! Northwestern University, Interactive Audio Lab. http://music.cs.northwestern.edu