1 / 18

Recognition and Analysis of Melodies from Polyphonic Musical Audio Data

Recognition and Analysis of Melodies from Polyphonic Musical Audio Data. Karin Dressler Fraunhofer IDMT, Konstanz University, TU Ilmenau Advisor: Prof. Dr. Karlheinz Brandenburg. Graduate School ISMIR 2004. Level of Abstraction. Sampled waveform (WAV – file). Time stamped events

forever
Download Presentation

Recognition and Analysis of Melodies from Polyphonic Musical Audio Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Recognition and Analysis of Melodies from Polyphonic Musical Audio Data Karin Dressler Fraunhofer IDMT, Konstanz University, TU Ilmenau Advisor: Prof. Dr. Karlheinz Brandenburg Graduate School ISMIR 2004

  2. Level of Abstraction Sampled waveform (WAV – file) Time stamped events (MIDI-file) Music notation (Humdrum)

  3. What is melody? • „A rhythmical succession of single tones, ranging for the most part within a given key, and so related together as to form a musical whole, having the unity of what is technically called a musical thought, at once pleasing to the ear and characteristic in expression.“ • „The air or tune of a musical piece.“

  4. What is melody? • „A rhythmical succession of single tones, ranging for the most part within a given key, and so related together as to form a musical whole, having the unity of what is technically called a musical thought, at once pleasing to the ear and characteristic in expression.“ • „The air or tune of a musical piece.“

  5. Is this melody? [ Goto, 2000 ] • „The melody and the bass line have a harmonic structure.“ • „The melody line has the most predominant harmonic structure in middle and high frequency regions...“ • „The melody ... tends to have temporally continuous trajectories.“

  6. Is this melody? [ Goto, 2000 ] • „The melody and the bass line have a harmonic structure.“ • „The melody line has the most predominant harmonic structure in middle and high frequency regions...“ • „The melody ... tends to have temporally continuous trajectories.“ • Just a good first guess

  7. Algorithm Overview Audio file • Windowing and FFT >> logarithmic magnitude spectrum • Extract sinusodial components • Link sinusodials between frames in and find onsets of trajectories • Discriminate fundamentals and harmonics and choose candidates for melody • Choose most probable succession of notes according to music theory and psychoacoustics Spectral Analysis Extraction of Sinusodials Trajectories Onset detection Tone candidates Classification MIDI file

  8. Spectral Analysis I • Many ways: DFT, Ear models, WT, Constant Q Transform, Warped FFT, filterbanks ... • My Choice: Discrete Fourier Transform >> FFT • Contra: • Linear frequency resolution • Pro: • „Simple“ output ( linear phase, linear frequency resolution ) • Analysis functions are sinusodials • Fast • Simple monophonic example:

  9. Spectral Analysis II

  10. Deterministic Analysis I • Segmentation ( binary decission ) • Discriminate sinusoids from „noise“ • Melody: deterministic part • Sinusodial models • Deterministic plus stochastic decomposition [Serra, 1989]

  11. Deterministic Analysis II

  12. Trajectory objects • Partial trajectories • Frame linking • With respect to human auditory system ( magnitude and frequency changes) • Pixelwise representation >> object representation (labeling) • Deletion of small objects

  13. Frame Linking

  14. Deleting small objects

  15. Classification I • Assign a meaning to single objects • Classes: • First step: fundamentals, harmonics, noise • Second step: melody • Melody candidates • Classification based on music theory and psychoacoustics • MIDI note number, duration • Perceived loudness, toneness, masking effects... • Musicology: Which schema/model one should use? • Voice leading rules [Huron, 2001] • The Implication-Realization Model [Narmour, 1991] • Statistical Analysis of annotated melodies ( MIDI ) • Which classifier can modell those schemata?

  16. Classification II

  17. Another Example

  18. The End...

More Related