Machine Discoveries: A few Simple, Robust Local Expression Principles

Machine Discoveries: A few Simple, Robust Local Expression Principles Written by Gerhard Widmer presented by Siao Jer, ISE 575b, Spring 2006

Presentation Overview • General Overview • Introduction • Training Data • Target Classes • Experimental Results • Quantitative Validation • Conclusion • Future Research

Gerhard Widmer • Head of the Department of Computational Perception at Johannes Kepler University Linz, Austria • Head of Machine Learning, Data Mining, and Intelligent Music Processing Group at the Austrian Research Institute for Artificial Intelligence • Numerous publication, awards, projects

General Overview • Discovering rules of expressive music performance • Inductive machine learning • Experiments with large data sets • Simple and general principles • Robust with surprisingly high level of accuracy

Introduction • What do performers do to make music “come alive?” • Studies done through a few classical approaches • Proposal of inductive machine learning • No preconceptions and expectations • Huge data sets allowed for more validity

Introduction • Previous work • Success in ability of machine learning (Widmer 2000) • Extremely complex • Attempt to find a complete model • Current goals • Testing new learning algorithm based on partial models • Learn rules of timing, dynamics, articulation • Testing degrees of fit over various styles and performers

Training Data • 13 complete Mozart piano sonatas • Performed by Roland Batik • On computer monitored grand piano • MIDI format • Includes hammer speed, impact times, pedal movements measured & xform’ed • Written score coded into computer format • Timing, dynamics, & articulation computed • 106,000 total notes • Melody restriction limits us to 41,000 notes

Target Classes • Objective: find note-level rules • Limit predictions to categorical decisions • Timing Dimension: note N is considered lengthened • If the note is lengthened relative to the instantaneous tempo over the previous note • If lengthened relative to local tempo over the last 20 notes • Analogous to this is a note shortened

Target Classes • Dynamics: louder if • Louder than previous note • And louder than average level of piece • Analogous to this is softer • Articulation • Staccato if played duration ratio (PDR) is less than 0.8 • Legato if greater than 1.0 • Portato otherwise, but study only concerned with staccato and legato • Pedaling not taken into account for articulation • Notes do not necessarily have to fall into one of these classes

Learning Partial Rule-based Models • No expectation to cover and describe all instances • Describe parts and define in meaningful terms • PLCG algorithm developed with these ideas in mind • Goal to come up with rules that covered lots of cases with good accuracy

Learning Partial Rule-based Models • General Steps • Separation into subsets • Learning partial rules within subsets • Merge all rules • Clustering of rules • One generalization per cluster • Optimize trade-offs (coverage vs. accuracy) • Result: 383 specialized rules narrowed to 17 general rules

Experimental ResultsTiming: Lengthening Notes • "Lengthen the middle note in a “cummulative” 3-note rhythm situation (ie, given 2 notes of equal duration followed by a longer note, lengthen the note that precedes the final, longer one).” • Most important one as it has highest prediction value • “Lengthen a note if it is followed by substantially longer note (ie the ratio between its duration and the duration of the next note is < 1:3)” • “Lengthen a note if it preceds an upward melodic leap of more than a perfect forth, if it is in a metrically weak position, and if it is preceded by (at most) stepwise motion” • 2 cases above have atleast 70% prediction rate

Experimental ResultsTiming: Lengthening Notes • “Lengthen a note if it preceds an upward melodic leap of more than a perfect forth, if it is in a metrically weak position, and if it is preceded by (at most) stepwise motion” • More of a “tendancy” than a rule • Interesting Note: • previously observed • But not over such a large data set

Experimental ResultsTiming: Shortening Notes • Difficult, but understandable • No strong rules, but a few tendencies • “Shorten a note in a sequence PN-N-NN if it is longer than its predecessor and longer than its successor.” • “Shorten a note in fast pieces in 3/8 time if the duratio ratio between previous note and current note is larger than 2:1, the current note is at most a sixteenth, and is again followed by a longer note.” • Example of a specialized rule • Correlation with Gabrielsson 1987

Experimental ResultsDynamics: Stressing Notes • Clear rules emerge, low coverage • Interesting note: relating stress to • melodic contour • Upward melodic movement • Observation by previous research as well • “Stress a note by playing it louder if it is preceded by an upward melodic leap larger than a perfect fourth.”

Experimental ResultsDynamics: Stressing Notes • “Stress a note by playing it louder if it forms the apex of an up-down melodic contour and is preceded by an (upward) leap larger than a minor third.” • “Stress a note by playing it louder if it at least twice as long as its predecessor, is reached by upward motion, and is in a quite strong metrical position.”

Experimental ResultsDynamics: Attenuating Notes • Difficult to predict • “Attenuate a note by playing it softer if it is less than 1/5 the duration of its predecessor.” • “Attenuate a note by playing it softer if it is preceded by a downward leap larger than a major third, is metrically weak, and is preceded by a note at least 1/3 of a beat long.” • “Attenuate a note by playing it softer if it is preceded by a downward leap larger than a perfect fifth and is metrically weak.” • Observation: linking metrically weak notes reached by downward leaps

Experimental ResultsArticulation Staccato • Most easily predictable, 4 strong rules • “Play a note staccato if the note is marked with a staccato dot in the score.” • “Play a note staccato if it is followed by a note of the same pitch (ie the interval between the note and its successor is a unison).” • Observations: • Combine for +90% accuracy & 6,000 cases • Previously observed in KTH Rules (Friberg 1995) • Physical reasons and explanations

Experimental ResultsArticulation Staccato • “Insert a micropause after a note if it precedes an upward leap larger than a perfect fourth and is metrically weak.” • “Insert a micropuase after a note of it is reached by downward motion and is followed by a note more than twice as long (ie the ratio between its duration and duration of the next note is < 0.4).” • Observations: • Correlation to lengthening rules • Supported by “Cumulative Rhythm” (Nramour 1977) • Articulation Staccato  30% of expression observed

Experimental ResultsArticulation Legato • Most difficult to predict • A LOT fewer instances vs. staccato • No markings on score • Low prediction rate (53.7%) • “Play a note legato if it is not marked staccato in the score, if it forms the apex of an up-down melodic contour, if it is quite short (<1/3 of a beat), and is metrically quite strong.” • Observations: • Melodic peak  legato?

Quantitative Validation:Generality I • Different Performer (Philippe Entremont) • Same pieces • No significant degradation in coverage and accuracy • Exception of “softer” • Higher coverage in • “lengthen” • “louder” • “staccato”

Quantitative Validation:Generality II • Testing on Different Styles & Artists • 2 Chopin pieces • 22 skilled pianist from Univ. of Music in Vienna • Surprising Results • “softer” and “legato”  unpredictable • “louder”  high % of positive examples, but high level of false predictions too • “lengthen,” “shorten,” & “staccato”  extremely good • Need more diversity of pieces

Conclusion • Small Step • Basic & simple rules • Robust model of local expression principles • Observations from other researchers • Autonomous discovery • Large data sets • Possible foundation

Further Research • Further evaluation of rules • different performers • Different types of music • Extension to other dimensions • (e.g. Harmony) • Going beyond note level • (e.g. phrase structure) • Comprehensive multi-level model

Machine Discoveries: A few Simple, Robust Local Expression Principles