390 likes | 567 Views
Functional Data Analysis for Speech Research. Michele Gubian Radboud University Nijmegen The Netherlands London, March 24 th 2010 Cambridge, March 26 th 2010. Content. What and why Functional Data Analysis (FDA) Motivation Case study 1 Case study 2 – pitch re-synthesis
E N D
Functional Data Analysis for Speech Research Michele Gubian Radboud University Nijmegen The Netherlands London, March 24th 2010 Cambridge, March 26th 2010
Content • What and why Functional Data Analysis (FDA) • Motivation • Case study 1 • Case study 2 – pitch re-synthesis • How to use FDA • Using the R package ‘fda’
dur ext 58 48 98 … 2.8 3.8 2.9 … x ext ? x x x dur Analyzing curves PCA ANOVA Linear models
x ext ? x x x dur Problems • Decide what are the important features of a curve using • models • intuition / trial and error However • Those features may not capture all the relevant dynamic aspects • e.g. concavity/convexity • long range correlatioins
x ext ? x x x dur Problems (2) • Identify those feature points • manually • (semi)automatically However • The identification may be hard, even ill-posed • time consuming • risk of subjective judgment
x ext ? x x x dur Analyzing curves with FDA Functional Data Analysis
Analyzing curves with FDA • All the information contained in the curve (dynamics) is used • No need to reduce a curve to a set of significant features • No need to introduce assumptions on what is relevant in a curve shape and what is not • FDA provides both VISUAL and QUANTITATIVE results • input is curves, output is also curves • plus classic statistical output like p-values, confidence intervals …
Example x x x salary salary x x x x x age age Functional Data Analysis: an extension of (some) statistical techniques to the domain of functions CLASSIC FDA • Ask people: • How old are you? • How much do you earn? • Each data point is a point in 2D • Record people salary through the years • Each “data point” is a whole CURVE
Diphthong vs. hiatus in Spanish • /ja/ vs. /i.a/ contrast is unstable in European Spanish • Diachronically, in Romance languages /i.a/ becomes /ja/ • Diatopically, in Latin American Spanish the contrast seems to be lost • It is not present in orthography (“ia” in either case) • No strict minimal pairs • Investigate • Consistent realization of the contrast • Inter-speaker variation • Cues used in the realization
Cues DIPHTHONG /ja/ HIATUS /i.a/ • Duration • Formants • Pitch short long f2 f2 f1 f1 f0 f0
Dataset • Read speech • Diphthong ‘Emiliana no, …’ /e.mi.lja.na#no#.../ (‘Not Emiliana, …’) • Hiatus ‘Mi liana no, … ‘ /mi#li.a.na#no#.../ (‘Not my liana, …’) • 9 speakers (gender balanced) • 20 repetitions per speaker per type • In total 365 utterances
lja li a Pitch • Pitch was extracted from the beginning of /l/ to the end of the rising gesture • In Spanish the pitch rising peak falls beyond the accented syllable
speaker /ja/ vs /i.a/ The raw data
FDA data preparation • Each sampled curve has to be turned into a function • Decide how much detail to retain (smoothing)
B-spline FDA data preparation (2) • All functions will be obtained by a combination of so-called basis functions, usually B-splines • All functions will be linearly stretched in time to become of equal duration Functional representation
PC1 x PC2 x x x x x x x x x x x x x x x x x x salary x x x x x x x x x x x x x x x x x 25 65 age ClassicPrincipal Component Analysis (PCA)
Functional PCA on pitch contours PCA does not know about labels !!
PC2 Functional PCA on pitch contours
f1 f2 PC1 PC2 Functional PCA on formants
Functional PCA on formants PC1 PC1
Duration vs formants Duration vs pitch Cues coordination
Summary • FDA provides tools to extract relevant dynamic characteristics of a set of curves • Traditional tools like PCA (and linear regression) are extended to curves • Functional PCA revealed the main dynamic cues used in the realization of a (weak) contrast in Spanish • Without using the labels information • Without extracting features from the curves (e.g. peaks) • Combining multi-dimensional curves (formants) without effort
References • Functional Data Analysis website: www.functionaldata.org • Books: • Software: a bilingual (R and MATLAB) tool is freely available online
Functional linear models y(t) = a(t) + b(t) x diphthong, x = 0 hiatus, x = 1 Confidence intervals for a(t) and b(t) R2(t) = percentage of explained variance