680 likes | 791 Views
Focus Contrast in Web Harvested Data. Mats Rooth Linguistics and CIS Cornell University based on joint research with Jonathan Howell. Radio sites. Hundreds use Everyzing/Ramp technology Full ASR transcripts often available Time offset sometimes available
E N D
Focus Contrast in Web Harvested Data Mats Rooth Linguistics and CIS Cornell University based on joint research with Jonathan Howell
Radio sites • Hundreds use Everyzing/Ramp technology • Full ASR transcripts often available • Time offset sometimes available • Either URL of audio or RSS feed almost always available • Not not enough hits for one target on a single site • A lot or repetitions of same audio • Seemingly less “spontaneous” speech than on Everyzing
Youtube • Searchable closed captions, some obtained with ASR and some provided by video author • Time offset available on hit page and in URL • Youtube player can seek to a time • Transcript of snippet available • Full transcript not available • Not enough data now • Can hope that a lot of indexed spontaneous speech will become available
Reuters Insider • Searchable audio based on Everyzing/Ramp • Full transcripts available • Player seeks to timestamp
Goals Assemble large, focused datasets of examples where intonation varies in a way that correlates with syntax, semantics, or pragmatics. Study correlation between lexical/grammatical/pragmatic context and acoustic realization.
he stayed longer than I did -er [[ he he stayed x long]2 than [ IF stayed x long ]~2] [ y stayed x-long ] antecedent clause [ speaker stayed x-long ] scope of focus
… I should have liked that song a lot more than I did. [more x[[should w[ I like that song x well in w]] than [I like that song x well in w0]]]
I understand even less than I did before even less [[ I prs understand x much]2 than [I understood x much beforeF] ]~2]
Alternative semantics for focus -er [[ he he stayed x long]2 than [ IF stayed x long ]~2] [ y stayed x-long ] antecedent clause [ speaker stayed x-long ] scope of focus Semantics of focus is the set of alternative propositions of the form ‘y stayed x long’. Licensing condition for focus The proposition contributed by the antecedent is an element of the alternative set that is distinct from the proposition contributed by the scope.
Givenness/Entailment semantics for focus [ y stayed x-long ] antecedent clause [ speaker stayed x-long ] scope of focus Licensing condition for focus The antecedent entails the union of the alternative set (focus existential closure). If he stayed d long, then someone stayed d long.
Alternative semantics and givenness semantics are predictive theories of focus licensing, if the antecedent is stipulated. Almost always, the antecedent for focus in the than-clause is the main clause. With that hedge, grammar makes a prediction about where focus should go. Try to correlate this with acoustic signal.
Focus in comparative clauses Coherent semantic theory about where focus should go Possibilities are constrained, because the main clause is usually the antecedent for focus interpretation in the comparative clause On a theoretical basis, we often think we know the correct grammatical analysis of comparative sentences people use, including the features that determine focus Nice model system for studying contextual conditioning and phonetic realization of contrastive intonation
Automatic harvest procedure Replicates how a user would interact with website.
curl retrieve information designated by URL cutmp3 cut audio file given offsets awk process html awk, bash make control Time for one run retrieving 1000 hits is less than a day.
Classification experiment He stayed longer than IF did. s class antecedent: He stayed x long I should have liked that song a lot more than I didF. ns class antecedent: I should have liked that song x much I understand even less than I did beforeF I understand even x little ns class
SVM classifier in R statistical environement (e1071 package) 308 acoustic parameters extracted with Praat 91 tokens in cross-validated design (Several hundred more tokens with similar results.)
all parameters • duration of “I” only • duration of “I”, duration of “d” closure, formant difference 40% into “I”
Method suggested by comparatives experiment Find common grammatical or lexical contexts that trigger representations with different prosodic realization, according to relatively well-understood and well-supported theory. Correlate the semantic-grammatical categories directly with the speech signal using machine learning. Don’t worry about phonemic/morphemic categories like the accent types H* and L+H*, or assume they be annotated on the basis of pitch contour.
Fery and Ishihara (2009) Journal of Linguistics 45.3 SOF: Prenuclear Die meisten unserer Kollegen waren beim Betriebsausflug lässig angezogen. Nur Peter hat eine Krawatte getragen. Nur Peter hat sogar einen Anzug getragen.
He’s gotta pick someone who is younger than he is, and is definitely more conservative than he is. [-er [ t is d young than he is d young]]2 and more [[ t is is dconservativeF]3 than [ heF is d conservative ] ~3 ] ~2
+Generic corpus of focused pronouns The SVM classifier is good at detecting focused pronouns using local features on pronoun: Duration of vowel “I” [ai] Distance between f1 and f2 halfway into vowel “i” [ai]
Method suggested by comparatives experiment Find common grammatical or lexical contexts that trigger representations with different prosodic realization, according to relatively well-understood and well-supported theory. Correlate the semantic-grammatical categories directly with the speech signal using machine learning. Don’t worry about phonemic/morphemic categories like the accent types H* and L+H*, or assume they be annotated on the basis of pitch contour.
Inherently contrastive phrases in MY opinion ... admits that other things might be true in other people’s opinions NEXT Friday ... at end weekly Friday radio program on the TENOR saxophone ... in Jazz program where there is frequent mention also of the Alto saxophone
140 of> my mind 139 on> my part 134 in> my lifetime 125 in> my office 115 of> my family 108 with> my wife 106 on> my face 106 in> my house 99 on> my mind 96 over> my head 96 in> my family 91 for> my family 90 in> my face 1162 of> my life 1110 in> my life 681 in> my mind 377 in> my opinion 276 in> my view 231 in> my heart 217 of> my career 199 in> my career 183 in> my head 146 with> my life 146 with> my family 141 on> my way
+ Does general SVM pronoun focus classifier work on SOF tokens? + How common is SOF?
[you made a very small amount more than I did]2 [nowF I make muchFmore than youF do] ~2 2 is of the form required form of antecedent: at t speaker makes d-much more than hearer makes actual: at t hearer makes d-much more than speaker makes
two SOF tokens You made a very small amount more than I did. Now I make muchF more than youF do.
There is a correlation between the string context and prosody type? + Learn information-theoretically -- two distributions of acoustic pronoun realizations -- two distributions of trigram contexts that condition them