Short-Term Reorganization of Auditory Analysis Induced by Phonetic Experience • Liebenthal et al. (2003). JoCN. • Audrey Kittredge • 593: Neuroimaging of Language
MRI: physics • Hydrogen nuclei act as magnets (spinning, charged particle)
MRI: physics • In strong magnetic field: spin-axes form vector parallel to field
MRI: procedure • Radio Frequency pulse • Changes direction and strength of vector • Eventually, nuclei relax and vector returns to original position • As nuclei relax, they give out a signal • Signal type depends on water/fat ratio of tissue --> MRI images!
Functional MRI • Oxygenated hemoglobin shows up better than deoxyhemoglobin on MRI, so • Brain areas with more oxygenated blood will show up better (BOLD: blood-oxygen-level-dependent signal)
Connection to neural activity? • Increase in net neural activity --> increase in oxygenated blood supply (slow) • Quick succession of images: BOLD signal at various times
Pros • Good spatial resolution • Less risky, faster acquisition than PET • Event-related design
Cons • Poor temporal resolution • BOLD signal degraded near air/bone boundary • Movement artifacts • High speed data acquisition = noisy!
Phonetic perception • How does this occur? • Automatic phonetic analysis module (Liberman & Mattingly, 1989) • Stimulus-independent auditory analysis (Kluender & Greenberg, 1989)
Past Research • PET, fMRI studies • Speech vs nonspeech: superior temporal cortex
Problem! • Confound: perception or stimuli? • Goal: study perception mode independent of stimulus properties • How do we do this?…
…Sinewave speech! • Sinewave example
Original sentence • “The steady drip is worse than a drenching rain”
Sinewave speech: properties • Sinusoid fit to center frequency and amplitude (over time) of F1-F3 or F4 • Result: rapidly changing pure tones • Lack fine-grained acoustic properties of speech
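A minimal sketch of the synthesis idea on this slide, assuming the formant center frequencies and amplitudes have already been measured frame by frame. The track values, frame duration, sample rate, and the function name `synthesize_sinewave` are hypothetical, chosen only for illustration; this is not the procedure used by Liebenthal et al.

```python
import math

def synthesize_sinewave(freq_tracks, amp_tracks, frame_dur=0.01, sample_rate=8000):
    """Sum one time-varying sinusoid per formant track.

    freq_tracks / amp_tracks: one list per formant, one value per frame
    (Hz and linear amplitude). Values are held constant within a frame.
    """
    n_frames = len(freq_tracks[0])
    samples_per_frame = int(round(frame_dur * sample_rate))
    # A running phase per tone keeps each sinusoid continuous across frames.
    phases = [0.0] * len(freq_tracks)
    out = []
    for frame in range(n_frames):
        for _ in range(samples_per_frame):
            value = 0.0
            for k, (freqs, amps) in enumerate(zip(freq_tracks, amp_tracks)):
                phases[k] += 2.0 * math.pi * freqs[frame] / sample_rate
                value += amps[frame] * math.sin(phases[k])
            out.append(value)
    return out

# Hypothetical three-formant tracks over 4 frames (not measured data).
f_tracks = [[300, 350, 400, 450], [1200, 1300, 1400, 1500], [2500, 2500, 2600, 2600]]
a_tracks = [[1.0, 1.0, 0.8, 0.6], [0.5, 0.5, 0.4, 0.3], [0.2, 0.2, 0.2, 0.1]]
signal = synthesize_sinewave(f_tracks, a_tracks)
```

Each formant is replaced by a single pure tone, so the result keeps the coarse frequency and amplitude movement of the utterance while dropping its fine-grained acoustic detail.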
Past studies on sinewave speech • Remez et al. (1981): • “Describe”: most say non-speech • “Transcribe”: most write all/some of sentence correctly
Tone-matching Task (Remez et al., 2001) • Stimuli • Sinewave word e.g. juice • Isolated T2 from T123/4 complex • Task: is tone constituent of complex? • Listeners can do this… • When uninformed (not speech) • While matching tone complex to printed word • Difficult task!
Creation of stimuli • Phonetic stimulus (sinewave word) • 3 lowest formants = 1 sinewave each • Tone probe • “True”: from word • “False”: from other sinewave word • Nonphonetic stimulus • T1 and T3 temporally reversed
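The nonphonetic manipulation can be sketched as follows, assuming it is applied to the per-tone parameter tracks (the slide does not say whether the reversal was done on the tracks or on the waveforms). The function name `make_nonphonetic` and the track values are hypothetical.

```python
def make_nonphonetic(tracks, reversed_tones=(0, 2)):
    """Return a copy of the tone tracks with the selected tones reversed in time.

    tracks: one list of per-frame values per tone (T1, T2, T3, ...).
    reversed_tones: zero-based indices of the tones to reverse (T1 and T3 here).
    """
    return [
        list(reversed(track)) if i in reversed_tones else list(track)
        for i, track in enumerate(tracks)
    ]

# Hypothetical frequency tracks in Hz for T1, T2, T3 (not measured data).
phonetic_freqs = [[300, 350, 400, 450], [1200, 1300, 1400, 1500], [2500, 2500, 2600, 2700]]
nonphonetic_freqs = make_nonphonetic(phonetic_freqs)
```

T2 is left untouched, so a T2 tone probe matches the phonetic stimulus and its nonphonetic counterpart equally well.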
Pilot studies • Phonetic stimuli: transcribed with 52.1% accuracy, multiple choice 89.5% accuracy • Rated as “Clearly identifiable word”: • 61% phonetic • 22% nonphonetic • Rated as “Nonspeech”: • 58% nonphonetic • 20% phonetic
Stimuli: summary • 288 stimuli total • 108 pairs of phonetic, nonphonetic stimuli • 1/3 repeated • 1/2 trials = false
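One reading of these numbers that adds up, offered as an assumption rather than the authors' stated breakdown: the 108 pairs give 216 distinct stimuli, and a third of those are presented a second time.

```python
pairs = 108
distinct_stimuli = pairs * 2             # one phonetic + one nonphonetic per pair
repeated = distinct_stimuli // 3         # assumed: 1/3 of the distinct stimuli recur
total_stimuli = distinct_stimuli + repeated
false_trials = total_stimuli // 2        # half of the trials use a "false" tone probe
```

Under this reading the total comes to 288, matching the slide.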
Experimental Design • Practice --> Naïve 1 --> Naïve 2 --> Phonetic Practice --> Informed 1 --> Informed 2
Procedure • Practice • Stimuli: arbitrarily composed sinusoids • Sinewaves: same/diff pitch contour? • Tone-matching task (T2-T1234) • Naïve condition • “single tone”, “tone complex” • 2 blocks
Procedure • Phonetic practice • Sinewave stimuli: 8 sentences, 18 words • Chose from 4 transcriptions • Feedback given for every 5th sentence • Accuracy data collected • Informed condition • “words” • 2 blocks
Results: RT • Phonetic: • Test Block p < .04 (N1-N2 p < .02, N2-I1 p < .03, I1-I2 p < .05) • Nonphonetic: • Test Block p < .001 (N1-N2 p < .01) • In naïve condition, effect of stimulus type p < .04
Results: Accuracy • Phonetic: • No significant effect of Test Block p < .11 • Nonphonetic • No significant effect of Test Block p < .53 • In naïve condition, no effect of stimulus type p < .07
Results: Phonetic Form Practice • Sentence task: 84 +/- 21% accuracy • Words: 60 +/- 16% accuracy • Chance = 25% in both tasks
Results: Subjective Reports • 29/31 unaware of phonetic quality during naïve blocks • 13/31 recognized words during informed blocks
Conclusions: Behavior • Phonetic awareness interferes with task • Naïve: subjects perceived only auditory form • Informed: subjects perceived both, focused on auditory • NO explanation for stimulus RT difference in Naïve
Within each block… • Timeline (figure): eight 9 s intervals • 2 phonetic trials • 2 nonphonetic trials • Baseline (silence) • Clustered image acquisition
Image acquisition • 18 images per trial type per block • 36 images per condition/trial type • E.g. Naïve, phonetic
fMRI Images • 16 slices: • Axially oriented (horizontal) • Contiguous • 3x3x4mm voxels • Slice coverage: • Most of temporal lobes • Part of frontal and parietal lobes • Occipital lobe • Anatomical (MRI) images (1x1x1mm)
fMRI analysis: individuals • AFNI software package • Trial - Baseline-->BOLD difference maps • Difference maps: • averaged (BOLD vs baseline) • Voxel-wise ANOVA (sorted by trial type and condition)
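The Trial - Baseline step can be illustrated with a toy voxel-wise computation. This is a sketch in plain Python, not the AFNI interface; the function name `difference_map` and the intensity values are hypothetical.

```python
from statistics import mean

def difference_map(trial_images, baseline_images):
    """Voxel-wise mean(trial) - mean(baseline).

    Each image is a flat list of voxel intensities, all of the same length.
    """
    n_voxels = len(trial_images[0])
    return [
        mean(img[v] for img in trial_images) - mean(img[v] for img in baseline_images)
        for v in range(n_voxels)
    ]

# Two toy images per condition, three voxels each (made-up intensities).
trial = [[102.0, 100.0, 98.0], [104.0, 100.0, 99.0]]
baseline = [[100.0, 100.0, 100.0], [100.0, 100.0, 99.0]]
diff = difference_map(trial, baseline)
```

A positive value marks a voxel with a stronger signal during trials than during silence, a negative value the reverse.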
fMRI analysis: averaging • Individual statistical maps transformed into standard space • Talairach brain • Complicated statistics, smoothing… • t values at each voxel averaged across subjects
fMRI analysis: significance testing • Randomization testing: • t values ≥ .37 significant • uncorrected voxel-wise p < .001 • Activation foci < 300 µL removed
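To give a sense of scale for the 300 microliter cutoff, the sketch below converts it to a voxel count. It assumes the cutoff is applied at the 3x3x4 mm functional voxel size given earlier in the talk; since the maps were transformed to standard space, the actual voxel size at this step may differ. The name `keep_clusters` and the cluster sizes are hypothetical.

```python
import math

VOXEL_VOLUME_UL = 3 * 3 * 4   # 36 mm^3 per voxel; 1 mm^3 equals 1 microliter
MIN_CLUSTER_UL = 300          # foci smaller than this are removed

# Smallest cluster, in voxels, that reaches the volume cutoff.
min_voxels = math.ceil(MIN_CLUSTER_UL / VOXEL_VOLUME_UL)

def keep_clusters(cluster_sizes_in_voxels):
    """Keep only clusters whose volume is at least MIN_CLUSTER_UL."""
    return [n for n in cluster_sizes_in_voxels if n * VOXEL_VOLUME_UL >= MIN_CLUSTER_UL]

surviving = keep_clusters([5, 8, 9, 20])
```

Under this assumption an 8-voxel focus (288 microliters) is discarded, while a 9-voxel focus (324 microliters) survives.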
Phonetic: Informed-Naive • Left Heschl’s gyrus (HG/BA42) • Left posterior superior temporal gyrus (STG/BA 42/22) • Right HG/BA42
Phonetic Experience • Decreased activation = decreased task execution • Underlies reduced performance • Interference masks information like noise • STG • Primate HG/post STG analogues involved in complex sound analysis, auditory short-term memory (STM) • Left-lateralized • Specialization for speech
Phonetic Experience cont’d • No shift to other areas • No conscious phonetic perception • Phonetic experience induces “short-term functional reorganization of auditory analysis” and is contingent on “dynamic structure”
Phonetic: Informed-Naive • Dorsomedial thalamic nucleus • Superior frontal gyrus (BA8) • Left middle frontal gyrus (MFG/BA10)
Unexplained Results • Dorsomedial thalamic nucleus, medial prefrontal cortex: • Areas with reciprocal connections to each other and ST area • Connected neural system… • Engaged in task • Sensitive to interference
Nonphonetic: Informed-Naive • Left posterior STG (BA 42/22)
Phonetic: Blocks2-Blocks1 • Left middle frontal gyrus (BA9)
Nonphonetic: Blocks1-Blocks2 • Left inferior frontal gyrus (IFG/BA44)
Proficiency Effects • Left IFG, MFG: • Initial difficulty in verbal production task (Raichle et al., 1994) • Not cause of Informed-Naïve difference (no anatomical overlap)
Conclusions…? • “Centrality” of this function • Naïve: Phonetic vs nonphonetic RT • Reorganization contingent on speech? • Decreased activation: underlies reduced performance? • Proficiency/Informed: frontal overlap?
Methodology…? • Response/accuracy inclusion criteria? • RT/accuracy data not parallel • RT: correct, incorrect, true, false trials • Word length? • Age variation (18-57)? • Naïve: phonetic vs nonphonetic? (fMRI)
Some questions… • Role of thalamus/medial frontal areas? • Task difficulty --/--> activation increase
Some more questions… • Given phonetic practice, is reorganization entirely stimulus-driven? • How generalizable to normal speech-nonspeech analysis? • Original question: automatic phonetic module or auditory analysis?