680 likes | 975 Views
Frequency Domain Coding of Speech. 主講人:虞台文. Content. Introduction The Short-Time Fourier Transform The Short-Time Discrete Fourier Transform Wide-Band Analysis/Synthesis Sub-Band Coding. Frequency Domain Coding of Speech. Introduction. Speech Coders. Waveform Coders
E N D
Content • Introduction • The Short-Time Fourier Transform • The Short-Time Discrete Fourier Transform • Wide-Band Analysis/Synthesis • Sub-Band Coding
Frequency Domain Coding of Speech Introduction
Speech Coders • Waveform Coders • Attempt to reproducing the original waveform according to some fidelity criteria • Performance: successful at producing good quality, robust speech. • Vocoders • Correlated with speech production model. • Performance: more fragile and more model dependent. • Lower bit rate
Frequency-Domain Coders • Sub-band coder (SCB). • Adaptive Transform Coding (ATC). • Multi-band Excited Vocoder (MBEV). • Noise Shaping in Speech Coders.
Frequency Domain Coding of Speech The Short-Time Fourier Transform
Definition of STFT • Filter Bank Interpretation • Block Transform Interpretation Interpretations:
Filter Bank Interpretation f (m) Analysis Filter is fixed at 0.
h(n) x(n) h(n) . . . h(n) h(n) Filter Bank Interpretation
0 Filter Bank Interpretation Modulation
0 Filter Bank Interpretation Modulation Lowpass Filter
h(n) x(n) h(n) . . . h(n) h(n) Filter Bank Interpretation Modulated Subband signals
Block Transform Interpretation n is fixed at n0. Analysis Window FT of Windowed Data Windowed Data
n1 n2 n3 nr Block Transform Interpretation n is fixed at n0. . . .
In what condition we will have Analysis Analysis/Synthesis Equations Synthesis
Analysis Analysis/Synthesis Equations Synthesis Replace r with n+r
Therefore, if Analysis Analysis/Synthesis Equations Synthesis
Therefore, if Analysis Analysis/Synthesis Equations Synthesis More general,
Examples h(0)x(n)
Frequency Domain Coding of Speech The Short-Time Discrete Fourier Transform
In what condition we will have Definition of STDFT Analysis: Synthesis:
Synthesis We need only one period. Therefore, the condition is respecified as:
Frequency k n 0 Spectrogram Implementation Consideration
Frequency k n 0 Spectrogram Sampling R 2R 3R 4R
In what condition we will have Sampled STDFT Analysis: Synthesis:
In what condition we will have Sampled STDFT Analysis: Synthesis:
Frequency Domain Coding of Speech Wide-Band Analysis/Synthesis
h(n) x(n) Short-Time Synthesis --- Filter Bank Summation STFT Lowpass Filter
|H(ej)| |Hk(ej)| k Short-Time Synthesis --- Filter Bank Summation Lowpass filter Bandpass filter
h(n) x(n) Lowpass Filter hk(n) x(n) Bandpass Filter Short-Time Synthesis --- Filter Bank Summation Lowpass representation of for the signal in a band centered at k.
h(n) x(n) Lowpass Filter hk(n) x(n) Bandpass Filter Short-Time Synthesis --- Filter Bank Summation Encoding one band Decoding one band
h0(n) h1(n) x(n) hN1(n) Short-Time Synthesis --- Filter Bank Summation . . . Analysis Synthesis
h0(n) h1(n) x(n) hN1(n) Short-Time Synthesis --- Filter Bank Summation . . . Analysis Synthesis
h0(n) x(n) hN1(n) Short-Time Synthesis --- Filter Bank Summation h1(n) . . . Analysis Synthesis
1 0 1 2 3 4 5 2 Equal Spaced Ideal Filters N = 6
h0(n) h1(n) . . . x(n) hN1(n) Equal Spaced Ideal Filters What condition should be satisfied so that y(n)=x(n)?
Equal Spaced Ideal Filters Time-Aliased version of h(n) Equal spaced sampling of H(ej) Inverse discrete FT of H(ej)
h(n) n L1 0 Equal Spaced Ideal Filters In case that N L, Consider FIR, i.e., h(n) is of duration of L samples.
h0(n) h1(n) . . . x(n) h(n) n L1 0 hN1(n) Equal Spaced Ideal Filters x(n) can always be Reconstructed if N L,
h0(n) h1(n) . . . x(n) h(n) n L1 0 hN1(n) Does x(n) can still be reconstructed if N<L? Equal Spaced Ideal Filters If affirmative, what condition should be satisfied? x(n) can always be Reconstructed if N L,
h0(n) h1(n) . . . x(n) hN1(n) Equal Spaced Ideal Filters p(n)
Equal Spaced Ideal Filters Signal can be reconstructed If it equals to (nm). p(n)
p(n) N 2N N 0 N 2N 3N 4N h(n) 1/N 2N N 0 N 2N 3N 4N Typical Sequences of h(n) Ideal lowpass filter with cutoff at /N.
p(n) N 2N N 0 N 2N 3N 4N h(n) h(0) 2N N 0 N 2N 3N 4N 2L L L 2L 3L 4L Typical Sequences of h(n) N L