1 / 13

Pitch and Amplitude Perturbation (Jitter and Shimmer)

Pitch and Amplitude Perturbation (Jitter and Shimmer). Basic idea: Phonated speech is called quasiperiodic, with quasi being Latin for “sort of” or “more-or-less. Naturally spoken voiced speech is never perfectly periodic.

angelb
Download Presentation

Pitch and Amplitude Perturbation (Jitter and Shimmer)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Pitch and Amplitude Perturbation (Jitter and Shimmer) Basic idea: Phonated speech is called quasiperiodic, with quasi being Latin for “sort of” or “more-or-less. Naturally spoken voiced speech is never perfectly periodic. Random cycle-to-cycle variability in f0(or, equivalently, t0 – the fundamental period) is called pitch perturbation or jitter. Random cycle-to-cycle variability in the amplitude of glottal pulses is called amplitude perturbation or shimmer.

  2. The jitter concept is quite simple. This signal might look perfectly periodic, but it isn’t. There are small, more-or-less random differences in the fund. period from one cycle to the next.

  3. How Jitter is Measured: Mean Jitter Mean Jitter = sum of (abs) period diffs / number of diffs = 0.2 + 0.1 + 0.0 / 3 = 0.3 / 3 = 0.1 ms In English: MeanJ = SumOfAbsDiffs / ndiffs

  4. Percent Jitter It is far more common to represent jitter as a percentage of the average period: Percent Jitter = MeanJ/MeanPeriod x 100 Our example from the previous slide: MeanJ = 0.1 ms; MeanPeriod = 8 ms Percent Jitter = 0.1/8 * 100 = 0.125/8.0 x 100 = 1.25%

  5. What We Know About Jitter • Jitter values in non-dysphonic voices are quite small – about 0.5%. • There is a good deal of research to show that jitter values in dysphonic voices are significantly larger. • Jitter is thought to be associated with a sensation of roughness in the voice. • For that reason, it has been proposed as an objective correlate of either roughness or overall dysphonia.

  6. Shimmer: More-or-less random cycle-to-cycle variation in voice amplitude (vocal intensity) rather than f0.

  7. What We Know About Shimmer • Shimmer values in non-dysphonic voices are quite small – usually less ~0.7 dB. • As with jitter, shimmer values in dysphonic voices are significantly larger. • Shimmer is thought by some to be associated with a sensation of roughness in the voice – but this is not settled. You’ll be able to decide this on your own in a few minutes.

  8. Synthetic Continuum Varying in Jitter 0.0% 2.0% 0.2% 2.5% 0.4% 3.0% 0.6% 4.0% 0.8% 5.0% 1.0% 6.0% 1.5%

  9. Shimmer calculation: There are calculations for shimmer that are analogous to the ones we saw earlier for jitter – the average absolute difference in amplitude between adjacent periods. (There is no percent shimmer, for reasons we won’t worry about.)

  10. Synthetic Continuum Varying in Shimmer 0.00 dB 1.60 dB 0.20 dB 1.80 dB 0.40 dB 2.00 dB 0.60 dB 2.25 dB 0.80 dB 2.50 dB 1.00 dB 2.75 dB 1.20 dB 3.00 dB 1.40 dB

  11. Jitter Versus Shimmer Do pitch perturbation and amplitude perturbation produce the same kinds of sound qualities? Jitter (6%) Shimmer (3 dB) Judge for yourself, but these sound quite different to my ear. Jitter continuum: Clear to very rough, Shimmer continuum: Clear to unnaturally crackly, somewhat like static.

  12. One More Problem: Measurement Accuracy Jitter and shimmer measurements require that the starting and ending times of every pitch pulse be measured with a lot of precision. This is waaaay too time consuming to do by hand, so a computer algorithm is needed. Measuring the t0 is not too hard for highly periodic voices. But what voices are the most interesting? Highly periodic voices or dysphonic voices that show imperfect periodicity?

  13. This is the problem: The voices that are most interesting are exactly the ones that are the hardest to measure. Also, the quantities being measured (e.g., jitter) are quite small – only a few percent even in dysphonic voices – so even a few errors can make a big difference. There are commercial systems available for measuring perturbation. They’re well designed, but they will sometimes make mistakes. This doesn’t mean that you shouldn’t use them, but when the computer gives you a measurement that doesn’t agree with what your ear tells you, your ear could easily be right.

More Related