
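Speech Signal Processing. Textbook: 数字语音处理 (Digital Speech Processing), 华中科技大学出版社 (Huazhong University of Science and Technology Press). Reference book: 语音信号数字处理 (Digital Processing of Speech Signals), 电子工业出版社 (Publishing House of Electronics Industry).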



Presentation Transcript

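  1. Speech Signal Processing. Textbook: 数字语音处理 (Digital Speech Processing), 华中科技大学出版社 (Huazhong University of Science and Technology Press). Reference book: 语音信号数字处理 (Digital Processing of Speech Signals), 电子工业出版社 (Publishing House of Electronics Industry).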

  2. From prehistory to the new media of the future, speech has been and will be a primary form of communication between humans. • Speech will become the next major trend in the personal computer market in the near future.

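  3. Learning objective: to ensure smooth and natural information exchange between people, and between people and machines, in modern communications.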

  4. Representation of speech signals: waveform, parametric. Digitization components shown: anti-aliasing filter, preamplifier, A/D conversion.

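  5. Digital model of speech production (Speech Production Model): Excitation → Discrete System → Speech. The model is built from the physiological process of human speech production and the acoustic characteristics of the speech signal.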
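
The excitation, discrete system, speech chain on this slide can be sketched as an excitation signal driving an all-pole filter. This is only an illustration under stated assumptions: the slide does not specify the form of the discrete system, and the sampling rate, pitch period, and single 500 Hz resonance below are hypothetical values chosen for the example.

```python
import math

def impulse_train(n_samples, period):
    """Voiced-style excitation: one unit impulse every `period` samples."""
    return [1.0 if n % period == 0 else 0.0 for n in range(n_samples)]

def resonator(freq_hz, bandwidth_hz, fs):
    """Feedback coefficients [a1, a2] for a pole pair at the given frequency."""
    r = math.exp(-math.pi * bandwidth_hz / fs)   # pole radius (< 1, so stable)
    theta = 2 * math.pi * freq_hz / fs           # pole angle
    return [2 * r * math.cos(theta), -r * r]

def all_pole_filter(excitation, a, gain=1.0):
    """s[n] = gain * e[n] + a[0] * s[n-1] + a[1] * s[n-2] + ..."""
    out = []
    for n, e in enumerate(excitation):
        acc = gain * e
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                acc += ak * out[n - k]
        out.append(acc)
    return out

fs = 8000                                   # assumed sampling rate in Hz
excitation = impulse_train(400, period=80)  # assumed 100 Hz pitch
coeffs = resonator(500.0, 80.0, fs)         # assumed single resonance
speech = all_pole_filter(excitation, coeffs)
```

Replacing the impulse train with random noise would give an unvoiced-style excitation for the same filter.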

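  6. Speech signal processing methods. A speech signal is time-varying; only over a short interval can it be treated as approximately stationary. ☆ Short-time time-domain processing ☆ Short-time Fourier analysis ☆ Linear prediction of speech signals (Linear Prediction) ☆ Homomorphic filtering and cepstral analysis of speech signals ☆ Vector quantization (Vector Quantization) ☆ Hidden Markov models, HMM (Hidden Markov Model)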
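
Short-time time-domain processing can be sketched by cutting the signal into short frames and computing one value per frame. The example below computes short-time energy and zero-crossing rate with a rectangular window; the 20 ms frame length, 10 ms hop, and the synthetic test signal are assumptions made for illustration, not values from the slides.

```python
import math

def split_frames(signal, frame_len, hop):
    """Cut the signal into overlapping frames (rectangular window)."""
    last_start = len(signal) - frame_len
    return [signal[i:i + frame_len] for i in range(0, last_start + 1, hop)]

def short_time_energy(frame):
    """Sum of squared samples within one frame."""
    return sum(x * x for x in frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

fs = 8000                                   # assumed sampling rate in Hz
silence = [0.0] * 800                       # 0.1 s of silence
tone = [0.5 * math.sin(2 * math.pi * 200 * n / fs) for n in range(800)]
signal = silence + tone

frame_list = split_frames(signal, frame_len=160, hop=80)  # 20 ms frames, 10 ms hop
energies = [short_time_energy(f) for f in frame_list]
zcrs = [zero_crossing_rate(f) for f in frame_list]
```

Frames that fall in the silent part have zero energy, while frames inside the tone have high energy, which is the kind of contrast short-time measures are meant to expose.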

  7. Applications of speech signal processing • Speech Coding • Speech Synthesis • Speech Recognition • Speech Enhancement • Speaker Tracking / Sound Source Localization

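  8. Speech signal processing draws on linguistics, acoustics, cognitive science, physiology, psychology, mathematical statistics, and other disciplines. Technologies that use digital methods to transmit, store, synthesize, recognize, and enhance speech are the most important and most fundamental components of the entire digital communication network.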

  9. Speech Coding. The goal is to reduce the information rate, measured in bits per second, while maintaining the quality of the original speech waveform. PCM: 64 kb/s, waveform coder. LPC-10: 2.4 kb/s, parametric coder. GSM: 13 kb/s, hybrid coder. Diagram label: Digital Channel.
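
The bit rates on this slide can be compared with a little arithmetic. The sketch below assumes that the 64 kb/s PCM figure corresponds to 8 kHz sampling at 8 bits per sample, which is the usual telephone-band setting but is not stated on the slide; the function and variable names are illustrative.

```python
def pcm_bit_rate(sampling_rate_hz, bits_per_sample, channels=1):
    """Uncompressed bit rate in bits per second."""
    return sampling_rate_hz * bits_per_sample * channels

# Assumed PCM parameters: 8000 samples/s, 8 bits/sample, mono.
pcm_rate = pcm_bit_rate(8000, 8)

# Bit rates quoted on the slide, in bits per second.
coder_rates = {"PCM": 64000, "GSM": 13000, "LPC-10": 2400}

# Compression ratio of each coder relative to PCM.
ratios = {name: pcm_rate / rate for name, rate in coder_rates.items()}

# Storage needed for one minute of speech, in bytes.
bytes_per_minute = {name: rate * 60 // 8 for name, rate in coder_rates.items()}
```

Under these assumptions, GSM needs roughly one fifth of the PCM rate and LPC-10 less than one twenty-fifth.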

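  10. Speech Synthesis. Demo sentence: "Welcome to the Microsoft Research China Chinese speech synthesis system." Speech synthesized with TTS technology: http://research.Microsoft.com/speech/tts • 金山词霸 (Kingsoft PowerWord) / electronic dictionaries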

  11. Speech Enhancement. The goal is to improve the quality of degraded speech. Example: extracting the useful signal from music.

  12. Speech Recognition. Examples: QRIO, the home entertainment robot launched by SONY; IBM speech recognition software; Furby; "白雪公主" (Snow White).

  13. Sound card functions • A sound card enables the computer to: record sound from a microphone; output sound through a speaker; manipulate sound stored on a disk • A sound card contains: A/D and D/A converters; a sampling rate generator • Most sound cards use DMA to read and write audio data

  14. Common audio file formats. Two types of audio file formats: • Raw format, no header: raw • Self-describing format, with a file header: wav, midi, aiff, au, voc, mpeg, rm, ra, ram

  15. Audio file parameters • Sampling rate • Bits per sample • Channels
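
For a self-describing format such as WAV, these three parameters are stored in the file header. As a minimal sketch, the example below uses Python's standard `wave` module to build a small WAV file in memory and read the parameters back; the helper names and the 440 Hz test tone are hypothetical, and a real file path could be passed to `wave.open` instead of the in-memory buffer.

```python
import io
import math
import struct
import wave

def make_wav_bytes(fs=16000, n_frames=160):
    """Build a short mono 16-bit WAV file in memory (illustrative test data)."""
    pcm = b"".join(
        struct.pack("<h", int(10000 * math.sin(2 * math.pi * 440 * i / fs)))
        for i in range(n_frames)
    )
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)      # channels
        w.setsampwidth(2)      # bytes per sample, i.e. 16 bits
        w.setframerate(fs)     # sampling rate
        w.writeframes(pcm)
    return buf.getvalue()

def describe(wav_bytes):
    """Read sampling rate, bits per sample, and channel count from the header."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as w:
        info = {
            "sampling_rate": w.getframerate(),
            "bits_per_sample": 8 * w.getsampwidth(),
            "channels": w.getnchannels(),
            "frames": w.getnframes(),
        }
    info["bit_rate"] = info["sampling_rate"] * info["bits_per_sample"] * info["channels"]
    info["duration_s"] = info["frames"] / info["sampling_rate"]
    return info

info = describe(make_wav_bytes())
```

A raw file carries no such header, so the same three parameters would have to be known in advance to interpret its bytes.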

  16. In MATLAB, the wavread function opens a *.wav file and reads all of its data into an array; each value in the array lies in the range [-1, 1]. (Recent MATLAB releases replace wavread with audioread.) For example: [y,fs,Nbits]=wavread('c:\tts_30250.wav'); Result: fs = 16 kHz, Nbits = 16, y = […, 0.0002, -0.0009, 0.0081, …]
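
The [-1, 1] range comes from scaling the stored integer samples. The sketch below assumes 16-bit little-endian PCM and a division by 2^15 = 32768, which is one common convention and may not match MATLAB's scaling exactly. The three integer samples are hypothetical values chosen so that they round to the numbers shown on the slide; they are not taken from the actual file.

```python
import struct

def pcm16_to_float(raw_bytes):
    """Convert little-endian 16-bit PCM bytes to floats in [-1.0, 1.0)."""
    count = len(raw_bytes) // 2
    ints = struct.unpack("<%dh" % count, raw_bytes[:2 * count])
    return [v / 32768.0 for v in ints]

# Hypothetical stored samples (not read from tts_30250.wav).
raw = struct.pack("<3h", 7, -29, 265)
y = pcm16_to_float(raw)
rounded = [round(v, 4) for v in y]

# The extreme 16-bit codes map to the ends of the range.
extremes = pcm16_to_float(struct.pack("<2h", -32768, 32767))
```

With this scaling the most negative code maps exactly to -1.0, while the most positive code maps to a value just below 1.0.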

  17. Common speech software: Cool Edit Pro, Adobe Audition

  18. THANKS
