Quality of Service (QoS) Adaptation for Improved Speech Perception in Wireless Networks

Experiments on QoS Adaptation for Improving End User Speech Perception Over Multi-Hop Wireless Networks Tsuwei Chen, Mario Gerla, Manthos Kazantzidis, Yuri Romanenko, Ilya Slain http://www.cs.ucla.edu/NRL/wireless June 1999 {tsuwei,gerla,kazantz,yuri,ilyas}@cs.ucla.edu Mini Quality Of Service Conference during ICC 99

Motivation • Wireless networks characteristics: • high packet loss • highly varied packet delivery time • constantly changing QoS • Audio delivery applications characteristics: • Sensitive to packet loss • ~ 20% pkt loss  sharp perceptual quality degradation • Sensitive to delay jitter • Existing audio/video delivery Internet applications • e.g. VIC, VAT • Few are adaptive to changing network conditions • Not suitable for wireless environment

lr > TUpper lr > TUpper lr > TUpper lr > TUpper lr  TLower lr  TLower lr  TLower lr  TLower Adaptation for Audio Sources Caption only 8kHz 11kHz 22kHz 44kHz lr:Packet loss rate, TUpper, TLower:Upper and lower packet loss thresholds Goals • Keep connection as long as possible • reduce call setup overhead • Reduce sound quality to keep information quality • Improve sound quality only when information quality is met

..... server client Adaptive Speech Strategies internal interference channel fading congestion mobility Options • End-to-End vs network QoS feedback • Source vs network rate/stream adaptation security breach • Server Adjustable Parameters: • sampling rate • source encoding • packet size • encryption • End-to-End ARQ • Speech-to-Text • Client QoS monitoring: • packet loss • delay jitter • pkt noise external interference (jamming, environment noise) • Network QoS • Feedback: • congestion indication • Bdw advertising • S/N advertising • Network Adaptation: • layer thinning • link ARQ • channel encoding

Speech Recognition TTS Sync Multihop Testbed server client Increase in jitter network congested sampling rate is reduced packet size is reduced Increase in Packet loss channel noise/interference UCLA Adaptive Speech Experiment Audio source adapts to QoS feedback Wireless Network • QoS Monitoring: • - packet loss • - jitter • Adjustable Parameters • - sampling rate • - packet size Audio(UDP) Piggybacked Text Stream(UDP) Control(TCP) A d a p t a t I o n S t r a t e g y :

Server Player Text to Speech Client Player Caption File Speech Recognition TTS Sync SessionController SessionController TransportController TransportController QoS API QoS API Network Monitor Adapting To Very High Loss: Speech-To-Text and Text-To-Speech Audio File QoS feedback data Wireless Networks QoS feedback

Caption Embedding Scheme 0 sec 2 sec Voice file (.wav) Wireless packets #0 #1 #2 #3 #4 …….. Question: When you are driving @ at fifty-five miles per hour … Caption file (.cpt) 0 sec 2 sec For reliable caption delivery, each 2-second segment of text is replicated in several packets.

TestBed Virtual Interfaces 131.179.25.26 131.179.26.26 131.179.27.26 Intermediate node Linux Multihop Routing Server NT/95 Client NT/95 26 subnet 25 subnet 131.179.26.25 131.179.25.175 27 subnet 131.179.27.174 Additional Traffic Linux • Packets are forwarded using routing tables • In order to force multihop in confined environment, nodes are partitioned into subnets • WaveLan I, CSMA/CA, 915Mhz, 2Mbps, 700ft (200m) open, 100ft (30m) closed

Experiment Parameters • Speech Encoding • PCM (mono, 8-bit samples, 22Khz) • server on-demand extracts lower bit/sampling rates of 11Khz, 8Khz. • RTP-type packets • 240, 480 and 960 bytes. • Jitter measurement • Rate Adaptation Steps • Single Action For every QoS Feedback • IF loss > MAX_LOSS_THRESHOLD DOWNGRADE rate, payload • IF jitter > MAX_JITTER_THRESHOLD DOWNGRADE rate • IF loss < MIN_LOSS_THRESHOLD UPGRADE rate, payload • Speech Recognition • Microsoft Speech Engine • Limited Vocabulary of 1000 words • Voice and Synthesized Speech Synchronization • On time-window (2-sec) granularity • Speed (wpm) adaptation inside window

Emulation Experiment Sampling Rate of Audio Transmission follows Available Bandwidth. Available Bandwidth of Emulated Channel

No Adaptation • The audio stream is not responsive to the loss rates observed. • Significant quality degradation perceived with increased loss rate

Sampling Rate Adaptation • The rate used for the audio coding is reduced when loss rates are high. • The sampling rate directly follows loss rate trends resulting in better perception quality

Packet Size Adaptation • Packet Size adjustment is very effective in reducing loss rates • Packet Size adjustment does not have any audible side effects

Adaptation on Both Packet Size and Sampling Rate • Adaptation on both packet size and sampling rate results in the lowest loss rates (best perception quality)

Adapting To Very High Loss using SR/TTS • At very high loss rates, beyond a threshold, Speech Recognition is applied at the server and Text-To-Speech synthesis at the client side. • The speech is recognized at packet loss rates up to 99%

Conclusions • Perceived speech quality is substantially enhanced by payload size and rate adaptation • Recognizable speech is delivered in extreme conditions by Text-To-Speech translation Work in progress • Real/Synthetic Speech synchronization • explicit relative byte-to-word mapping • Production of Speech CODEC traces • CELP, SBC, A/u-Law, DSP TSpeech, GSM 6.10, G723, MP3 etc. • Large scale network emulation experiments • using GlomoSim platform • Adaptive Video-On-Demand • Network Feedback (QoS routing) • Multicast Stream Adaptation

Cellular vs Multihop Standard Base-Station Cellular Networks Instant Infrastructure, Multihop wireless Networks

Ad-Hoc Network Characteristics • Instantly deployable • Reconfigurable infrastructure • Node mobility • Heterogeneous nodes • big/small • fast/slow • Heterogeneous traffic • voice, image, video, data • Limited battery power • Multihopping • to save power • overcome obstacles • enhance spatial spectrum reuse

Ad-Hoc Network Applications • Disaster Recovery • flood, fire, earthquakes etc • Law enforcement • crowd control, border patrol etc • Search and rescue in remote areas • Sport events, festivals • Ad hoc nomadic, collaborative computing • Indoor network appliances • Sensor networks • Battlefield

Quality of Service (QoS) Adaptation for Improved Speech Perception in Wireless Networks

Quality of Service (QoS) Adaptation for Improved Speech Perception in Wireless Networks

Presentation Transcript

Routing in Wired Nets CS 215 W 01 - Mario Gerla

Ilya Zaliapin

Top Ilya

Mario Gerla Computer Science Dept, UCLA cs.ucla/NRL

Mario Gerla Computer Science Dept UCLA

Bluetooth Performance Mario Gerla CS Dept, UCLA gerla@cs.ucla

Pengkai Zhao, You Lu, Babak Daneshrad, Mario Gerla

MAC Reliable Broadcast in Ad Hoc Networks Ken Tang, Mario Gerla

Chen Avin Ilya Shpitser Judea Pearl Computer Science Department UCLA

Mario Gerla, Aiguo Fei, Jun-Hong Cui, Michalis Faloutsos

Daniela Maniezzo, UCLA Gianluca Villa, Politecnico di Milano Mario Gerla, UCLA

Mario Gerla cs.ucla/NRL

Mario Gerla Computer Science Dept UCLA

Kevin C. Lee, Uichin Lee, Mario Gerla

Ilya Zaliapin

Ilya Zaliapin

MAC Reliable Broadcast in Ad Hoc Networks Ken Tang, Mario Gerla

Ilya Silin