An End-to-end Architecture for Quality-Adaptive Streaming Applications in Best-effort Networks
Reza Rejaie
reza@isi.edu
USC/ISI
http://netweb.usc.edu/reza
April 13, 1999
Motivation
• Rapid growth in deployment of realtime streams (audio/video) over the Internet
• TCP is inappropriate for realtime streams
• The Internet requires end systems to react to congestion properly and promptly
• Streaming applications require a sustained consumption rate to deliver acceptable and stable quality
Best-effort Networks (The Internet)
• Shared environment
• Bandwidth is not known a priori
• Bandwidth changes during a session
• Seemingly-random losses
• TCP-based traffic dominates
• End-to-end congestion control is crucial for stability, fairness & high utilization
• End-to-end congestion control in a TCP-friendly fashion is the main requirement in the Internet
Streaming Applications
• Delay-sensitive
• Semi-reliable
• Rate-based
• Require QoS from the end-to-end point of view
[Diagram: source with encoder and adaptation at the server, TCP connections across the Internet, buffer, decoder, and display at the client]
The Problem
• Designing an end-to-end congestion control mechanism
• Delivering acceptable and stable quality while performing congestion control
Outline
• The End-to-end Architecture
• Congestion Control (The RAP Protocol)
• Quality Adaptation
• Extending the Architecture
  • Multimedia Proxy Caching
• Contributions
• Future Directions
The End-to-end Architecture
[Diagram: server side with archive, quality adaptation, error control, congestion control, and transmission buffer; client side with acker, playback buffer, buffer manager, and decoder; data and control paths across the Internet]
Outline
• The End-to-end Architecture
• Congestion Control (The RAP Protocol)
• Quality Adaptation
• Extending the Architecture
  • Multimedia Proxy Caching
• Contributions
• Future Directions
Previous Work on Congestion Control
• Modified TCP
  • [Jacob et al. 97], SCP [Cen et al. 98]
• TCP equation
  • [Mathis et al. 97], [Padhye et al. 98]
• Additive Increase, Multiplicative Decrease
  • LDA [Sisalem et al. 98]
• NETBLT [Clark et al. 87]
• Challenge: TCP is a moving target
Overview of RAP
• Decision Function
• Increase/Decrease Algorithm
• Decision Frequency
• Goal: to be TCP-friendly
[Diagram: the decision function drives the rate increase/decrease algorithm (+/-), invoked at intervals set by the decision frequency, adjusting the transmission rate over time]
Congestion Control Mechanism
• Adjust the rate once per round-trip time (RTT)
• Increase the rate periodically if no congestion
• Decrease the rate when congestion occurs
• Packet loss signals congestion
• Cluster loss
  • Grouping losses per congestion event
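A minimal sketch of how losses could be grouped into congestion events so that the sender backs off at most once per event. The class and method names (LossClusterer, report_loss) and the one-RTT clustering window are assumptions for illustration, not taken verbatim from RAP.

```python
# Illustrative loss clustering: losses arriving within one RTT of a back-off
# are treated as part of the same congestion event and trigger no further decrease.

class LossClusterer:
    def __init__(self, rtt_estimate):
        self.rtt = rtt_estimate          # current RTT estimate (seconds)
        self.last_event_time = None      # start of the most recent congestion event

    def report_loss(self, now):
        """Return True if this loss starts a new congestion event (sender should back off)."""
        if self.last_event_time is not None and now - self.last_event_time < self.rtt:
            return False                 # part of the same loss cluster; ignore
        self.last_event_time = now
        return True

# Usage: the sender calls report_loss() for every detected loss and decreases
# its rate only when it returns True; the once-per-RTT timer handles increases.
```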
Rate Adaptation Algorithm
• Coarse-grain rate adaptation
  • Additive Increase, Multiplicative Decrease (AIMD)
• Extensive simulations revealed:
  • TCP's behavior varies substantially with network conditions, e.g. retransmission timeouts and burstiness
  • TCP is responsive to transient congestion
  • AIMD only emulates window adjustment in TCP
Rate Adaptation Algorithm (cont'd)
• Fine-grain rate adaptation
  • The ratio of the short-term to the long-term average RTT
  • Emulates ACK-clocking in TCP
  • Increases responsiveness to transient congestion
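A sketch of the coarse-grain AIMD rate adjustment combined with fine-grain modulation by the ratio of short-term to long-term average RTT. The constants, class name, and the exact form of the modulation are assumptions for exposition, not the RAP implementation.

```python
# Coarse-grain AIMD plus fine-grain modulation (assumed form): when the
# short-term RTT average rises above the long-term average (queues building),
# the effective rate is scaled down.

PACKET_SIZE = 1000          # bytes, assumed
ALPHA_SHORT = 0.9           # weight for the short-term RTT average (assumed)
ALPHA_LONG  = 0.99          # weight for the long-term RTT average (assumed)

class RapRate:
    def __init__(self, rtt):
        self.rtt = rtt                   # base RTT estimate (seconds)
        self.srtt_short = rtt            # short-term RTT average
        self.srtt_long = rtt             # long-term RTT average
        self.rate = PACKET_SIZE / rtt    # coarse-grain rate (bytes/sec)

    def on_ack(self, sample_rtt):
        # Update short- and long-term exponentially weighted RTT averages.
        self.srtt_short = ALPHA_SHORT * self.srtt_short + (1 - ALPHA_SHORT) * sample_rtt
        self.srtt_long  = ALPHA_LONG  * self.srtt_long  + (1 - ALPHA_LONG)  * sample_rtt

    def additive_increase(self):
        # Once per RTT without congestion: send one more packet per RTT.
        self.rate += PACKET_SIZE / self.rtt

    def multiplicative_decrease(self):
        # On a new congestion event: halve the rate, as TCP does.
        self.rate /= 2.0

    def effective_rate(self):
        # Fine-grain modulation by the ratio of the two RTT averages.
        return self.rate * (self.srtt_long / self.srtt_short)
```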
[Figure: coarse-grain vs. fine-grain RAP, showing the impact of fine-grain rate adaptation]
RAP Simulation
• RAP against Tahoe, Reno, NewReno & SACK TCP
• Inter-dependency among parameters
• Configuration parameters:
  • Bandwidth per flow
  • RTT
  • Number of flows
• Fairness Ratio = Avg. RAP BW / Avg. TCP BW
[Diagram: simulation topology with RAP and TCP sources feeding two switches over a shared bottleneck, RAP and TCP sinks on the far side]
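A tiny helper for the fairness metric the simulations use, computed from per-flow average throughputs; the function name and argument layout are illustrative.

```python
# Fairness Ratio = (average RAP throughput) / (average TCP throughput).
# A ratio close to 1 means RAP and TCP flows share bandwidth fairly.

def fairness_ratio(rap_throughputs, tcp_throughputs):
    """Both arguments are lists of per-flow average throughputs from one simulation run."""
    avg_rap = sum(rap_throughputs) / len(rap_throughputs)
    avg_tcp = sum(tcp_throughputs) / len(tcp_throughputs)
    return avg_rap / avg_tcp
```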
[Figure: fairness ratio across the parameter space without fine-grain adaptation]
[Figure: fairness ratio across the parameter space with fine-grain adaptation]
Summary of RAP Simulations
• RAP achieves TCP-friendliness over a wide range of conditions
• Fine-grain rate adaptation extends inter-protocol fairness to a wider range
• Occasional unfairness against TCP traffic is mainly due to the divergence of TCP congestion control from AIMD
  • More pronounced for Reno and Tahoe
  • The bigger TCP's congestion window, the closer its behavior is to AIMD
• RED gateways can improve inter-protocol sharing
  • Depending on how well RED is configured
• RAP provides TCP-friendly congestion control over UDP
Outline
• The End-to-end Architecture
• Congestion Control (The RAP Protocol)
• Quality Adaptation
• Extending the Architecture
  • Multimedia Proxy Caching
• Contributions
• Future Directions
Quality Adaptation
[Diagram: the end-to-end architecture repeated, situating the quality adaptation module on the server side next to error control, congestion control, and the transmission buffer]
The Problem
• Delivering acceptable and stable quality while performing congestion control
• Seemingly random losses result in random and potentially wide variations in bandwidth
• Streaming applications are rate-based
Role of Quality Adaptation
• Buffering only absorbs short-term variations
  • A long-lived session could result in buffer overflow or underflow
• Quality adaptation is complementary to buffering
  • Adjust the quality to follow long-term variations in bandwidth
Mechanisms to Adjust Quality
• Adaptive encoding [Ortega 95, Tan 98]
  • CPU-intensive
• Switching between multiple encodings
  • High storage requirement
• Layered encoding [McCanne 96, Lee 98]
  • Inter-layer decoding dependency
• When/how much to adjust the quality?
Assumptions & Goals
• Assumptions
  • AIMD variations in bandwidth (rate)
  • Linear layered encoding
• Constraint
  • Obeying the congestion-controlled rate limit
• Goal
  • To control the level of smoothing
Layered Quality Adaptation
[Figure: a linear layered stream, each layer consumed at rate C, is delivered over the Internet; layers L0, L1, and L2 each have a receiver buffer (buf0, buf1, buf2) feeding the decoder and display. Quality adaptation alternates between a filling phase, when the available bandwidth BW(t) exceeds the consumption rate, and a draining phase, when it falls below it.]
Buffering Tradeoff
• Each buffering layer can only contribute at most C (bps)
• Buffering for more layers provides higher stability
• Buffered data for a dropped layer is useless for recovery
• Buffering for lower layers is more efficient
• What is the optimal buffer distribution for a single back-off scenario?
Optimal Inter-layer Buffer Allocation
• The optimal buffer state depends on the time of the back-off
• The draining pattern depends on the buffer state
• Back-offs occur randomly
• Keep the buffer state as close to the optimal as possible during the filling phase
[Figure: filling and draining phases around a back-off, showing the buffered data and bandwidth shares of layers L0, L1, and L2]
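A rough sketch of the single-back-off buffer requirement and a lower-layers-first allocation, under simplifying assumptions: the transmission rate equals the consumption rate (n_active * C) just before the back-off, the rate halves at the back-off, and it recovers linearly at a fixed slope. The exact optimal per-layer shares in the thesis differ; this only illustrates the idea and all names are hypothetical.

```python
# Single-back-off buffer requirement (deficit triangle) and a simple
# lower-layers-first distribution of that buffering.

def total_buffer_for_one_backoff(n_active, layer_rate_c, slope):
    """slope: rate recovery speed in bytes/sec per second (additive increase)."""
    rate_before = n_active * layer_rate_c        # consumption rate (bytes/sec)
    deficit_rate = rate_before / 2.0             # shortfall right after the back-off
    recovery_time = deficit_rate / slope         # time until the rate catches up (sec)
    return 0.5 * deficit_rate * recovery_time    # area of the deficit triangle (bytes)

def allocate_lower_layers_first(total_buffer, n_active, per_layer_cap):
    """Fill buffers for lower layers first, since their data is never wasted."""
    shares = []
    remaining = total_buffer
    for _ in range(n_active):
        share = min(per_layer_cap, remaining)
        shares.append(share)
        remaining -= share
    return shares   # shares[0] is the base layer's buffer share
```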
Adding & Dropping
• Add a layer when buffering is sufficient to survive a single back-off
• Drop a layer when buffering is insufficient for recovery
• Random losses could result in frequent adds and drops
  • Unstable quality
• Conservative adding results in smooth changes in quality
[Figure: draining phase after a back-off from nC to (n-1)C, showing the buffered data for layers L0, L1, and L2]
Smoothing
• Conservative adding
  • When the average bandwidth is sufficient
  • When there is sufficient buffering for K back-offs
• The buffer constraint is preferred and sufficient
  • Directly relates the time of adding to the buffer state
  • Effectively utilizes the available bandwidth
• K is a smoothing factor
  • Short-term quality vs. long-term smoothing
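A sketch of the add/drop decision with smoothing factor K: add a new layer only when enough data is buffered to survive K back-offs at the higher quality, and drop the top layer when buffering is insufficient to recover from even one back-off. The function names and signature, including the buffer_needed_for_backoffs helper, are hypothetical stand-ins for the buffer-requirement calculation above.

```python
# Add/drop decision rule with smoothing factor K (illustrative, not the thesis algorithm).

def decide_layers(n_active, total_buffered, avg_bandwidth, layer_rate_c,
                  buffer_needed_for_backoffs, k_smoothing):
    # Conservative add: the average bandwidth can sustain one more layer AND the
    # buffers could absorb K consecutive back-offs at the new quality.
    if (avg_bandwidth >= (n_active + 1) * layer_rate_c and
            total_buffered >= buffer_needed_for_backoffs(n_active + 1, k_smoothing)):
        return n_active + 1

    # Drop: not enough buffering to recover from a single back-off.
    if total_buffered < buffer_needed_for_backoffs(n_active, 1):
        return max(1, n_active - 1)

    return n_active
```

A larger K demands more buffering before a layer is added, trading short-term quality improvement for longer-term smoothness.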
Smooth Filling & Draining
[Diagram: buffer-state machine; the filling phase moves the buffers from a state able to recover from 1 back-off, to 2 back-offs, up to K back-offs, at which point a layer is added; draining moves back down, and a layer is dropped when recovery is no longer possible.]
[Figure: effect of the smoothing factor. Transmission rate, delivered quality, and per-layer buffer occupancy (L0 to L3, in KB) over 40 seconds for K = 2 and K = 4, with C = 10 KB/s.]
[Figure: adapting to network load. Transmission rate, delivered quality, and per-layer buffer occupancy (in KB) over 90 seconds for K = 4, C = 10 KB/s, as the network load changes.]
Summary of the QA Results
• The quality adaptation mechanism can efficiently control the delivered quality
• The smoothing factor allows the server to trade short-term improvement for long-term smoothing
• The buffer requirement is low
  • Can be deployed for live but non-interactive sessions
Limitation of the E2E Approach
• Delivered quality is limited to the average bandwidth between the server and the client
• Solutions:
  • Mirror servers
  • Multimedia proxy caching
[Figure: clients connected to the server across the Internet; delivered quality (layers L0 to L4) over time]
Outline
• The End-to-end Architecture
• Congestion Control (The RAP Protocol)
• Quality Adaptation
• Extending the Architecture
  • Multimedia Proxy Caching
• Contributions
• Future Directions
Multimedia Proxy Caching
• Assumptions
  • The proxy can perform:
    • End-to-end congestion control
    • Quality adaptation
• Goals
  • Improve delivered quality
  • Low-latency VCR functions
  • Natural benefits of caching
[Diagram: clients behind a proxy cache, connected through the Internet to the server]
Challenge
• Cached streams have variable quality
• The layered organization provides an opportunity for adjusting the quality
[Figure: quality (layers L0 to L4) over time, comparing the played-back stream with the stored stream]
Issues
• Delivery procedure
  • Relaying on a cache miss
  • Pre-fetching on a cache hit
• Replacement algorithm
  • Determining popularity
  • Replacement pattern
Cache Miss Scenario
• The stream is located at the original server
• Playback from the server through the proxy
• The proxy intercepts and caches the stream
• No benefit in a miss scenario
Cache Hit Scenario
• Playback from the proxy cache
  • Lower latency
  • May have better quality!
• The available bandwidth allows:
  • Lower-quality playback
  • Higher-quality playback
Lower-quality Playback
• Missing pieces of the active layers are pre-fetched on demand
• Required pieces are identified by quality adaptation
• Results in smoothing
[Figure: quality (number of active layers, L0 to L4) over time, showing the stored stream, the played-back stream, and the pre-fetched data]
Higher-quality Playback
• Pre-fetch higher layers on demand
• Pre-fetched data is always cached
• A missing piece must be pre-fetched before its playback time
• Tradeoff
[Figure: quality (number of active layers, L0 to L4) over time, showing the stored stream, the played-back stream, and the pre-fetched data]
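A sketch of on-demand pre-fetching during a cache hit: for each active layer, request any missing segment whose playback deadline falls within a look-ahead window, so it arrives before it is needed. The cache interface and all names are assumptions for illustration.

```python
# On-demand pre-fetch scheduling (illustrative). Lower layers are requested
# first because they matter most for decodable quality.

def segments_to_prefetch(active_layers, cache, playout_time, lookahead):
    """Return (layer, segment) pairs that are missing from the cache and due soon.

    active_layers: layer indices currently being played back
    cache.missing_segments(layer): hypothetical call returning missing segments,
        each with a playback_time attribute (seconds)
    """
    requests = []
    for layer in active_layers:
        for segment in cache.missing_segments(layer):
            if playout_time <= segment.playback_time <= playout_time + lookahead:
                requests.append((layer, segment))
    requests.sort(key=lambda r: (r[0], r[1].playback_time))
    return requests
```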
Replacement Algorithm
• Goal: converge the cache state to the optimal state
• The average quality of a cached stream depends on
  • its popularity
  • the average bandwidth between the proxy and recently interested clients
• The variation in quality inversely depends on
  • popularity
Popularity
• Number of hits during an interval
• User's level of interest (including VCR functions)
• Potential value of a layer for quality adaptation
• Calculate the weighted hit (whit) on a per-layer basis:
  • whit = PlaybackTime(sec) / StreamLength(sec)
• Layered encoding guarantees a monotonic decrease in popularity from lower to higher layers
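A small sketch of the weighted-hit measure from this slide, accumulated per layer over an interval; a full playback of a layer counts as 1 and a partial playback as a fraction. The function names and the dictionary-based bookkeeping are illustrative.

```python
# Weighted hit: whit = PlaybackTime(sec) / StreamLength(sec), tracked per layer.

def weighted_hit(playback_time_sec, stream_length_sec):
    return playback_time_sec / stream_length_sec

def update_layer_popularity(popularity, layer_playback_times, stream_length_sec):
    """popularity maps layer index -> accumulated whit for the current interval."""
    for layer, played in layer_playback_times.items():
        popularity[layer] = popularity.get(layer, 0.0) + weighted_hit(played, stream_length_sec)
    return popularity
```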
Replacement Pattern
• Multi-valued replacement decision for a multimedia object
• Coarse-grain flushing
  • on a per-layer basis
• Fine-grain flushing
  • on a per-segment basis
[Figure: quality (layer) vs. time for a cached stream, showing coarse-grain (whole-layer) and fine-grain (per-segment) flushing of cached segments]
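A sketch contrasting the two replacement granularities: coarse-grain flushing drops an entire (highest, least valuable) layer of the least popular stream, while fine-grain flushing trims segments from the end of that layer first, preserving the prefix most useful for a future playback. The cache interface is an assumption for illustration, not the actual mechanism.

```python
# Eviction sketch: free space from the least popular stream, starting with its
# highest cached layer, either segment by segment (fine-grain) or whole layers
# at a time (coarse-grain).

def flush_victim(cache, bytes_needed, fine_grain=True):
    freed = 0
    while freed < bytes_needed:
        stream = cache.least_popular_stream()
        layer = cache.highest_cached_layer(stream)
        if layer is None:
            break                                     # nothing left to evict
        if fine_grain:
            freed += cache.drop_last_segment(stream, layer)   # trim from the end
        else:
            freed += cache.drop_layer(stream, layer)          # drop the whole layer
    return freed
```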
Summary of Multimedia Caching
• Exploited characteristics of multimedia objects
• A proxy caching mechanism for multimedia streams
  • Pre-fetching
  • Replacement algorithm
• Adaptively converges the state of the cache to the optimal
Contributions
• An end-to-end architecture for delivery of quality-adaptive multimedia streams
• RAP, a TCP-friendly congestion control mechanism over a wide range of network conditions
• A quality adaptation mechanism that adjusts the delivered quality with a desired degree of smoothing
• A proxy caching mechanism for multimedia streams that effectively improves the delivered quality of popular streams
Future Directions
• End-to-end congestion control
  • RAP's behavior in the presence of web-like traffic
  • Emulating TCP's timer-driven regime
  • Bi-directional RAP connections: reverse vs. forward path congestion control
  • Experiments over CAIRN & the Internet
  • Integration of RAP and the congestion manager
  • Adopting RAP into class-based QoS
  • Using RAP for multicast congestion control
  • Congestion control over wireless networks