440 likes | 480 Views
Video Gross Error Detection. Royce Fernald, Phillip Corriveau, Audrey Younkin – Intel Corporation. Abstract. Provide an overview of the Gross Error Detection (GED) methodology for measuring the video playback experience
E N D
Video Gross Error Detection Royce Fernald, Phillip Corriveau, Audrey Younkin – Intel Corporation
Abstract • Provide an overview of the Gross Error Detection (GED) methodology for measuring the video playback experience • Present initial data on the correlation between objective GED metrics and subjective user opinions • Discuss goals for the GED project and outline current industry standardization efforts
Agenda • Video Gross Error Detection Overview • Gross Error Detection Methodology • Frame Identifier Sampling • Sequence Markers • Temporal Alignment • Intel Video GED Application • GED / Subjective User Experience Correlation • Subjective Assessment Methodology • Results and Analysis • Current Standardization Efforts
Gross Error Detection Overview • Designed to assess the video playback experience • Provides objective data on dropped, repeated or out-of-sequence frames • Measures frame rate stability • Suitable for video streaming assessment over unreliable media such as wireless networks • Complements full reference methodologies such as VQM
GED Methodology • Instrument video clips with a deterministic sequence of color blocks • Play video through system under test and capture the results • Measure missing or repeated frames by examining capture files for the expected frame identifiers
Platform Independence • The GED methodology provides a way to rate the user video experience of any platform or device • Since GED operates directly on capture files, it can characterize the user experience impact of various display technologies, network transports, operating systems, streaming applications, media players and compression formats
GED Frame Identifiers Frame 1 Frame 2 Frame 3 Original Sequence GED Encoded
Detection of Repeated Frames Frame 1 Frame 2 Frame 3 Original Sequence GED Encoded (Green Marker is duplicated)
Detection of Dropped Frames Frame 1 Frame 2 Frame 3 Original Sequence GED Encoded (Green Marker is missing)
GED Methodology – Workflow 0. Source Material 1. Marked Source Material GED Encode Video Encoder 2. Compressed and Marked 3. Capture Results 4. GED Scoring System Under Test GED Decode
GED Scoring • GED provides an overall “Gross Error” metric, which is the sum of the dropped, repeated and out-of-sequence frames in the test clip • GED scores correlate strongly with subjective opinions of video playback smoothness (user experience)
Agenda • Video Gross Error Detection Overview • Gross Error Detection Methodology • Frame Identifier Sampling • Sequence Markers • Temporal Alignment • Intel Video GED Application • GED / Subjective User Experience Correlation • Subjective Assessment Methodology • Results and Analysis • Current Standardization Efforts
Frame Identifier Sampling • GED averages over the entire color patch, discarding the outside edges • Accommodates blurring, minor compression artifacts and other small changes
Color Sampling Methodology • GED frame identifiers consist of fully-saturated colors to tolerate compression, digital to analog conversion, color space changes and capture device deltas • Basic Frame Sequence (RGB24 colorspace): 0 1 2 3 4 5 6 7 R: 0 G: 0 B: 0 R: 0 G: 0 B: 255 R: 0 G: 255 B: 0 R: 0 G: 255 B: 255 R: 255 G: 0 B: 0 R: 255 G: 0 B: 255 R: 255 G: 255 B: 0 R: 255 G: 255 B: 255
GED Composite Sequence • GED can use composite marker sequences to uniquely identify large numbers of frames while tolerating normal chroma changes during capture • For instance, a 3x3 grid using 8 colors per element allows 8^9 or 13 million unique frames (enough for 51 days of video) • Composite Sequence (RGB24 colorspace): 0 1 2 3 4 5 6 7 8 9 10 … 2162688 2162689 2162690 2162691 2162692
Agenda • Video Gross Error Detection Overview • Gross Error Detection Methodology • Frame Identifier Sampling • Sequence Markers • Temporal Alignment • Intel Video GED Application • GED / Subjective User Experience Correlation • Subjective Assessment Methodology • Results and Analysis • Current Standardization Efforts
Sequence Markers • Start of sequence markers can be used to delineate several video clips that have been concatenated together • Start markers also allow padding at the beginning of video clips during analog capture – this padding can be removed to keep the clips in temporal sync • Example of start of sequence marker for 3x3 grid (RGB24 colorspace):
Sequence Markers – Multiple Sequences • Multiple sequences in the same test clip can be evaluated and scored separately Sequence 2 Sequence 1 Frame 4 Frame 5 Frame 1 Frame 2 Frame 3
Sequence Markers – Video Capture Padding Source File – 3 Frames Capture File – 5 Frames Start marker used to crop extra frames X X
Agenda • Video Gross Error Detection Overview • Gross Error Detection Methodology • Frame Identifier Sampling • Sequence Markers • Temporal Alignment • Intel Video GED Application • GED / Subjective User Experience Correlation • Subjective Assessment Methodology • Results and Analysis • Current Standardization Efforts
Temporal Alignment • Missing frames can be replaced and repeated frames deleted to temporally align capture files with source clips • The feature is useful for full-reference tools, such as VQM
Temporal Alignment Example – Source Clip of Three Frames Frame 1 Frame 2 Dropped Frame Detected = 2 Frames Frame 1 Frame 2 Frame 3 = 3 Frames (Frame 2 replaced with copy of frame 1 and marked accordingly)
Agenda • Video Gross Error Detection Overview • Intel Video GED Application • GED / Subjective User Experience Correlation • Subjective Assessment Methodology • Results and Analysis • Current Standardization Efforts
Intel Video GED Application • The Video GED application is a reference implementation of the GED methodology described above • The Video GED is freely available in binary form and unencumbered by patents • Calculates Mean Opinion Scores based on Intel’s research
Agenda • Video Gross Error Detection Overview • Intel Video GED Application • GED / Subjective User Experience Correlation • Subjective Assessment Methodology • Results and Analysis • Current Standardization Efforts
Subjective Assessment Methodology • Video sequences with varying levels of degradation were generated to represent errors generally encountered during video over wireless playback • A series of 240-frame Standard Definition (720x486) clips was assembled into a one hour test session (10 video clips x 5 conditions x 2 Error types) • Fifty non-expert subjects (50% male and 50% female) were asked to evaluate each clip for “playback smoothness and fluidity,” not their opinion of the video content • Before each test session, subjects were shown a sample of the best and worst clips to establish a frame of reference in order to reduce the impact of participant inherent biases
After each video sequence, participants were presented with a choice of five adjectives describing their opinion of the video experience on a subjective scale Subjective Video Quality Scale Trial Structure
Randomization • The presentation order of the video content was randomized using a pseudo-random number generator tool to prevent ordering effects, i.e. to mask any tendency for a participant to rate a clip in relation to the previous one • Randomization is a key element of psycho-visual testing that ensures participants do not see the material in a repeated fashion that would allow a learning effect
20° 5H Participants Sat at a Predetermined Viewing Distance
Agenda • Video Gross Error Detector Overview • Intel Video GED Application • GED / Subjective User Experience Correlation • Subjective Assessment Methodology • Results and Analysis • Current Standardization Efforts
Overall Means for Dropped and Repeated Frames Significant difference found between dropped frames and repeated frames.
Collapsed Across Dropped and Repeated for Each Condition Plotted means for both error types.
Collapsed Across Dropped and Repeated for Each Clip User ratings are content dependent.
Mean Opinion Score for Dropped Frames Log fit graph for predicting dropped errors.
Mean Opinion Score for Repeated Frames Log fit graph for predicting repeated errors.
Combined Mean Opinion Score for Dropped and Repeated Frames Log fit graph for predicting dropped and/or repeated errors with high correlation.
Agenda • Video Gross Error Detector Overview • Intel Video GED Application • GED / Subjective User Experience Correlation • Current Standardization Efforts
Current Standardization Efforts • GED was designed to measure the playback experience over unreliable media, such as 802.11 wireless networks • Initial standardization efforts have focused on IEEE 802.11 Task Group T: User-Centric Wireless Performance Prediction • While generally in favor of the GED methodology, the task group would prefer including the GED in the TGT draft after it has received the endorsement of a recognized body of video experts • VQEG’s input would be extremely helpful for refining the GED and furthering the standards process
Conclusion • Video Gross Error Detection provides an efficient, repeatable method of characterizing video playback performance • GED complements quality tools such as VQM, providing a complete picture of the overall user experience • GED is freely available and intended to help the industry perform platform-independent technology comparisons