410 likes | 540 Views
Simpsons @ work. Why temporal segmentation?. Terms. A Shot. smallest piece of information elementary building block of a video. A Cut. Easy Most common 95% of all transitions are cuts. A Fade-in. Often used at the start of a chapter From Black Color to Image Linear intensity change.
E N D
A Shot • smallest piece of information • elementary building block of a video
A Cut • Easy • Most common • 95% of all transitions are cuts
A Fade-in • Often used at the start of a chapter • From Black Color to Image • Linear intensity change
A Fade-out • Often used at the end of a chapter • From Image to Black Color • Linear intensity change
A Dissolve • Used to show that time has passed • Often used as a subchapter • From one Shot to another Shot • Linear intensity change
Human vs. Computer • Human: Knows what happens. • Computer: No cognitive understanding. Segmentation has to rely on features or signals.
SAD • Sum of absolute differences • Substract each pixel by each pixel • Very easy approach SAD = -
amount dark bright RGB-Histogram • Find and count all the different shades of red, green and blue and order them from dark to bright.
Histogram Difference = • For red,green and blue: Substract the histograms bin by bin and sum up the difference. - 256 bins
No cut No cut No cut CUT No cut No cut No cut 1 Frame Difference Result:
cut cut cut cut cut cut cut cut How to find the cuts signal • Find Cuts with a threshold • Works on SAD and Histogram Difference frames
cut cut cut cut cut cut cut cut cut Problems with Motion SAD: • SAD and Histogram are very sensitive to camera or object movement. signal frames
cut cut cut cut cut cut cut cut cut cut cut cut cut cut cut cut Problems with Flash signal • SAD and Histogram are very sensitive to flash. frames
Problems with dissolves & fades • So far we have no features that would indicate a fade-in, fade-out or a dissolve Improvment needed.
No cut No cut No cut CUT No cut No cut No cut So far: 1 Frame Difference Result:
cut cut cut cut cut cut cut No Cut CUT CUT CUT No cut No cut No cut Now: n - Frame difference Result:
Motion & Flash signal signal • Not sensitive to flash anymore • Not sensitive to motion anymore frames frames
fade Dissolves & Fades signal • Finally we are able to detect dissolves frames
dissolve start dissolve end Problems with long dissolves signal • With this feature we miss the long and slow dissolves. frames
mean blue mean green mean red Mean • Mean is the arithmetic average of the histogram values. • When the histogram changes, the mean moves to left or right.
Variance • Variance indicates how much the histogram values are spread around the mean. • When the histogram mean changes, the variance changes, too. vs small big
Linear Mean Change in Dissolves signal Result: frames
Parabolic Variance Change in Dissolves signal Result: frames
Process of Detection • Finds slow dissolves • Finds fade-in / fade-out Search for candidates Approximate mean Approximate variance If small enough: Add disolve Calculate the error made
Evaluation • ~ 250.000 frames of video material (ca. 3h)~ 1800 cuts, 90 dissolves / fades • Genres: Sport, Cartoons, News, Music, Commericals, Movies, Documentary • Focus on News ( ABC Lateline ) • 3 Categories: Match, Miss and False Positive