170 likes | 333 Views
ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu. LOWER ATTENTIVE REGION DETECTION FOR VIRTUAL CONTENT INSERTION IN BROADCAST VIDEO. Outline. Introduction LAR detection framework LAR detection for sports video VCI methods Experiments Conclusion. Introduction.
E N D
ICME 2008 Huiying Liu, ShuqiangJiang, QingmingHuang, ChangshengXu LOWER ATTENTIVE REGION DETECTION FOR VIRTUAL CONTENT INSERTION IN BROADCAST VIDEO
Outline • Introduction • LAR detection framework • LAR detection for sports video • VCI methods • Experiments • Conclusion
Introduction • Virtual Content Insertion (VCI) is an emerging application of video analysis and has been studied for several years. • The task of VCI is to make the inserted content attractive to the viewers and meanwhile not intrusive. • Compared with time point, spatial position is even more important as improper placement will make the insertion intrusive.
LAR detection • Framework • In our work we perform LAR detection with shot as the spatio-temporal context.
LAR detection • Attention analysis : static saliency map + motion saliency map • A region contains more perceptive information than a pixel or a block and can be obtained by image segmentation • Image segmentation DENSITY-BASED CLUSTERING [12] [12] Q. Ye, W. Gao, W. Zeng, Color Image Segmentation Using Density-Based Clustering, ICME 2003.
LAR detection • Attention analysis • Static saliency feature contrast area factor spatial distance between the two regions adjacency degree central effect [5] H. Liu, S. Jiang, Q. Huang, C. Xu, and W. Gao. “Region-Based Visual Attention Analysis with Its Application in Image Browsing on Small Displays”. ACM Multimedia,2007
LAR detection • Attention analysis • Motion saliency [8] • The MVS visualizes the MVF and provides us with a visual aid to understand. Angle Magnitude Texture Hue Saturation Value [8] L-Y. Duan, M. Xu, Q. Tian, C-S. Xu, J. S. Jin, “A Unified Framework for Semantic Shot Classification in Sports Video” IEEE Multimedia,2005
LAR detection • Information analysis : information map + entropy map • The necessity of a region is evaluated by its information and entropy in our work. • An LAR must supplies less information and contains less entropy at the same time. • In this paper a simple and effective method is adopted to calculate the information and entropy of each region and the result is used to evaluate the region’s necessity.
LAR detection • Information can be calculated as: • Entropy of the region can be calculated as: The final LAR should be of less information and less entropy. H :the normalized accumulative histogram of a shot h :the normalized histogram of a Region A : the region’s area
Example of bottom-up LAR Detection • Fusing the 4 maps generates the frame attention map. (a): The previous frame (b): Current frame (c): Motion field (d):Segmentation result (e): Static saliency map (f ): Motion intensity (g): Texture by DCT AC energy (h): HSV image of motion vector (i): Motion saliency map (j): Information map. (k): Entropy map (l): Final attentive map
LAR detection for sports video • Method Overview • The shots here are classified into field view, player view (coach , referee) and audience view. • Filed view and player view shots take up more than 90% of the video. So we perform LAR detection and VCI only on filed view and player view shots. • For player view shots LAR can be detected by using the bottom up method.
LAR detection for sports video • Field View LAR Detection • Playfield detection, line and curve detection, as well as object detection [2] can be combined to detect the LAR. • The GMM based method [9] is adopted to detect the playfield. [2]C. Xu, K. W. Wan, S. H. Bui, Q. Tian, “Implanting Virtual Advertisement into Broadcast Soccer Video”, PCM, 2004 [9] S. Jiang, Q. Ye, W. Gao, T. Huang. “A New Method to segment Playfield and Its Applications in Match Analysis in Sports Video”. ACM Multimedia, 2004
Virtual content insertion • The methods for VCI include • Static insertion • The content floats over the original video. • Dynamic insertion • Merge the content into the background. • Camera parameter must be reconstructed and the content is adapted to the camera metric [10]. Static insertion input dynamic insertion [10] X. Yu, X. Yan, T. T. P. Chi and L. F. Cheong, “Inserting 3D projected virtual content into broadcast tennis video”, ACM Multimedia, 2006
Experiments • Test sequences (MPEG format) • Broadcast TV play series (22 minutes) • Sports1 soccer video and (45 minutes) • Sports2 tennis videos (26minutes) • For the field view shots, we insert the virtual content using dynamic method. • The VCI method presented in before estimates the camera motion and adapts the VC to the camera metric.
Experiments Static insertion dynamic insertion
Conclusion • In this paper, an LAR detection method is proposed using visual attention analysis and information theory. • In the future we will construct an integrated system of VCI and the temporal position for VCI.