230 likes | 345 Views
TIM @ SBU 2008. Outline. Results and User Study. Implementation Steps. AutoCollage Framework. TIM @ SBU 2008. About AutoCollage. An automatic procedure for constructing a visually appealing collage from a collection of input images. TIM @ SBU 2008.
E N D
Outline Results and User Study Implementation Steps AutoCollage Framework TIM @ SBU 2008
About AutoCollage An automatic procedure for constructing a visually appealing collage from a collection of input images. TIM @ SBU 2008
AutoCollage : A Labelling System Collage : I c Input Images : I = {I1, . . . , IN} Domain : P Each pixel-location : p ∈ P Label : L(p) = (n, s) in which In ∈ I and s ∈ S Compact form : I(p) = S(p,L(p)) = In(p−s) The labelling L = {L(p), p ∈ P} completely specifies the collage. Goal : To find the labelling L which minimises the energy/cost E(L) E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L) TIM @ SBU 2008
Results and User Study AutoCollage Framework Implementation Steps TIM @ SBU 2008
Representative Images TIM @ SBU 2008
Representative Images E(L) = Erep(L)+ Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L) The cost associated with the set Is of chosen images is of the form Erep = ∑n Erep(n) where Erep(n) = −an Dr(n) − min an am Vr(n,m) m:Im∈ Is an = 1 if ∃p ∈ P with L(p) = (n, s) Dr(n) = Entropy(In) + Wfacedδ({Image n contains a face}) δ(π) = 1 if predicate π is true Wface weights the influence of an image containing a face Vr(m,n) shows pairwise distances between images TIM @ SBU 2008
Region of Interest TIM @ SBU 2008
Region of Interest E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L) The Eimp term ensures that a substantial and interesting region of interest (ROI) is selected from each image in Is. Eimp(L) = − ∑p G(p, L(p)) T(p, L(p)) G(p, L(p)) is the Gaussian weighting function that favors the center of the input image from which p is drawn. T(p, L(p)) measures the local entropy of a 32×32-pixel region around the pixel p, and is normalized so that local entropy sums to 1 over a given input image. TIM @ SBU 2008
Packing ROIs TIM @ SBU 2008
Packing ROIs : Constraints 1. Faces should be regard as a preferred material 2. Sky should be constrained to appear at the top 3. No two ROIs should intersect 4. Every pixel is couvered by an image TIM @ SBU 2008
Packing ROIs : The Energy E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L) Eobj incorporates information on object recognition, and favors placement of objects in reasonable configurations. For faces, Eobj = ∑p, q∈ N f (p, q, L(p), L(q)) f (p,q,L(p),L(q)) = ∞ whenever L(p) ≠ L(q) and p,q are pixels from the same face in either the images of L(p) or L(q), 0 otherwise. For sky, rather than defining an explicit energy, we simply label images containing sky and pass this information to the constraint satisfaction engine which attempts to position such images only at the top of the collage. TIM @ SBU 2008
Transition Between Images TIM @ SBU 2008
Transition between Images E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L) Etrans is a pairwise term which penalises any transition between images that is not visually appealing. Etrans = ∑p,q∈N VT (p, q, L(p), L(q)) N is the set of all pairs of neighboring (8-neighborhood) pixels. Compact form: I(p) = S(p,L(p)) = In(p−s) ε = 0.001 prevents underflow ||.|| defines the Euclidean norm TIM @ SBU 2008
Transition between Images E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L) Etrans measures mismatch across the boundary between two input images. 1. VT (p, q, L(p), L(q)) = 0 unless L(p) ≠ L(q). 2. VT (p, q, L(p), L(q)) is small if there is a strong gradient in one of the input images, since the relevant denominator will then be large. TIM @ SBU 2008
AutoCollage : Constraints 1. Information bound Any image Inthat is present in the labelling must satisfy Eimp(L,n) > T. Eimp(L,n) ∈ [0,1]is the proportion of local image information that is captured in the ROI. 2. Uniform shift A given input image In may appear in the collage with one unique shift s. Given two distinct pixels p, q ∈ P : p ≠ q, with labels L(p) = (n, s), L(q) = (n, s′), it is required that s = s′. 3. Connectivity Each set Sn ∈ {p ∈ P: L(p)=(n, s), for some s} of collage pixels drawn from image n, should form a connected region. TIM @ SBU 2008
Implementation Summary E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L) Ereptends to select the images from the input image set that are most representative. Eimpterm ensures that a substantial and interesting region of interest (ROI) is selected from each image in I. Etransis a pairwise term which penalises any transition between images that is not visually appealing. Eobjincorporates information on object recognition, and favors placement of objects in reasonable configurations. Wimp, Wtrans, Wobj and Wfaced are weighting parameters and have been adjusted by informal testing over 50 sets of images. TIM @ SBU 2008
Demo TIM @ SBU 2008
Implementation Steps AutoCollage Framework Results and User Study TIM @ SBU 2008
Results : Comparison Tapestry AutoCollage TIM @ SBU 2008
Results : Limitations 1. Occasional inclusion of sky fragments in the interior (83% Accuracy) 2. Occasionally the face detection fails, allowing inappropriate cut 3. Sometimes texture edges trigger inappropriately sharp transitions 4. Lake of user interaction TIM @ SBU 2008
User Study TIM @ SBU 2008