1 / 27

How do ideas from perceptual organization relate to natural scenes?

How do ideas from perceptual organization relate to natural scenes?. Brunswik & Kamiya 1953. Thesis : Gestalt rules reflect the structure of the natural world Attempted to validate the grouping rule of proximity of similars Brunswik was ahead of his time… we now have the tools.

vahe
Download Presentation

How do ideas from perceptual organization relate to natural scenes?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. How do ideas from perceptual organization relate to natural scenes?

  2. Brunswik & Kamiya 1953 • Thesis: Gestalt rules reflect the structure of the natural world • Attempted to validate the grouping rule of proximity of similars • Brunswik was ahead of his time… we now have the tools. Egon Brunswik (1903-1955)

  3. Ecological Statistics of Perceptual Organization • Can we define these cues for real images? • Are these cues “ecologically valid”? • How informative are different cues? Figure/Ground Grouping

  4. Task: detect generic pattern or group • Signal: class of patterns, known null hypothesis • Cues: optimal test is usually obvious • Result: mathematically precise characterization of when detection is possible • Task: capture “useful” information about the scene • Signal: natural image statistics, clutter • Cues: something computable from real pixels • Result: empirical statistics about relative power of different cues

  5. Berkeley Segmentation DataSet [BSDS]

  6. distance [proximity] • region cues [similarity] • boundary cues [connectedness, closure, convexity] Cues: What image measurements allow us to gauge the probability that pixels i and j belong to the same group?

  7. Learning Pairwise Affinities Sij – indicator variable as to whether pixels i and j were marked as belonging to the same group by human subjects. Wij – our estimate of the likelihood that pixel iand j belong to the same group conditioned on the image measurements. • Use the ground truth given by human segmentations to calibrate cues. • Learn “statistically optimal” cue combination in a supervised learning framework • Ecological Statistics: Measure the relative power of different cues for natural scenes

  8. Brightness Texture Color L* Boundary Processing Region Processing a* b* Textons D E Distance 2 2 Wij C A B C A B Original Image

  9. Evaluation Measures • Precision-Recall of same-segment pairs • Precision is P(Sij=1 | Wij > t) • Recall is P(Wij > t | Sij = 1) • Mutual Information between W and S Groundtruth Sij Estimate Wij ∫ p(s,w) log [p(s)p(w) / p(s,w)]

  10. Individual Features Gradients Patches

  11. Affinity Model vs. Human Segmentation

  12. Findings • Both Edges and Patches provide useful “independent” information. • Texture gradients can be quite powerful • Color patches better than gradients • Brightness gradients better than patches. • Proximity is a result, not a cause of grouping

  13. Figure-Ground Labeling • start with 200 segmented images of natural scenes • boundaries labeled by at least 2 different human subjects • - subjects agree on 88% of contours labeled

  14. Local Cues for Figure/Ground • Assume we have a perfect segmentation • Can we predict which region a contour belongs to based on it’s local shape? • Size/Surroundedness • Convexity • Lower Region

  15. p Size and Surroundedness [Rubin 1921] G F Size(p) = log(AreaF / AreaG)

  16. p G F Convexity [Metzger 1953, Kanizsa and Gerbino 1976] ConvG = percentage of straight lines that lie completely within region G Convexity(p) = log(ConvF / ConvG)

  17. θ p center of mass Lower Region [Vecera, Vogel & Woodman 2002] LowerRegion(p) = θG

  18. Figural regions tend to be convex

  19. Figural regions tend to lie below ground regions

  20. Size Lower Region Convexity

  21. Power of cue depends on support of the analysis window.

  22. Power of cue depends on support of the analysis window.

  23. “Upper Bounding” Local Performance • Present human subjects with local shapes, seen through an aperture.

  24. Human Performance on Local Figure-Ground

  25. Extension to Real Images • Build up library of prototypical contour configurations by clustering local shape descriptors • Geometric Blur [Berg & Malik 01] • Train a classifier which uses similarities to these prototype shapes to predict figure/ground label

  26. Shapemes Classifier using 64 shapeme features: 61%

  27. Globalization of Figure/Ground Measurements • Averaging local shapeme cue over human-marked boundaries: 71% • Prior over junction types and label continuity: 79%

More Related