330 likes | 413 Vues
Explore a database of human segmented natural images for image segmentation benchmarking and ecological statistics analysis. The dataset includes 1000 Corel images with manual segmentations by 20 subjects. Discover the consistency and error measurement of human segmentations, along with insights on region properties. Validate Gestalt grouping factors and explore the relative power of cues for image segmentation.
E N D
A Database of Human Segmented Natural Images and Two Applications David Martin, Charless Fowlkes, Doron Tal, Jitendra Malik UC Berkeley {dmartin,fowlkes,doron,malik}@eecs.berkeley.edu
Motivation • Berkeley Segmentation Dataset Groundtruth for image segmentation of natural images • App#1: A segmentation benchmark • App#2: Ecological statistics David Martin - UC Berkeley - ICCV 2001
Benchmark Example for Recognition MNIST handwritten digit dataset [LeCun, AT&T] http://www.research.att.com/~yann/exdb/mnist/index.html Training set, test set, evaluation methodology, algorithm ranking David Martin - UC Berkeley - ICCV 2001
The Image Dataset • 1000 Corel images • Photographs of outdoor scenes • Texture is common • Large variety of subject matter • 481 x 321 x 24b David Martin - UC Berkeley - ICCV 2001
Establishing Groundtruth • Def: Segmentation = Partition of image pixels into exclusive sets • Manual segmentation by human subjects • Custom Java tool to facilitate task • Currently: 1000 images, 5500 segmentations, 20 subjects • Naïve subjects (UCB undergrads) given simple, non-technical instructions David Martin - UC Berkeley - ICCV 2001
Directions to Image Segmentors • You will be presented a photographic image • Divide the image into some number of segments, where the segments represent “things” or “parts of things” in the scene • The number of segments is up to you, as it depends on the image. Something between 2 and 30 is likely to be appropriate. • It is important that all of the segments have approximately equal importance. David Martin - UC Berkeley - ICCV 2001
The segmentations are not identical. • But are they consistent?? David Martin - UC Berkeley - ICCV 2001
image background left bird right bird beak grass bush far beak eye head body eye head body Perceptual organization forms a hierarchy Each subject picks a slice through this hierarchy. David Martin - UC Berkeley - ICCV 2001
Quantifying inconsistency S2 S1 How much is S1 a refinement of S2 at pixel ? David Martin - UC Berkeley - ICCV 2001
Segmentation Error Measure • One-way Local Refinement Error: • Segmentation Error allows refinement in either direction at each pixel: David Martin - UC Berkeley - ICCV 2001
Human segmentations are consistent Distribution of segmentation error over the dataset. David Martin - UC Berkeley - ICCV 2001
Color Gray InvNeg David Martin - UC Berkeley - ICCV 2001
InvNeg David Martin - UC Berkeley - ICCV 2001
Color Gray InvNeg David Martin - UC Berkeley - ICCV 2001
Gray vs. Color vs. InvNeg Segmentations SE (gray, gray) = 0.047 SE (gray, color) = 0.047 Color may affect attention, but doesn’t seem to affect perceptual organization SE (gray, gray) = 0.047 SE (gray, invneg) = 0.059 InvNeg interferes with high-level cues (2500 gray, 2500 color,200 invneg segmentations) David Martin - UC Berkeley - ICCV 2001
Benchmark Methodology • Separate training and test datasets with no images in common • Generate computer segmentation(s) of each image in test set • Determine error of each computer segmentation using SE measure • Algorithm scored by mean SE • Example: • SE (human, human) = 0.05 • SE (NCuts, human) = 0.22 • SE (different images) = 0.30 David Martin - UC Berkeley - ICCV 2001
Ecological Statistics of Image Segmentations • Validating and quantifying Gestalt grouping factors [Brunswik 1953] • Priors on region properties • Recent work on natural image statistics: • Filter outputs [Ruderman 1994, Olshausen & Field 1996, Yuille et. al. 1999] • Object sizes [Alvarez, Gousseau, Morel 1999] • Shape [Zhu 1999] • Contours [August & Zucker 2000, Geisler et al. 2001] David Martin - UC Berkeley - ICCV 2001
Relative power of cues • Pairwise grouping cues • Proximity • Luminance similarity • Color similarity • Intervening contour • Texture similarity David Martin - UC Berkeley - ICCV 2001
P (Same Segment | Proximity) David Martin - UC Berkeley - ICCV 2001
P (Same Segment | Luminance) David Martin - UC Berkeley - ICCV 2001
Bayes Risk for Proximity Cue David Martin - UC Berkeley - ICCV 2001
Bayes Risk for Various Cues Conditioned on Proximity David Martin - UC Berkeley - ICCV 2001
Mutual Information for Various Cues Conditioned on Proximity David Martin - UC Berkeley - ICCV 2001
Priors on Region Properties • Area • Convexity David Martin - UC Berkeley - ICCV 2001
Empirical Distribution of Region Area y = Kx- = 0.913 Compare with Alvarez, Gousseau, Morel 1999. David Martin - UC Berkeley - ICCV 2001
Empirical Distribution of Region Convexity David Martin - UC Berkeley - ICCV 2001
Conclusion • Large new database of segmentations of natural images by humans • A segmentation benchmark • Ecological statistics • Relative power of grouping cues • Priors on region properties http://www.cs.berkeley.edu/~dmartin/segbench David Martin - UC Berkeley - ICCV 2001