200 likes | 279 Vues
This publication discusses the usage of relative visual attributes for more detailed and informative image descriptions. By describing an image's attributes in relation to others, it allows for richer human supervision and potentially higher recognition accuracy. The proposed algorithm involves training a ranking function per attribute and predicting relative strengths on novel images, enabling tasks like zero-shot learning and generative models for joint ranking outputs. Experimental results show the effectiveness of this approach in face and outdoor scene recognition.
E N D
Relative Attributes Speaker DengLei At I-VisionGroup
Devi Parikh & Kristen Grauman • ICCV 2011 Marr Prize
… last 3 years: • ICCV 3(one only her) ECCV 1 CVPR 9 IJCV 1 NIPS 1
Outline • Introduction • Algorithms • Experiments
Introduction—Backgrounds • Visual attributes • Benefit various recognition tasks • Restrict on categorical label • Binaries are unnatural • Motivation • How to describe middle image • Relative description — one image’s attribute strength with respect to others • E.g. less natural than left, more nature than right • Richer mode of communication • Allow more detailed human supervision (maybe higher recognition accuracy) • More informative descriptions of novels
Proposal • Steps • Training – learn ranking function per attribute • Testing – predict the relative strength per attribute on novel image • New Tasks • Build generative model over joint space of ranking output • Zero-shot learning relates unseen to seen • E.g. 'bears are furrier than giraffes‘ • Enable richer textual description for new images • More precise • Tested on faces and natural scenes compared with binaries
Outline • Introduction • Algorithms • Experiments
Novel zero-shot learning • Setup • N total categories: S seen, U unseen (no images available) • S: described relative to each other via attrs (no need all pairs) • U: described relative to seen in (subset of ) attrs • Gaussian • Test by Max-likehood
Auto gen relative textual desc of images • Img -> Img • Img -> Class • More info than bin
Outline • Introduction • Algorithms • Experiments
Experiments • Setup • Outdoor Scene Recognition (OSR) • I: 2668, C: 8, • Coast, forest, highway, inside-city, mountain, open-country, street, tall-building • Gist • Public Figures Face (PubFig) • I: 772, C: 8 • Alex, Clive, Hugh, Jared, Miley, Scarlett, Viggo, Zac • Concatenated gist and color feature
Database — relative attrs • Marked By a colleague
Conclusion • Idea to learn relative visual attrs. • Two new tasks • Zero-shot learning • Img description • Based on relative description