200 likes | 361 Vues
This paper presents a novel method for image recognition through relative visual attributes, emphasizing their advantages over binary classifications. By using relative descriptions, such as comparing an image's attributes to others, we enable richer, more informative human supervision and potentially enhance recognition accuracy. We propose steps for training a ranking function per attribute and testing its predictive power on unseen images. Our experiments focus on recognizing outdoor scenes and faces, illustrating the effectiveness of this approach in generating detailed textual descriptions and facilitating zero-shot learning.
E N D
Relative Attributes Speaker DengLei At I-VisionGroup
Devi Parikh & Kristen Grauman • ICCV 2011 Marr Prize
… last 3 years: • ICCV 3(one only her) ECCV 1 CVPR 9 IJCV 1 NIPS 1
Outline • Introduction • Algorithms • Experiments
Introduction—Backgrounds • Visual attributes • Benefit various recognition tasks • Restrict on categorical label • Binaries are unnatural • Motivation • How to describe middle image • Relative description — one image’s attribute strength with respect to others • E.g. less natural than left, more nature than right • Richer mode of communication • Allow more detailed human supervision (maybe higher recognition accuracy) • More informative descriptions of novels
Proposal • Steps • Training – learn ranking function per attribute • Testing – predict the relative strength per attribute on novel image • New Tasks • Build generative model over joint space of ranking output • Zero-shot learning relates unseen to seen • E.g. 'bears are furrier than giraffes‘ • Enable richer textual description for new images • More precise • Tested on faces and natural scenes compared with binaries
Outline • Introduction • Algorithms • Experiments
Novel zero-shot learning • Setup • N total categories: S seen, U unseen (no images available) • S: described relative to each other via attrs (no need all pairs) • U: described relative to seen in (subset of ) attrs • Gaussian • Test by Max-likehood
Auto gen relative textual desc of images • Img -> Img • Img -> Class • More info than bin
Outline • Introduction • Algorithms • Experiments
Experiments • Setup • Outdoor Scene Recognition (OSR) • I: 2668, C: 8, • Coast, forest, highway, inside-city, mountain, open-country, street, tall-building • Gist • Public Figures Face (PubFig) • I: 772, C: 8 • Alex, Clive, Hugh, Jared, Miley, Scarlett, Viggo, Zac • Concatenated gist and color feature
Database — relative attrs • Marked By a colleague
Conclusion • Idea to learn relative visual attrs. • Two new tasks • Zero-shot learning • Img description • Based on relative description