Finding Celebrities in Billions of Web Images

Finding Celebrities in Billions of Web Images 云飞 2012-12-13

Overview • 1. Label an input image with a list of celebrities. • 2. Assign the celebrity names to the faces by label propagation on a facial similarity graph.

Overview • Strengths of the paper: • 1. The proposed image annotation system is capable of labeling general web images with names. • 2. The name assignment algorithm does not impose any assumption on the facial feature distribution. • 3. It uses more than visual cues: the surrounding text is exploited as well.

Overview • 1. Determine who may appear in the image by identifying celebrity names in the surrounding text. • 2. Given a set of names, assign the names to the faces in the input image.

Overview • A. Image Annotation System • 1) construct a celebrity name vocabulary; • 2) discover all webpages hosting near-duplicates of the input image; • 3) use the vocabulary to filter the surrounding text. • Advantages: • 1) effective; • 2) removes noise. • Annotated images: • 1) SFSN (single face, single name) • 2) SFMN (single face, multiple names) • 3) MF (multiple faces)

Overview • B. Multimodal Name Assignment • The context likelihood incorporates the information from surrounding text by using the confidence scores estimated by the image annotation system.

IMAGE ANNOTATION SYSTEM • Goal: label an input image with a list of celebrities who may appear in the image. • A. Constructing a Large-Scale Celebrity Name Vocabulary • B. Discover Related Webpages by Near-Duplicate Image Retrieval • C. Annotating Images by Mining Surrounding Text of Related Webpages

IMAGE ANNOTATION SYSTEM • A. Constructing a Large-Scale Celebrity Name Vocabulary • 1) Wikipedia: first paragraph, infobox, tags • 2) EntityCube

IMAGE ANNOTATION SYSTEM • B. Discover Related Webpages by Near-Duplicate Image Retrieval • divide-and-conquer strategy • divide the image into n×n blocks • dimensionality reduction • thresholding
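The steps on this slide (grid partition, dimensionality reduction, thresholding) can be sketched as a compact binary signature for near-duplicate lookup. This is a minimal illustration in plain Python, not the system described in the talk: the block-mean feature, the random projection standing in for the dimensionality reduction, the zero threshold, and every function name are assumptions made for the example.

```python
import random

def block_means(img, n):
    """Split a grayscale image (list of pixel rows) into n x n blocks
    and return the mean intensity of each block."""
    h, w = len(img), len(img[0])
    feats = []
    for bi in range(n):
        for bj in range(n):
            r0, r1 = bi * h // n, (bi + 1) * h // n
            c0, c1 = bj * w // n, (bj + 1) * w // n
            vals = [img[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            feats.append(sum(vals) / len(vals))
    return feats

def make_projection(in_dim, out_dim, seed=0):
    """Assumed stand-in for the dimensionality reduction: a fixed random
    Gaussian projection (a learned projection could be used instead)."""
    rng = random.Random(seed)
    return [[rng.gauss(0, 1) for _ in range(in_dim)] for _ in range(out_dim)]

def signature(img, n, proj):
    """Grid features -> mean-centred -> projected -> thresholded at zero."""
    feats = block_means(img, n)
    mean = sum(feats) / len(feats)
    centred = [f - mean for f in feats]
    reduced = [sum(w * x for w, x in zip(row, centred)) for row in proj]
    return tuple(1 if v > 0 else 0 for v in reduced)

def hamming(a, b):
    """Number of differing bits between two signatures."""
    return sum(x != y for x, y in zip(a, b))

# Hypothetical 8x8 image and a uniformly brightened copy of it.
img = [[(r * 7 + c * 13) % 256 for c in range(8)] for r in range(8)]
brighter = [[p + 10 for p in row] for row in img]
proj = make_projection(in_dim=16, out_dim=8)
sig_a = signature(img, 4, proj)
sig_b = signature(brighter, 4, proj)
```

Because the features are mean-centred before projection, a uniform brightness shift leaves the signature unchanged, so the two images above would be grouped as near-duplicates by a Hamming-distance test.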

IMAGE ANNOTATION SYSTEM • C. Annotating Images by Mining Surrounding Text of Related Webpages • 1) Type of names; • 2) Type of surrounding text; • 3) Frequency versus ratio
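One plausible reading of "frequency versus ratio" is the choice between scoring a name by the number of related pages that mention it and by the fraction of related pages that mention it. The sketch below computes both; the function name, the case-insensitive substring match, and the sample names are hypothetical and not taken from the talk.

```python
from collections import Counter

def score_names(page_texts, vocabulary):
    """Return {name: (frequency, ratio)} where frequency is the number of
    surrounding-text snippets mentioning the name and ratio is that count
    divided by the total number of snippets."""
    counts = Counter()
    for text in page_texts:
        lowered = text.lower()
        for name in vocabulary:
            if name.lower() in lowered:
                counts[name] += 1  # count each page at most once per name
    total = len(page_texts)
    return {name: (c, c / total) for name, c in counts.items()}

# Hypothetical surrounding texts of pages hosting near-duplicates of one image.
texts = [
    "Photo of Alice Smith at the gala",
    "alice smith and Bob Jones arrive",
    "Weather today",
]
vocab = ["Alice Smith", "Bob Jones", "Carol White"]
scores = score_names(texts, vocab)
```

Frequency favours images that are widely copied, while ratio is comparable across images with different numbers of near-duplicates; names outside the vocabulary never enter the result, which is how the vocabulary filters noise.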

MULTIMODAL NAME ASSIGNMENT • A. Notation • B. Overview of the Assignment Model • C. Label Propagation from SFSN Images p(Y|F) • D. Constrain the Propagation by a Context Likelihood p(Y|T; λ) • E. Normalization by Name Prior p(Y) • F. Implementation Detail: Face Representation

A. Notation • faces in image In • denote the face labels as

B. Overview of the Assignment Model • the confidence for label

C. Label Propagation from SFSN Images p(Y|F) • how to propagate labels from SFSN images to SFMN and MF images
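As an illustration of propagating labels over a facial similarity graph, here is a generic clamped label propagation in plain Python. It is a textbook scheme, not necessarily the formulation used in the paper: the Gaussian affinity, the fixed iteration count, and all names are assumptions. Seed nodes play the role of faces from SFSN images, whose single name is trusted.

```python
import math

def gaussian_affinity(feats, sigma=1.0):
    """Dense similarity graph: W[i][j] = exp(-||xi - xj||^2 / (2 sigma^2))."""
    n = len(feats)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(feats[i], feats[j]))
                W[i][j] = math.exp(-d2 / (2 * sigma ** 2))
    return W

def propagate(W, seeds, num_labels, iters=50):
    """seeds maps node index -> label index for the trusted faces.
    Unlabeled nodes repeatedly take the similarity-weighted average of
    their neighbours' label scores; seed nodes stay clamped."""
    n = len(W)
    F = [[0.0] * num_labels for _ in range(n)]
    for i, lab in seeds.items():
        F[i][lab] = 1.0
    for _ in range(iters):
        new_F = []
        for i in range(n):
            if i in seeds:
                new_F.append(F[i][:])  # clamp trusted labels
                continue
            row_sum = sum(W[i]) or 1.0
            new_F.append([
                sum(W[i][j] * F[j][k] for j in range(n)) / row_sum
                for k in range(num_labels)
            ])
        F = new_F
    return F

# Hypothetical 2-D face features: two tight clusters, one seed in each.
feats = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]]
W = gaussian_affinity(feats)
F = propagate(W, seeds={0: 0, 2: 1}, num_labels=2)
```

In this toy graph, node 1 ends up with almost all of its score on label 0 and node 3 on label 1, because each sits next to a seed of that label.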

D. Constrain the Propagation by a Context Likelihood p(Y|T; λ) • 1) For each image-level name vk, generate a binary variable zk from p(vk |T) as defined in (3) to indicate whether vk appears in image I. • 2) If zk=1, generate mk faces of name vk in image I from p(m|z; λ) as defined in (13).

E. Normalization by Name Prior p(Y) • p(Y) represents the prior of names.

F. Implementation Detail: Face Representation • The appearance of each face is described by local binary patterns (LBP). • The face image is divided into small regions from which the LBP features are extracted and concatenated into a single feature histogram. • Apply PCA to reduce the dimension of the face descriptor from over 3000 to 500 dimensions.
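The region-wise LBP histogram can be sketched as follows. This uses the basic 8-neighbour, 256-bin LBP and a square grid of regions, both of which are assumptions; the slide does not state which LBP variant or region layout was used, and the PCA step is left out of the sketch.

```python
# 8 neighbours, clockwise from the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]

def lbp_code(img, r, c):
    """Basic LBP: set one bit per neighbour whose value is >= the centre."""
    center = img[r][c]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code

def lbp_descriptor(img, grid):
    """Compute LBP codes for interior pixels, split them into grid x grid
    regions, and concatenate the normalised 256-bin histogram of each region."""
    h, w = len(img), len(img[0])
    codes = [[lbp_code(img, r, c) for c in range(1, w - 1)]
             for r in range(1, h - 1)]
    ch, cw = h - 2, w - 2
    desc = []
    for gi in range(grid):
        for gj in range(grid):
            hist = [0] * 256
            for r in range(gi * ch // grid, (gi + 1) * ch // grid):
                for c in range(gj * cw // grid, (gj + 1) * cw // grid):
                    hist[codes[r][c]] += 1
            total = sum(hist) or 1
            desc.extend(v / total for v in hist)
    return desc

# Hypothetical flat 10x10 "face": every neighbour equals the centre,
# so every interior pixel gets code 255.
flat = [[5] * 10 for _ in range(10)]
desc = lbp_descriptor(flat, grid=2)
```

With a 2×2 grid the descriptor has 4 × 256 = 1024 entries; a finer grid gives a longer vector, which is the kind of high-dimensional descriptor that a PCA projection would then shorten.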

Evaluation
