Computer Assisted Visual InterActive Recognition (CAVIAR)

Computer Assisted Visual InterActive Recognition(CAVIAR) Jie Zou RPI ECSE DocLab Advisor: Prof. George Nagy Committee: Prof. Qiang Ji Prof. Robert B. Kelley Prof. Mukkai Krishnamoorthy Jie Zou, Doclab ECSE, RPI

Agenda • Introduction • Related research • CAVIAR methodology • Interactive segmentation • CAVIAR – flower recognition system • CAVIAR – face recognition system • Conclusions Jie Zou, Doclab ECSE, RPI

Motivation • All operational systems require human assistance (preprocessing, handling rejects). • CAVIAR makes parsimonious use of human visual talent throughout the process rather than only at the beginning or the end. Jie Zou, Doclab ECSE, RPI

Scope of CAVIAR • Visual pattern recognition only • Each CAVIAR system addresses a specific domain • Many class classification • Low throughput Jie Zou, Doclab ECSE, RPI

Research Goals • Allocation of human and machine responsibilities • Mathematical model • Framework and design principles • Prototype CAVIAR systems Jie Zou, Doclab ECSE, RPI

Content-Based Image Retrieval • Typical search of CBIR system • Submit a query image. • Specify the relative importance of the features. • Relevance feedback (label the retrieved images as acceptable or not-acceptable). • Iterates until user finds the desired image. Jie Zou, Doclab ECSE, RPI

CBIR vs. CAVIAR CBIR CAVIAR Subjective retrieval Objective classification User judges retrieval results Statistical decision boundary Machine weights features User weights features Narrow domain Broad domain Relevance feedback Relevance feedback Model adjustment Jie Zou, Doclab ECSE, RPI

Flower Recognition • Little research on automatic flower recognition • M. Das, R. Manmatha, and E.M. Riseman, “Indexing flower patent images using domain knowledge," IEEE Intelligent Systems, vol. 14, no. 5, pp. 24-33, 1999. Jie Zou, Doclab ECSE, RPI

Face Recognition • Started in 1960’s. Now, most active pattern recognition application • Eigenface, dominant method • Geometrical feature models are appropriate for interactive recognition Jie Zou, Doclab ECSE, RPI

Psychophysics • Attneave (1954): “the nature of redundancy in visual stimulations”, and “information is concentrated along contours.” • Miller (1956): plus or minus 7 Jie Zou, Doclab ECSE, RPI

Allocation of Human and Machine Responsibilities Conventional System CAVIAR Jie Zou, Doclab ECSE, RPI

Notation CAVIAR state Model parameters Features Index vector Training set Label Jie Zou, Doclab ECSE, RPI

Formal Description (1) • Finite state machine • Initial state created by: • Model building • Feature extraction • Indexing Jie Zou, Doclab ECSE, RPI

Formal Description (2) • Model manipulation leads to a state transition from state n to state n+1: • Model building , • Feature extraction • Indexing Jie Zou, Doclab ECSE, RPI

Formal Description (3) • The task can terminate at any state by identification. Jie Zou, Doclab ECSE, RPI

Illustration (Video) Jie Zou, Doclab ECSE, RPI

Notation • Parametric boundary • Exact boundary • Foreground region or • Background region or • Radius vectorintersectsat , andat . Jie Zou, Doclab ECSE, RPI

Training – Color Distributions Jie Zou, Doclab ECSE, RPI

Training – Circle Parameter Distributions Jie Zou, Doclab ECSE, RPI

Training – Deviation of Circular Model From Exact Boundary ß=5.52 Jie Zou, Doclab ECSE, RPI

Automatic Segmentation – Circle Partition Use a circle to isolate a region, which contains mostly flower colors. Jie Zou, Doclab ECSE, RPI

Automatic Segmentation – Generate Boundary Likelihood Map Distance to the circle Magnitude of color gradient Boundary pixels should be close to the circle, and have high color gradient. Jie Zou, Doclab ECSE, RPI

Automatic Segmentation – Deformation on BLM • Circle center = a foreground seed, and four corner pixels = background seeds • Foreground and background regions compete with each other to expand. • Eventually, converge to the watershed of the seed pixels on the BLM. Jie Zou, Doclab ECSE, RPI

Advantage of BLM over Gradient Map Jie Zou, Doclab ECSE, RPI

Examples of the Result of Automatic Segmentation Jie Zou, Doclab ECSE, RPI

Interactive Correction (Video) Jie Zou, Doclab ECSE, RPI

Segment Flower Pictures with Interactive Correction • 1078 flowers from 113 speciesBorjan Gagoski and Adam Callahan • 5.7 seed pixels, 15.2 seconds per picture Greenie Cheng Jie Zou, Doclab ECSE, RPI

Agenda • Motivation • Related research • Interactive segmentation • CAVIAR methodology • CAVIAR – flower recognition system • CAVIAR – face recognition system • Conclusions Jie Zou, Doclab ECSE, RPI

Flower Database • 320 by 240 resolution • Highly variable illumination, and complex background • 216 samples from 29 classes for development • 612 samples from 102 classes for evaluation Jie Zou, Doclab ECSE, RPI

Rose Curve Model • Parametric curve withsix parameters. • Flowers are composed of petals, which havecircular symmetry. • When n=0, rose curvereduces to circle. Jie Zou, Doclab ECSE, RPI

Classification Features number of petals. the ratio of outer to inner radius. first three order moments of the hue andsaturation histograms Jie Zou, Doclab ECSE, RPI

Adjust inner circle radius by dragging this control point. Move the rose curve by dragging this control point. Change the petal number using this ComboBox. Adjust outer circle radius by dragging this control point. CAVIAR-Flower GUI Jie Zou, Doclab ECSE, RPI

CAVIAR-Flower (Video) Jie Zou, Doclab ECSE, RPI

Evaluation • CAVIAR compared to human-alone and machine-alone • Machine learning • Decision directed approximation • Finite state machine calibration Jie Zou, Doclab ECSE, RPI

Experimental Protocol Thanks to Borjan Gagoski Jie Zou, Doclab ECSE, RPI

CAVIAR Compared to Human-Alone and Machine-Alone • Significantly reduce the recognition time compared to human-alone • Significantly increase the accuracy compared to machine-alone Jie Zou, Doclab ECSE, RPI

Accuracy of Initial Automatic Recognition T2 – 5 labeled T3 – 1 labeled T4 – 1 labeled + 2 pseudo T5 – 1 labeled + 4 pseudo Jie Zou, Doclab ECSE, RPI

Rank Order after Initial Automatic Recognition T2 – 5 labeled T3 – 1 labeled T4 – 1 labeled + 2 pseudo T5 – 1 labeled + 4 pseudo Jie Zou, Doclab ECSE, RPI

Time of Interactive Recognition T2 – 5 labeled T3 – 1 labeled T4 – 1 labeled + 2 pseudo T5 – 1 labeled + 4 pseudo Jie Zou, Doclab ECSE, RPI

Accuracy of Interactive Recognition T2 – 5 labeled T3 – 1 labeled T4 – 1 labeled + 2 pseudo T5 – 1 labeled + 4 pseudo Jie Zou, Doclab ECSE, RPI

Observations about Machine Learning • Initialized with a single training samples per class. • Self-learning: user classified pseudo-labeled samples improve the performance. • Performance of T5 is close to T2, suggesting that instead of initializing with many training samples, we can trust the system’s self learning. Jie Zou, Doclab ECSE, RPI

Calibration of Finite State Machine • 52% samples are immediately confirmed. • 90% samples are identified by 3 adjustments. • The probability of success on each adjustment is just over one half. Jie Zou, Doclab ECSE, RPI

Summary of CAVIAR-Flower • Parameterized rose curve to model the flowers. • Display the rose curve and let user adjust it if necessary. • The evaluation of the system shows advantages of CAVIAR system. Jie Zou, Doclab ECSE, RPI

Agenda • Motivation • Related research • Interactive segmentation • CAVIAR methodology • CAVIAR – flower recognition system • CAVIAR – face recognition system • Conclusions Jie Zou, Doclab ECSE, RPI

CAVIAR - Face • Not to implement a state-of-the-art face recognition system. • To demonstrate the wider applicability of CAVIAR methodology. • 400 FERET pictures of 200 subjects (“ba” and “bk” series) Jie Zou, Doclab ECSE, RPI

Face model and features • Face model contains only two pupils. • An automatic facial feature detection program locates the other 26 points. Thanks to Yan Tong, Zhiwei Zhu, and Dr. Qiang Ji. Jie Zou, Doclab ECSE, RPI

Computer Assisted Visual InterActive Recognition (CAVIAR)

Computer Assisted Visual InterActive Recognition (CAVIAR)

Presentation Transcript

CS 1674: Intro to Computer Vision Visual Recognition

Visual Word Recognition

CS 2770: Computer Vision Intro to Visual Recognition

Visual Word Recognition

Visual Pattern Recognition

Age: 3 weeks

CAVIAR

CAVIAR water vapour laboratory FTS measurements

Caviar

Visual Object Recognition

Visual Object Recognition

Visual Object Recognition

On Visual Recognition

Computer- Assisted Assessment

Best Salmon Offers UK | Fresh Wild Salmon Roe UK - Ecaviar

CATIA- Interactive three-dimensional computer-assisted application

Computer Assisted Assessment

CAVIAR

Visual Object Recognition

Visual Recognition

Caviar Market Trends Analysis 2020-2025

Caviar Market