
One-Shot Learning Gesture Recognition

  1. One-Shot Learning Gesture Recognition Students: Itay Hubara, Amit Nishry Supervisor: Maayan Harel Gal-On

  2. Outline

  3. Background Gesture recognition is a rapidly growing field in computer vision. Gesture recognition can be seen as a way for computers to begin to understand human body language.

  4. Goals Learn and understand existing gesture recognition algorithms. Compare different approaches. Design a gesture recognition algorithm which reduces training time.

  5. Data • The data is composed of several sets, each containing: • A gesture vocabulary (learning set) which contains only one sample per gesture. • A test set in which each video contains one or more gestures. • Each set has different vocabulary characteristics, such as large/small gestures, hand/leg movements, etc.

  6. Data – Train Base gesture

  7. Data – Test Multiple base gestures Large movements

  8. Data – Test Multiple base gestures Small movements

  9. Challenges • One-shot learning – only one training sample per gesture (unlike the common multi-class classification setting) • Segmentation of the test videos • The same gesture can have a different number of frames • Each set has different characteristics (small/big gestures)

  10. Outline

  11. Reduced Problem • Assume that each test video contains only one gesture • Goal: find a feature space and a distance function that give good separation in that feature space

  12. Features • Motion energy – subtracting consecutive frames • Space quantization (see the sketch below)
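
A minimal sketch (my own illustration, not the project's code) of the motion-energy feature described above: take the absolute difference of consecutive frames and quantize it over a coarse spatial grid, yielding one motion histogram per frame pair. The grid size and normalization are assumptions.

```python
import numpy as np

def motion_energy_histograms(frames, grid=(4, 4)):
    """frames: sequence of grayscale frames (H x W). Returns one motion
    histogram of grid[0]*grid[1] values per pair of consecutive frames."""
    gh, gw = grid
    hists = []
    for prev, curr in zip(frames[:-1], frames[1:]):
        diff = np.abs(curr.astype(np.float32) - prev.astype(np.float32))
        h, w = diff.shape
        # Sum the motion energy inside each spatial cell of the grid.
        cells = diff[:h - h % gh, :w - w % gw].reshape(gh, h // gh, gw, w // gw)
        hist = cells.sum(axis=(1, 3)).ravel()
        hists.append(hist / (hist.sum() + 1e-8))  # normalize per frame pair
    return np.array(hists)
```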

  13. Features • Harris corner detector – find interest points in the difference image based on corner detection (see the sketch below) • Space-Time Interest Points (STIP) – extend Harris to the time domain
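
A hedged sketch, assuming OpenCV is available, of the first bullet: Harris corner detection applied to the frame-difference image. Laptev's STIP (see References) extends the Harris criterion to the spatio-temporal domain and is not reproduced here; the threshold fraction below is an arbitrary choice.

```python
import cv2
import numpy as np

def harris_points_on_diff(prev_frame, curr_frame, quality=0.01):
    """prev_frame, curr_frame: grayscale uint8 frames of equal size.
    Returns (row, col) coordinates of Harris corners of the difference image."""
    diff = cv2.absdiff(curr_frame, prev_frame).astype(np.float32)
    response = cv2.cornerHarris(diff, blockSize=3, ksize=3, k=0.04)
    # Keep locations whose corner response exceeds a fraction of the maximum.
    ys, xs = np.where(response > quality * response.max())
    return np.stack([ys, xs], axis=1)
```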

  14. Features Harris STIP

  15. Features • Head Relative Interest Points

  16. Features Interest points Head Histogram
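
A minimal sketch of one plausible reading of the head-relative feature in slides 15–16: histogram the interest-point locations in coordinates relative to a detected head position. The binning and pixel range are my assumptions, not values from the project.

```python
import numpy as np

def head_relative_histogram(points, head_xy, bins=(8, 8), span=200):
    """points: (N, 2) array of (x, y) interest points; head_xy: (x, y) of the head.
    Returns a normalized 2-D histogram of offsets within +/- span pixels."""
    offsets = points - np.asarray(head_xy)
    hist, _, _ = np.histogram2d(
        offsets[:, 0], offsets[:, 1], bins=bins,
        range=[[-span, span], [-span, span]])
    total = hist.sum()
    return hist.ravel() / total if total > 0 else hist.ravel()
```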

  17. Distance Functions • A good feature space is defined not only by the features but also by the distance (similarity) function • Different features call for different distance functions

  18. Principal Motion Using PCA • Use principal component analysis (PCA) to find the main motion vectors. • For the test set – project the features onto each training gesture's principal components and evaluate the similarity (see the sketch below).
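
A hedged sketch of the principal-motion idea (Escalante & Guyon, see References): fit PCA on the motion histograms of each training gesture, then score a test sequence by how well each gesture's components reconstruct it. The number of components and the use of reconstruction error as the similarity score are assumptions.

```python
import numpy as np
from sklearn.decomposition import PCA

def fit_principal_motion(train_histograms, n_components=5):
    """train_histograms: dict {gesture_id: (T_i, D) motion histograms}.
    Assumes each training gesture has at least n_components frames."""
    return {g: PCA(n_components=n_components).fit(h)
            for g, h in train_histograms.items()}

def classify(test_histograms, models):
    """Return the gesture whose principal components reconstruct the test
    motion histograms (T, D) with the smallest mean squared error."""
    errors = {}
    for g, pca in models.items():
        recon = pca.inverse_transform(pca.transform(test_histograms))
        errors[g] = np.mean((test_histograms - recon) ** 2)
    return min(errors, key=errors.get)
```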

  19. Earth Mover's Distance • Given two distributions, EMD measures the minimum cost of shifting “dirt” from one distribution to the other.
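
A small illustration (not the project code) of EMD between two normalized 1-D histograms, using SciPy's Wasserstein distance with bin indices as the ground positions. Real motion histograms may call for a 2-D ground distance instead.

```python
import numpy as np
from scipy.stats import wasserstein_distance

def emd_1d(hist_a, hist_b):
    """hist_a, hist_b: non-negative 1-D histograms of equal length."""
    bins = np.arange(len(hist_a))
    return wasserstein_distance(bins, bins,
                                u_weights=hist_a, v_weights=hist_b)

# Example: all mass at opposite ends must travel the full histogram width.
print(emd_1d(np.array([1.0, 0, 0, 0]), np.array([0, 0, 0, 1.0])))  # -> 3.0
```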

  20. Perturbed Variation • Given two sample sets and a predefined amount of permitted variation, the distributions are optimally perturbed to best fit each other. • A transportation problem under the permitted-variation constraint.
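
A rough sketch of one way to read the transportation-problem formulation on an empirical level: match samples that are allowed to differ by at most eps, and measure how much mass is left unmatched. This is my own simplified reading, not the estimator of Harel & Mannor; eps and the unmatched-mass scoring are assumptions.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from scipy.spatial.distance import cdist

def perturbed_variation_sketch(samples_a, samples_b, eps):
    """samples_a: (n, d), samples_b: (m, d). Match samples closer than eps
    and report the average fraction of unmatched samples on both sides."""
    admissible = (cdist(samples_a, samples_b) <= eps).astype(float)
    # Maximizing the number of admissible pairs in an assignment gives a
    # maximum cardinality matching on the eps-neighborhood graph.
    rows, cols = linear_sum_assignment(-admissible)
    matched = int(admissible[rows, cols].sum())
    n, m = len(samples_a), len(samples_b)
    return 0.5 * ((n - matched) / n + (m - matched) / m)
```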

  21. Levenshtein Distance • Measures the difference between two sequences. • Considers both lengths and classification (see the sketch below).
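
A standard dynamic-programming Levenshtein (edit) distance between two label sequences, e.g. the per-frame gesture labels of two videos; the application to frame labels is my illustration.

```python
def levenshtein(a, b):
    """Number of insertions, deletions and substitutions turning a into b."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (x != y)))  # substitution
        prev = curr
    return prev[-1]

print(levenshtein("kitten", "sitting"))  # -> 3
```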

  22. Results Top 20 Top 10

  23. Results

  24. Outline

  25. Complete Problem Separate-problems approach • Basic segmentation (equal-length / movement-based) Whole-problem approach • Moving window • Dynamic Time Warping (DTW)

  26. Moving Window • Move a window along the test video. • Assume the frames in each window contain only one gesture. • Perform the same basic analysis as before and build the distance matrix (see the sketch below).
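
A schematic sketch of the moving-window idea; the function names, window length, and step are my own choices. Each window of the test video is scored against every training gesture with one of the distance functions defined earlier (e.g. EMD or PCA reconstruction error), and the results are collected into a distance matrix.

```python
import numpy as np

def moving_window_distances(test_frames, train_models, distance_fn,
                            window=30, step=10):
    """Returns a (num_windows, num_gestures) distance matrix and the
    gesture order used for its columns."""
    gestures = sorted(train_models)
    rows = []
    for start in range(0, len(test_frames) - window + 1, step):
        segment = test_frames[start:start + window]
        rows.append([distance_fn(segment, train_models[g]) for g in gestures])
    return np.array(rows), gestures
```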

  27. Dynamic Time Warping • Create a state machine from the training data: • Model the standing position • From the standing position we can move to the start of any base gesture • Assume we can either move forward or stay in the same state. • For a given gesture – find the best path through the state machine (a minimal DTW sketch follows).
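
A minimal classic DTW sketch between two feature sequences (one frame descriptor per row). The project's version additionally chains gestures through a standing-position state machine; this only shows the core dynamic program.

```python
import numpy as np
from scipy.spatial.distance import cdist

def dtw_distance(seq_a, seq_b):
    """seq_a: (T1, D), seq_b: (T2, D). Returns the accumulated alignment cost."""
    cost = cdist(seq_a, seq_b)  # pairwise frame-to-frame distances
    acc = np.full((len(seq_a) + 1, len(seq_b) + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, len(seq_a) + 1):
        for j in range(1, len(seq_b) + 1):
            # Advance in either sequence or in both (stay / move-forward steps).
            acc[i, j] = cost[i - 1, j - 1] + min(acc[i - 1, j],
                                                 acc[i, j - 1],
                                                 acc[i - 1, j - 1])
    return acc[-1, -1]
```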

  28. Dynamic Time Warping

  29. Results

  30. Results

  31. Results Top 20 Top 10

  32. Outline

  33. Conclusions Each approach achieves its best results with a different feature and similarity function. Different algorithms have different strengths (segmentation / recognition). Segmentation requires a standing-position model.

  34. Conclusions • Unsupervised pre-processing algorithms help to better represent the data. • There is still a lot left to do in this field.

  35. Future Work Try different models for the standing position to improve segmentation results. Try combining DTW for segmentation and PCA for recognition. Use different unsupervised algorithms to better represent the data.

  36. References Ivan Laptev, "On Space-Time Interest Points", 2005. Hugo Jair Escalante and Isabelle Guyon, "Principal Motion: PCA-based Reconstruction of Motion Histograms". M. Harel, S. Mannor, "The Perturbed Variation", NIPS 2012. Elizaveta Levina, Peter Bickel, "The Earth Mover's Distance is the Mallows Distance: Some Insights from Statistics". Ofir Pele, Michael Werman, "Fast and Robust Earth Mover's Distances", 2008.
