3D Fingertip and Palm Tracking in Depth Image Sequences
300 likes | 466 Vues
3D Fingertip and Palm Tracking in Depth Image Sequences. Hui Liang , Junsong Yuan and Daniel Thalmann. Proceedings of the 20th ACM international conference on Multimedia 2012. Outline. Introduction Related Work Proposed Method Experimental Results Conclusion. Introduction.
3D Fingertip and Palm Tracking in Depth Image Sequences
E N D
Presentation Transcript
3D Fingertip and Palm Tracking in Depth Image Sequences Hui Liang, JunsongYuanand Daniel Thalmann Proceedings of the 20th ACM international conference on Multimedia2012
Outline • Introduction • Related Work • Proposed Method • Experimental Results • Conclusion
Introduction • Human hand is an essential body part for human-computer interaction. • The positions of tracked fingertips: hand pose estimation • Difficulty in fingertip tracking: Side-by-side Bending Nearby
Introduction • Many previous methods: • Only focus on extracting 2Dfingertips • Cannot track fingertips robustly for a freely moving hand • In this paper: • Present a robust fingertip and palm tracking scheme • With the input of depth images (KINECT) • Track the 3D fingertip positions quite accurately
Related work • Focus only on 2D fingertips:[4][5][6][9] • Based on contour analysis of the extracted hand region:[2][4][5][6] • Usually can track the fingertips for only stretched fingers.
Related work • In [6], • Fingertips are tracked for infrared image sequences. • It utilizes a template matching strategy • Fingertip tracking : Kalman filter • In [2], • Stereoscopic vision is adopted • maximize the distance center of gravity of the hand & the boundary
Related work • In [9] (Kinect), Minimum depth Depth < Threshold Circular filter
Related work • [2] S. Consei1, S. Bourennane, and L. Martin. Three dimensional fingertip tracking in stereovision, 2005. Proc. of the 7th Int’l Conf. on Advanced Concepts for Intelligent Vision Systems. • [4] K. Hsiao, T. Chen, and S. Chien. Fast fingertip positioning by combining particle filtering with particle random diffusion, 2008. Proc. IEEE Int’l Conf. on Multimedia and Expo. • [5] I. Katz, K. Gabayan, and H. Aghajan. A multi-touch surface using multiple cameras, 2007. Proc. of the 9th Int’l Conf. on Advanced concepts for intelligent vision systems. • [6] K. Oka, Y. Sato, and H. Koike. Real-time tracking of multiple fingertips and gesture recognition for augmented desk interface systems, 2002. Proc. IEEE Int’l Conf. on Automatic Face and Gesture Recognition. • [9] J. L. Raheja, A. Chaudhary, and K. Singal. Tracking of fingertips and centres of palm using kinect, 2011. Proc. Of the 3rd Int’l Conf. on Computational Intelligence, Modelling and Simulation.
Hand and Palm Detection • 1) Assume the hand is the nearest object • 2) Constrain global hand rotation by: • : global rotation angle of the hand
Hand and Palm Detection • Foreground Segmentation • Palm • Localization • Threshold the depth frame to obtain the foregroundF: • p : a pixel coordinate • z(p) : depth value (of point p ) • z0 : the minimum depth value • zD : depth threshold • Hand • Segmentation 0.2m foreground F
Hand and Palm Detection • Foreground Segmentation • Palm • Localization • The palm region is approximated with a circle: • pp: the palm center (of point p ) • rp : the radius • Assume that hand palm forms a globally largest blob • Cpequals to the largest inscribed circle of the contour of F . • 2D Kalman filter : reduce computational complexity • Hand • Segmentation
Hand and Palm Detection • Foreground Segmentation • Palm • Localization • Separate hand and forearmbyaline: • 1) Tangent to Cp • 2) Perpendicular to the orientation of the forearm • Orientation of the forearm : • The Eigenvector that corresponds to the largest Eigenvalue of the covariance matrix of the contour pixel coordinates of F • Hand • Segmentation Hand region : FV(2D)→ FD(3D)
Finger Detection and Tracking • Constraints on possible fingertip locations: • 1) Only in depth discontinuous region ( in contour Fv) • 2) | Depth(one point) – Depth(neighborhoods) | are important. • 3) Utilize the 3D geodesic shortest path (GSP) Fingertip vs. Non-fingertip Nearby Fingertips
Fingertip Detection • Fingertip detection • Fingertip tracking • Goal: detect all five fingertips in the depth image • Based on three depth-based features • Build a graph G : • Vh : contains of all points within FV (hand contour) • Eh : for each pair of vertices(p,q), 1) they are in the 8-neighborhood of each other 2) their 3D distance is within threshold τ Edge weight : 3D Euclidean distance
Fingertip Detection • Fingertip detection • Fingertip tracking • Calculate Geodesic distance dg(p): • From palm center ppfor each vertex Vh • Dijkstra graph search on Gh • GSP point set Ug(p): • The set of vertices on the shortest path from pp to p • Rectangle local feature RL(p): • Describe the neighborhood of a point p in FV • : ratio of 1s 1cm
Fingertip Detection • Fingertip detection • Fingertip tracking • Calculate Geodesic distance dg(p): 0.4 dg(p)
Fingertip Detection • Fingertip detection • Fingertip tracking • Fingertip labeling: Nmax=6 j : label :estimate the probability that has the label lj number of GSP points frame number i : fingertip
Fingertip Detection • Fingertip detection • Fingertip tracking • Fingertip labeling:
Fingertip Tracking • Fingertip detection • Fingertip tracking • Build a particle filter for each fingertip • (x, ω) denote a particle • x : 2D position in FV • ω : the particle weight • denote a particle • Constrain the positions of each particle to the border point set UB to reduce the search space
Fingertip Tracking • Fingertip detection • Fingertip tracking • Likelihood function : Geodesic distance / GSP points Neighbor depth /
Fingertip Tracking • Fingertip detection • Fingertip tracking • Likelihood function :
Experimental Results • Quantitative results on synthetic sequences:
Experimental Results • Virtual object grasping:
Conclusion • Using multiple depth-based features for accurate fingertip localization • Adopting a particle filter to track the fingertips over successive frames • Track the 3D positions of fingertips robustly • Great potential for extension to other HCI applications