210 likes | 228 Vues
Explore system theoretic approach for lip articulation synthesis & classification. Includes features, trajectories, speech recognition, and dynamic systems. Experimental results & future work discussed.
 
                
                E N D
A System Theoretic Approach to Synthesis and Classification of Lip Articulation H.E.Çetingül, R.A. Chaudhry, R. Vidal Center for Imaging Science, Johns Hopkins University International Workshop on Dynamical Vision at ICCV 2007
Previous Work & Contributions 1T.Chen, “Audiovisual Speech Processing,” IEEE SPM 2001. 2H.E.Cetingul, E.Erzin, Y.Yemez, A.M.Tekalp, “Discriminative analysis of lip motion features for speaker identification and speech-reading,” IEEE TIP 15(10), 2006.
System Overview Lip Sequences Feature Extraction Lip Trajectories System Parameters of Lip Dynamics Lip Database Speaker/Speech Recognition Lip Movement Synthesis
Representation of Lip Articulation (1/2) 1N.Eveno, A.Caplier, P.-Y.Coulon, “Accurate and quasi-automatic lip tracking,” IEEE TCSVT 14(5), 2004.
ARMA System Identification (SID) 1P.V.Overschee, B.D.Moor, “N4SID:…,” Automatica, 1994. 2G.Doretto, A.Chiuso, Y.Wu, S.Soatto, “Dynamic textures,” IJCV 51(2), 2003.
Distances and Nearest Neighbor (NN) 1K.D.Cock, B.D.Moor, “Subspace angles and distances between ARMA models,” System and Control Letters 46(4), 2002.
Kernels and Support Vector Machines (1/2) 1R.J.Martin, “A metric for ARMA processes,” IEEE TSP 48(4), 2000. 2A.B.Chan, N.Vasconcelos, “Probabilistic kernels for the classification of autoregressive visual processes,” IEEE CVPR 2005.
Kernels and Support Vector Machines (2/2) 1S.Vishwanathan, A.Smola, R.Vidal, “Binet-Cauchy kernels on dynamical systems and its applications to the analysis of dynamic scenes,” IJCV 73(1), 2006.
Synthesis Results video01 video02 video03