Real-Time Video for Computer Entertainment
330 likes | 333 Vues
Real-Time Video for Computer Entertainment
Real-Time Video for Computer Entertainment
E N D
Presentation Transcript
Real-Time Video for Computer Entertainment Richard Marks Sony Computer Entertainment America Research & Development Division
Outline • • Background Background • • Video Input for PS2 Video Input for PS2 • • Natural Interfaces Natural Interfaces • • Conclusions Conclusions • • Future Work Future Work
Sony Computer Entertainment • • Subsidiary of Sony Subsidiary of Sony • • Responsible for PlayStation Responsible for PlayStation- -related products products • • SCEI (Tokyo) SCEI (Tokyo) • • SCEE (London) SCEE (London) • • SCEA (Foster City, CA SCEA (Foster City, CA-- --near related near SanFran SanFran) )
Sony Computer Entertainment America (SCEA) • • R & D: 15 people in Foster City, CA R & D: 15 people in Foster City, CA • • Mission: Catalyze new ideas for Mission: Catalyze new ideas for computer entertainment computer entertainment • • Focus: Software for PlayStation2 Focus: Software for PlayStation2 • Advanced rendering • Intelligent characters • Physical simulation • Digital interfaces
My Background • • MIT, B.S. 1990 in Avionics MIT, B.S. 1990 in Avionics • • Stanford, Stanford, Ph Ph.D. 1995 in underwater .D. 1995 in underwater robotics robotics • • Teleos Teleos Research, 10 person computer Research, 10 person computer vision startup vision startup • • Autodesk Autodesk, makers of AutoCAD, worked , makers of AutoCAD, worked on photo on photo- -to to- -model research model research • • Digital Video Art, graphics consulting Digital Video Art, graphics consulting
My Research Goals • • Explore new capabilities generated by Explore new capabilities generated by real real- -time video input to PS2 time video input to PS2 • Investigate alternative user-input mechanisms for PS2 (besides joystick) • Understand limitations of low-cost video input • Create new video-based entertainment models
Real-time Video Research • • Natural Interfaces Natural Interfaces • SIGGRAPH 2000 sketch, emerging technology • Game Developer Conference 2001 • • Enhanced Reality Enhanced Reality • ACM1 • SIGGRAPH 2001
PlayStation2, not PC • • Platform is constant Platform is constant • iLINK (IEEE1394) and 2 USB ports • Known compute capability (much) • • Very diverse audience Very diverse audience • • Unique architecture Unique architecture • Highly parallel • Micro-programmable • Data-centric
PlayStation2 EE GS Emotion Engine Graphics Synthesizer RAM (32 MB) MIPS-based core 2 FP Vector Units Image Proc. Unit VIDEO OUT 4 MB on-die VRAM RAM (2 MB) SPU2 IOP AUDIO OUT 2 USB I/O Processor Sound Processor 1 iLINK (IEEE 1394) HD/ETHERNET ROM DVD/CD
Current Setup • • USB USB webcam webcam (<$50) • 30 Hz YUV420 video • 320x240 compressed,160x120 uncompressed (<$50) • • Video processing performed by core Video processing performed by core • Decompression (bit-stream decode, IDCT) • Low-level image filters (smooth, threshold, etc.) • Segmentation, matching, tracking • • Demo Demo
Natural Interfaces • • Intuitive Intuitive • • Simple Simple • • Enabling Enabling • • Enjoyable Enjoyable ⇒ ⇒ ⇒ ⇒ Video Video- -based interfaces, with and based interfaces, with and without props without props
Specification • • Fun Fun • • Intuitive Intuitive • • Enabling Enabling • • Real Real- -time time • 30 frames/second • Less than 3 frames total latency • • Robust Robust • Graceful failure/error recovery
Medieval Chamber • • Multiple color Multiple color- -based tracking approaches based tracking approaches • Richard Marks • • Advanced rendering including shadows, Advanced rendering including shadows, transparency, reflections, etc. transparency, reflections, etc. • Gabor Nagy • • Physical simulation/collisions Physical simulation/collisions • Eric Larsen
Medieval Chamber • • Known camera, objects Known camera, objects • Spheres and cylinders have special projection properties • • Tracking steps Tracking steps • Color segmentation • Centroid, moment calculation • Windowed centroid, moment calculation • Color-transition detection • Situational probabilistic ambiguity resolution • Kalman filtering
Medieval Chamber • Sphere – x,y from centroid, – z from principal moment – Rx, Ry from dot centroid (given x,y) • Cylinder – x,y from centroid – Rz from angle of principal moment » Marker used to resolve ambiguity – z from secondary moment – Body Rx from principal moment (given z) » Foreshortening used to resolve ambiguity – Body Ry from helical stripe Y X Y X
Medieval Chamber • • Combination of sphere and cylinder Combination of sphere and cylinder provides most robust tracking provides most robust tracking
Marionette • • Alternative form of character control Alternative form of character control • • Traditional marionette Traditional marionette • Darwin the Wizard, created by Daniel Oates • • Virtual marionette Virtual marionette • 3D model by Care Michaud
Marionette • • Color segmentation Color segmentation • • Line fitting to find T shape Line fitting to find T shape • • T shape analysis to recover puppet T shape analysis to recover puppet parameters parameters
Planet Explorer • • 3D viewing, navigation 3D viewing, navigation • • Earth rendering Earth rendering • Greg Corson • • Rotating the ball rotates the earth Rotating the ball rotates the earth • • Proximity of ball to camera adjusts zoom Proximity of ball to camera adjusts zoom
Planet Explorer • • Color segmentation/ Color segmentation/centroid • • Principal moment to adjust zoom Principal moment to adjust zoom • • Motion Motion- -estimation to measure rotations estimation to measure rotations centroid to find ball to find ball
Fly • • Flight simulation Flight simulation • • Procedural landscape Procedural landscape • Tyler Daniel • • Relative arm angles determine bank angle Relative arm angles determine bank angle • • Average arm angles determine attack angle Average arm angles determine attack angle • • Arm motion increases airspeed Arm motion increases airspeed • • Shadow wings mimic arms Shadow wings mimic arms
Fly • • Centroid Centroid/ moment determines body extent / moment determines body extent • • Principal axis angles of outer regions Principal axis angles of outer regions correspond to arm angles correspond to arm angles
Conclusions • • Known props can provide more Known props can provide more information and still be natural information and still be natural • • Area Area- -based measurement more robust based measurement more robust and precise than lineal measures and precise than lineal measures • • Precision more important than accuracy Precision more important than accuracy • • Color sensitive to lighting, but color Color sensitive to lighting, but color transitions are not transitions are not
Conclusions (contd.) • • Make signal proportional to action Make signal proportional to action • • Secondary motion can enhance Secondary motion can enhance perceived response (and hide error) perceived response (and hide error) • • Display perspective important Display perspective important
Future Work • • Attach interfaces to Attach interfaces to real • • Investigate interfaces for creative Investigate interfaces for creative content generation content generation • • Enhanced Reality research Enhanced Reality research real games games
Enhanced Reality: What is it? • • Starting with live audio/video, enhance it Starting with live audio/video, enhance it by adding/modifying the content by adding/modifying the content • • Not virtual reality! Not virtual reality! • • Similar to Augmented Reality (a la MIT Similar to Augmented Reality (a la MIT Media Lab), but with entertainment focus Media Lab), but with entertainment focus • • Consists of both enhancements to user Consists of both enhancements to user and enhancements to environment and enhancements to environment
Color Transitions • • Project ( Project (Cr pixel onto a line pixel onto a line • • Similar to barcodes, Similar to barcodes, but selectable but selectable • • Maximal separation Maximal separation produces best results produces best results • • Robust to lighting Robust to lighting variation variation • • Patents pending Patents pending Cr, , Cb Cb) for each ) for each Cb Green Cr Magenta
Hand Puppet • • Very simple form of character control Very simple form of character control • • 3D models by Guy 3D models by Guy Burdick Burdick
Hand Puppet • • Color segmentation to locate hands Color segmentation to locate hands • • Split screen for left/right hands Split screen for left/right hands • • Centroid Centroid, moments and principal moment , moments and principal moment orientation orientation • • Split along principal moment, principal Split along principal moment, principal moment orientations provide mouth angle moment orientations provide mouth angle