170 likes | 295 Vues
This document delves into the evolution and mechanisms of vision-based user interfaces (UIs), focusing on the pioneering work of Myron Krueger in the 1970s with Responsive Environments. Discover the foundational concepts such as background subtraction, image moments, and how these techniques facilitate detecting and tracking various elements in a scene. Tools like Intel's OpenCV and the Papier-Mâché Java API are discussed for their relevance in developing applications in a geek-driven culture.
E N D
Vision-based Interaction Scott Klemmer 17 November 2005
cs547: Blake Ross and Asa Dotzler Mozilla: Creating simple software in a geek-driven culture
The first vision-based interface • Myron Krueger used computer vision to create Responsive Environments (1970s) • “Reaction is the Medium” • http://www.artmuseum.net/w2vr/timeline/videoplace_video.html
How it works • Video and background are separated in analog using chroma key techniques(think broadcast news) • The first and last points of each raster are stored in the computer, and represent the person’s outline
Vision-based UIs: “Verbs” • Detecting and Tracking elements of a certain type in a scene • Capturing contents of detected objects • Recognizing individual members in an object class
Vision-based UIs: “Verbs” • Detecting and Tracking elements of a certain type in a scene
Vision-based UIs: “Verbs” • Capturing contents of detected objects
Vision-based UIs: “Verbs” • Recognizing individual members in a class
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects
I N F R A S T R U C T U R E Background Subtraction
Image Moments (of Inertia) • 0th moment is mass(total number of pixels)
Image Moments (of Inertia) • 1st moment is center
Image Moments (of Inertia) • 2nd moment is orientation
Tools for Vision apps • Intel’s OpenCV • C API to highly optimized image processing functions (threshold, dilate, optical flow, …) • http://www.intel.com/research/mrl/research/opencv • Fast to run! Slow to develop • Great for vision folks; too low-level for app folks • Papier-Mâché • Java API (and to some extent visual UI) for vision (and other physical input) • http://guir.berkeley.edu/papier-mache • Fast to develop! Slow to run • Great for app folks; ~5 fps can sometimes be too slow
Good Vision Books • Computer Vision: A Modern Approach • David Forsyth and Jean Ponce (2003) • Fantastic book; but goal is more theoretical understanding than practical application • Robot Vision • Berthold Horn (1987) • More focused on apps and interactive algorithms • Somewhat out of date