1 / 7

Foundations & Core in Computer Vision: A System Perspective

Foundations & Core in Computer Vision: A System Perspective. Ce Liu Microsoft Research New England. Vision vs. Learning. Computer vision: visual application of machine learning? Data  features  algorithms  data ML : design algorithms given input and output data

gafna
Télécharger la présentation

Foundations & Core in Computer Vision: A System Perspective

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

  2. Vision vs. Learning • Computer vision: visual application of machine learning? • Data  features  algorithms  data • ML: design algorithms given input and output data • CV: find the best input and output data given available algorithms

  3. Theoretical vs. Experimental • Theoretical analysis of a visual system • Best & worst cases • Average performance • Theoretical analysis is challenging as many visual distributions are hard to model (signal processing: 2nd order processes, machine learning: exponential families) • Experimental approach: full spectrum of system performance as a function of the amount of data, annotation, number of categories, noise, and other conditions

  4. Quality vs. Speed • HD videos, billions of images to index • Real time & 90% vs. one hour per frame & 95%? • Mechanism to balance quality and speed in modeling

  5. Automatic vs. semi-automatic • Common review feedback: parameters are hand-tuned; not clear how to set the parameters • Vision system user feedback: I don’t know how to tweak parameters! • Computer-oriented vs. human-oriented representations • Human-in-the-loop (collaborative) vision • How to optimally use humans (what, which and how accurate) beyond traditional active learning • Model design by crowd-sourcing • Learning by subtraction

  6. Algorithms vs. Sensors • Two approaches to solving a vision problem • Look at images, design algorithms, experiment, improve… • Look at cameras, design new/better sensors, … • Cameras for full-spectrum, high res, low noise, depth, motion, occluding boundary, object, … • What’s the optimal sensor/device for solving a vision problem? • What’s the limit of sensors?

  7. Thank you! Ce Liu Microsoft Research New England

More Related