1 / 48

CMSC 426: Image Processing (Computer Vision)

CMSC 426: Image Processing (Computer Vision) . David Jacobs. Vision. ``to know what is where, by looking.’’ (Marr). Where What. Why is Vision Interesting?. Psychology ~ 50% of cerebral cortex is for vision. Vision is how we experience the world. Engineering

bernad
Télécharger la présentation

CMSC 426: Image Processing (Computer Vision)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CMSC 426: Image Processing (Computer Vision) David Jacobs

  2. Vision • ``to know what is where, by looking.’’ (Marr). • Where • What

  3. Why is Vision Interesting? • Psychology • ~ 50% of cerebral cortex is for vision. • Vision is how we experience the world. • Engineering • Want machines to interact with world. • Digital images are everywhere.

  4. Vision is inferential: Light (http://www-bcs.mit.edu/people/adelson/checkershadow_illusion.html)

  5. Vision is Inferential

  6. Vision is Inferential: Geometry movie

  7. Vision is Inferential: Prior Knowledge

  8. Vision is Inferential: Prior Knowledge

  9. Computer Vision • Inference  Computation • Building machines that see • Modeling biological perception

  10. A Quick Tour of Computer Vision

  11. Boundary Detection: Local cues

  12. Boundary Detection: Local cues

  13. Boundary Detection http://www.robots.ox.ac.uk/~vdg/dynamics.html

  14. (Sharon, Balun, Brandt, Basri)

  15. Boundary Detection Finding the Corpus Callosum (G. Hamarneh, T. McInerney, D. Terzopoulos)

  16. Texture Photo Pattern Repeated

  17. Texture Photo Computer Generated

  18. Tracking (Comaniciu and Meer)

  19. Tracking (www.brickstream.com)

  20. Tracking

  21. Tracking

  22. Tracking

  23. Tracking

  24. Stereo http://www.ai.mit.edu/courses/6.801/lect/lect01_darrell.pdf

  25. Stereo http://www.magiceye.com/

  26. Stereo http://www.magiceye.com/

  27. Motion http://www.ai.mit.edu/courses/6.801/lect/lect01_darrell.pdf

  28. Motion - Application (www.realviz.com)

  29. Pose Determination Visually guided surgery

  30. Recognition - Shading Lighting affects appearance

  31. Classification (Funkhauser, Min, Kazhdan, Chen, Halderman, Dobkin, Jacobs)

  32. Vision depends on: • Geometry • Physics • The nature of objects in the world (This is the hardest part).

  33. Approaches to Vision

  34. Modeling + Algorithms • Build a simple model of the world (eg., flat, uniform intensity). • Find provably good algorithms. • Experiment on real world. • Update model. Problem: Too often models are simplistic or intractable.

  35. Bayesian inference • Bayes law: P(A|B) = P(B|A)*P(A)/P(B). • P(world|image) = P(image|world)*P(world)/P(image) • P(image|world) is computer graphics • Geometry of projection. • Physics of light and reflection. • P(world) means modeling objects in world. Leads to statistical/learning approaches. Problem: Too often probabilities can’t be known and are invented.

  36. Engineering • Focus on definite tasks with clear requirements. • Try ideas based on theory and get experience about what works. • Try to build reusable modules. Problem: Solutions that work under specific conditions may not generalize.

  37. The State of Computer Vision • Science • Study of intelligence seems to be hard. • Some interesting fundamental theory about specific problems. • Limited insight into how these interact.

  38. The State of Computer Vision • Technology • Interesting applications: inspection, graphics, security, internet…. • Some successful companies. Largest ~100-200 million in revenues. Many in-house applications. • Future: growth in digital images exciting.

  39. Related Fields • Graphics. “Vision is inverse graphics”. • Visual perception. • Neuroscience. • AI • Learning • Math: eg., geometry, stochastic processes. • Optimization.

  40. Contact Info • Prof: David Jacobs • Office: Room 4421, A.V. Williams Building (Next to CSIC). • Phone: (301) 405-0679 Email: djacobs@cs.umd.edu Homepage: http://www.cs.umd.edu/~djacobs • TA: Hyoungjune Yi • Email: aster@umiacs.umd.edu

  41. Tools Needed for Course • Math • Calculus • Linear Algebra (can be picked up). • Computer Science • Algorithms • Programming, we’ll use Matlab. • Signal Processing (we’ll teach a little).

  42. Rough Syllabus

  43. Course Organization Reading assignments in Forsyth & Ponce, plus some extras. ~6-8 Problem sets - Programming and paper and pencil Two quizzes, Final Exam. Grading: Problem sets 30%, quizzes: first quiz 10%; second quiz 20%; final 40%. Web page: www.cs.umd.edu/~djacobs/CMSC426/CMSC426.htm

More Related