Computer vision
Topic outline
-
Welcome to the Computer Vision class!
Computer Vision is the branch of Computer Science whose goal is to model the real world or to recognize objects from digital images. These images can be acquired using still and video cameras, infrared cameras, radars, or specialized sensors such as those used in the medical field.
The students will be introduced to the basic techniques of the field of Computer Vision. They will learn to apply Image Processing techniques where appropriate.
We will concentrate on the black and white and color images acquired using standard video cameras. We will introduce basic processing techniques, such as edge detection, segmentation, texture characterization, and shape recognition.Instructor
Prof. Pascal Fua
Computer Vision Laboratory (CVLAB)
BC 310
E-mail: pascal.fua@epfl.chCourse Times and Locations
Lectures: Monday 13:15 - 15:00 CM13
Exercises: Tuesday 10:15 - 12:00 every other week. INM 200 (A-M), INM 202 (N-Z)
Please check the course schedule and bring your own laptops for the exercise sessions.
Questions
If you have any questions please post them in the discussion forum and we will answer you.
Contact TAs
If you have any questions please email one of the TAs and we can arrange a meeting.
Chen Zhao (chen.zhao@epfl.ch)
Corentin Dumery (corentin.dumery@epfl.ch)
Aoxiang Fan (aoxiang.fan@epfl.ch)Deniz Mercadier (deniz.mercadier@epfl.ch)
Graded Exercise Sessions
We will grade two of the exercise sessions. They will count for 10% of you final grade each. There will be around two hours for you to implement some algorithms. You must join the graded exercise sessions in person, otherwise you will lose the points.
Recorded Lectures
The lectures will be recorded and deposited on this channel.Final exam
It will be a 90min closed book exam with multiple-choice and open-ended questions. You will be allowed ONE hand-written A4 page of notes. It will count for 80% of your final grade. -
19-02-2024 Course 26-02-2024 Course 27-02-2024 Exercise Session 1 04-03-2024 Course 11-03-2024 Course 12-03-2024 Exercise Session 2 18-03-2024 Course 25-03-2024 Course 26-03-2024 Exercise Session 3 GRADED 08-04-2024 Course 15-04-2024 Course 16-04-2024 Exercise Session 4 22-04-2024 Course 29-04-2024 Course 30-04-2024 Exercise Session 5 06-05-2024 Course 13-05-2024 Course 14-05-2024 Exercise Session 6 GRADED 27-05-2024 Course 28-05-2024 Exercise Session 7 -
R. Szeliki, Computer Vision: Computer Vision: Algorithms and Applications, 2021.
R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, 2003.
-
Edge definition, edge operators, Canny edge detector, and machine-learning based detectors.
-
Going from edge elements to complete outlines.
-
Partitioning images into separate regions of interest.
-
Texture: What is it and how can it be characterized and analyzed.
-
Recovering 3D shape from one single image.
-
Recovering Depth from Multiple Images
-
Recovering 3D shape from edges and occluding contours
-
Introduction to Python for Computer Vision
-
Convolutions, image filters, gradients
-
General Hough Transform
-
K-Means Clustering for Image Segmentation, Image Sharpening
-
-
-
Exercise Session 7 Answers File