Offerta formativa | Università degli Studi di Firenze

Course year

Second year - First Semester

Belonging Department

Information Engineering (DINFO)

Course Type

Single education field course

Scientific Area

ING-INF/05 - INFORMATION PROCESSING SYSTEMS

Credits

9

Teaching Hours

72

Teaching Term

19/09/2016 ⇒ 23/12/2016

Attendance required

No

Type of Evaluation

Final Grade

Lectureship

Teaching Language

Italian

Course Content

Computer Vision and Machine Learning for automatic recognition

This course offers an up-to-date, in-depth look at the most important solutions for visual recognition, including recent advances in deep learning architectures.
During the course, students will gain a detailed understanding of cutting-edge research in computer vision and will have the opportunity to take part in hands-on sessions in the lab.

Learning Objectives

1. Learn the state of the art of computer vision research for automatic recognition.

2. Learn how to implement these techniques and apply them to real problems.

Prerequisites

Fundamentals of image and video analysis

Teaching Methods

Classroom and labs

Further information

Students are tutored throughout the lectures and exercises.

Type of Assessment

Labs and Homework

Course program

VISUAL AND MULTIMEDIA RECOGNITION 2016-17

Week 1-2 Section 1. Section 2.
• Introduction to visual and multimedia recognition
• Global image features: Color; Texture; Edges and Lines
• Dimensionality reduction: PCA
• MPEG-7 holistic descriptors

Week 3-4 Section 4. Local image features
• Rotation invariant Harris corner detector
• Scale invariant keypoint detectors:
- Harris-Laplacian,
- SIFT and SURF features
• Affine invariant region detectors:
- MSER (Maximally Stable Extremal Regions)
• Local descriptors:
- SIFT, Color SIFT,
- SURF

Week 5-6 Section 5. Visual Words and Bag of Words representation
• Visual Words and Bag of Words model:
- vocabulary formation by K-means
- Radius-based clustering
• Evolution of the BoW model by Coding/Reconstruction-based approaches:
- Sparse Coding
- Locality-constrained Linear Coding
- Soft Assignment
- Fisher Vectors and VLAD
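
As an illustration of the basic Bag of Words model listed above, the following is a minimal sketch, not the course's lab code. It assumes local descriptors are already available as lists of floats; the function names (`build_vocabulary`, `nearest_word`, `bow_histogram`) and all parameters are hypothetical, and plain Lloyd's k-means with hard assignment is used in pure Python to stay dependency-free.

```python
import random

def sq_dist(u, v):
    """Squared Euclidean distance between two descriptors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def nearest_word(desc, vocabulary):
    """Index of the visual word closest to a descriptor (hard assignment)."""
    return min(range(len(vocabulary)), key=lambda j: sq_dist(desc, vocabulary[j]))

def build_vocabulary(descriptors, k, n_iter=20, seed=0):
    """Form a k-word vocabulary with plain Lloyd's k-means."""
    rng = random.Random(seed)
    vocabulary = [list(d) for d in rng.sample(descriptors, k)]
    for _ in range(n_iter):
        clusters = [[] for _ in range(k)]
        for d in descriptors:
            clusters[nearest_word(d, vocabulary)].append(d)
        for j, members in enumerate(clusters):
            if members:  # keep the previous word if a cluster becomes empty
                vocabulary[j] = [sum(col) / len(members) for col in zip(*members)]
    return vocabulary

def bow_histogram(descriptors, vocabulary):
    """L1-normalised histogram of visual-word occurrences for one image."""
    counts = [0] * len(vocabulary)
    for d in descriptors:
        counts[nearest_word(d, vocabulary)] += 1
    total = sum(counts)
    return [c / total for c in counts]

# Toy data: two well-separated groups of 2-D "descriptors".
training = [[0, 0], [0, 1], [1, 0], [1, 1],
            [10, 10], [10, 11], [11, 10], [11, 11]]
vocab = build_vocabulary(training, k=2)
image = [[0.2, 0.1], [0.9, 0.8], [0.5, 0.5], [10.4, 10.6]]
hist = bow_histogram(image, vocab)
```

The coding-based variants in the list (soft assignment, sparse coding, Fisher Vectors, VLAD) replace the hard `nearest_word` step with richer encodings of each descriptor.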

Week 7 Section 6. Object instance recognition
• Distance measures
• Nearest Neighbour Matching
• Geometric alignment and outlier rejection: Random Sample Consensus (RANSAC)
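
To make the Random Sample Consensus idea concrete, here is a minimal sketch under simplifying assumptions: it fits a 2-D line rather than the geometric transformation between two images used in instance recognition, and the names `fit_line` and `ransac_line`, the iteration count, and the threshold are all hypothetical choices for illustration.

```python
import random

def fit_line(p, q):
    """Line through p and q as (a, b, c), with a*x + b*y + c = 0 and a^2 + b^2 = 1."""
    (x1, y1), (x2, y2) = p, q
    a, b = y2 - y1, x1 - x2
    norm = (a * a + b * b) ** 0.5
    if norm == 0:
        return None  # degenerate sample: identical points
    a, b = a / norm, b / norm
    return a, b, -(a * x1 + b * y1)

def ransac_line(points, n_iter=200, threshold=0.1, seed=0):
    """Return the line with the largest consensus set, plus its inliers."""
    rng = random.Random(seed)
    best_model, best_inliers = None, []
    for _ in range(n_iter):
        p, q = rng.sample(points, 2)      # minimal sample for a line
        model = fit_line(p, q)
        if model is None:
            continue
        a, b, c = model
        # Inliers are points whose distance to the line is within the threshold.
        inliers = [pt for pt in points
                   if abs(a * pt[0] + b * pt[1] + c) <= threshold]
        if len(inliers) > len(best_inliers):
            best_model, best_inliers = model, inliers
    return best_model, best_inliers

# Toy data: 20 points on y = 2x + 1 plus 5 gross outliers.
line_points = [(x, 2 * x + 1) for x in range(20)]
outliers = [(0, 50), (3, -40), (7, 90), (12, -70), (15, 200)]
model, inliers = ransac_line(line_points + outliers)
```

In a matching pipeline the same loop would sample putative keypoint correspondences and estimate, for example, an affine transformation or a homography instead of a line.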

Week 8-9 Section 7. Object detection and categorization
• Bayes classification, Expectation Maximization (recall of statistical principles)
• Support Vector Machines classifier
• Boosting classifier, AdaBoost
• Probabilistic Latent Semantic Analysis classifier
• HOG (Histogram of Oriented Gradients) people detector
• Viola and Jones face detector
• Partial matching of sets of features

Week 9 Laboratory 1. Bag of Visual Words

Week 10 Section 8. Deep Learning
• Recall of Multilayer Networks
• Convolutional Neural Networks
• CNN for Visual Recognition

Week 11-12 Laboratory 2. Convolutional Neural Networks

Week 13 Section 9. Recognition with image sequences
• Spatio-temporal features and Detectors:
- Dollár’s spatio-temporal detector,
- Improved dense trajectories,
- Advanced solutions with deep nets
• Descriptors for Spatio-temporal features:
- HOG3D (Histogram of 3D Gradients),
- HOF (Histogram of Optical Flow),
- MBH (Motion Boundary Histogram),
- Dense Trajectory Descriptors

• Action and Event recognition
• Principles of Tracking

Week 14 Section 10. Matching at large scale
• Vocabulary Tree
• Multidimensional hashing:
- Locality-Sensitive Hashing,
- Pyramid Match Hashing,
- Semantic Hashing

Week 15 Section 11. Exploiting human and social knowledge
• ImageNet
• Exploiting data from Social Networks

B024319 - VISUAL AND MULTIMEDIA RECOGNITION

Academic Year 2016-17
