5LSH0 (2016) - Advanced Video Content Analysis and Video Compression

Goal:
Learn the theory and algorithms of video signal analysis, and finding features and objects in image/video. Learn the basics of 3D image processing, sensing in 3D and 3D model reconstruction. Learn the practical basics of MATLAB programming and analysis and/or 3D algorithms for practical applications in surveillance and medical imaging.

Contents:
Content of the course is divided globally into three areas.

  1. New transformation techniques (wavelets and projective) for analysis in medical domain and 3D modeling in surveillance. The application of wavelets is given using the JPEG-2000 standard.
  2. 3D processing based on the camera pinhole model, homography and multi-view processing and calibration. Also registration of 3D datasets, 3D reconstruction models with TSDF, introduction to SLAM, RGB-Depth processing and specific algorithms like G2.0 and bundle adjustment. Finally, the 3D processing modules end with plane/object segmentation in 3D.
  3. Techniques for object detection and recognition, feature extraction and analysis, like SIFT and Laplacian methods. Furthermore, semantic level processing for understanding events and scenes, including human behavior. Furthermore, classification techniques for understanding objects and events. Modern classification like K-means and SVM (support vector machine) algorithms, evolving into basics of learning with neural nets. This part will gradually evolve to deep learning fundamentals.

The computer assignments aim at applying the knowledge and algorithms (or parts of them) to provide the student a framework for experiments with video content understanding and 3D image-babsed modeling for both surveillance and medical applications.

Preknowledge:
5LSE0 - Multimedia Video Coding and Architectures (recommended)
5XSA0 - Introduction Medical Imaging Processing (recommended)

Schedule and location: 
All lectures in a single week at the end of August, about 40% is dedicated to computer exercises.
Full schedule: 

All lectures in Flux 1.06. Schedule:

  • 22 aug 09.30-12.30 : Wavelet transformation theory and JPEG2000
  • 22 aug 13.30-17.30 : 3D Pinhole camera, projective transformation
  • 23 aug 09.30-12.30 : 3D Registration, data fusion and SLAM reconstruction
  • 23 aug 13.30-17.30 : Visual feature extraction
  • 24 aug 09.30-12.30 : Motion analysis and estimation
  • 24 aug 13.30-17.30 : Object-level content analysis - segmentation
  • 25 aug 09.30-12.30 : Object-level content analysis - tracking
  • 25 aug 13.30-17.30 : Semantic-level content analysis and classification 1
  • 26 aug 09.30-12.30 : Semantic-level content analysis and classification 2
  • 26 aug 13.30-17.30 : Analysis applications (medical and surveillance)
Slides: 
Module 1A: Wavelets
Module 1B: JPEG2000
Module 3: 3D Registration, data fusion and SLAM reconstruction
Module 4: Visual feature extraction part 1
Module 5: Motion analysis and estimation
Module 6: Object-level content analysis: Segmentation
Module 7: Object-level content analysis: Tracking
Module 8: Semantic-level content analysis and classification I
Module 9: Semantic-level content analysis and classification II
Module 10A: Analysis applications: Sports
Module 10B: Analysis applications: Surveillance
Instruction sheets: 
Code of conduct