Syllabus
-
EventDateTopicContents
-
Lecture02/21/2024 15:10
WednesdayCourse IntroductionLogistic, Introduction, History
-
Lecture02/28/2024 15:10
WednesdayClassic Vision IImage processing: image gradient, filter, convolution.
Classic edge/corner/line detection methods: Canny edge detection, Harris corner detection, line fitting.
-
Lecture03/06/2024 15:10
WednesdayClassic Vision II and Deep Learning IEdge/line detection: Canny edge detection, line fitting(RANSAC, Hough transform).
Corner detection and feature descriptors: Harris corner detection, feature.
Outline of deep leanring.
-
Lecture03/13/2024 15:10
WednesdayDeep Learning IIOutline of deep leanring.
Network structure: Multi-Layer Perceptron (MLP) and Convolutional Neural Network (CNN).
-
Assignment03/15/2024
FridayAssignment #1 released -
Lecture03/20/2024 15:10
WednesdayDeep Learning IIICNN Training: weight initialization, loss, optimizer (gradient descent, SGD, Adam), learning rate.
-
Lecture03/27/2024 15:10
WednesdayDeep Learning IVUnderfitting and overfitting: batch normalization, skip link, augmentation, regularization
Classification task: KNN, SoftMax, cross entropy loss
-
Due03/30/2024 23:59
SaturdayAssignment #1 due -
Lecture04/03/2024 15:10
WednesdayDeep Learning IVClassification task: receptive field, architecture(VGG, ResNet…)
Segmentation task: K-Means, upsampling, architecture (FCN, UNet)
-
Assignment04/04/2024
ThursdayAssignment #2 released -
Lecture04/10/2024 15:10
Wednesday3D Vision ICamera model: pinhole camera, lense, intrinsic, extrinsic, calibration
Depth image: sensor, stereo
-
Lecture04/17/2024 15:10
Wednesday3D Vision II (3D Deep Learning)3D data (voxel, mesh); 3D Deep Learning: Point, Mesh; 3D Deep Learning: Sparse Voxel Conv
-
Due04/20/2024 23:59
SaturdayAssignment #2 due -
Exam04/24/2024 15:10
WednesdayMidterm Exam -
Lecture04/24/2024 17:10
WednesdayTemporal Analysis IMotion and optical flow
-
Assignment04/26/2024
FridayAssignment #3 released -
Peking University Anniversary05/01/2024
WednesdayOne Week Break (Labor Day Holiday) -
Lecture05/08/2024 15:10
WednesdayTemporal Analysis IIRNN, LSTM, GRU; Video Analysis: 3D CNN
-
Due05/12/2024 23:59
SundayAssignment #3 due -
Lecture05/15/2024 15:10
WednesdayGuest Lecture by Dr. Zhizheng ZhangAttention mechanism, transformer, visual transformer, vision language pretraining, large language model, vision language model.
-
Lecture05/22/2024 15:10
WednesdayObject Detection and Instance Segmentation2D Object detector (SSD, RCNN series, YOLO); Instance Segmentation, Panoptic Segmentation; 3D object detection and instance segmentation
-
Assignment05/24/2024
FridayAssignment #4 released -
Lecture05/29/2024 15:10
WednesdayGenerative ModelVAE, GAN, Diffusion, text-to-image generation.
-
Lecture06/05/2024 15:10
WednesdayEmbodied AIObject grasping, object manipulation, locomotion and navigation, embodied vision-language models.
-
Due06/08/2024 23:59
SaturdayAssignment #4 due -
Exam06/19/2024 13:59
WednesdayFinalterm Exam