CS60050 : Machine Learning | Spring 2023, L-T-P: 3-0-0 |
Schedule
Instructor Aritra Hazra Timing (Slot: A3) Monday (08:00 – 10:00) and Tuesday (12:00 – 13:00) Venue NC443 (Nalanda Complex) Teaching Assistants Abhinav Bohra | Naincy Vimal | Rijoy Mukherjee | Somnath Hazra | Suryansh Kumar | Suvadeep Hajra
Notices and Announcements
- April 20, 2023
- The graded answers scripts for End-semester examination will be shown on Tuesday (02-May-2023) from 11:00 AM - 11:30 AM in CSE-120 (Ground Floor of CSE Dept.).
- March 10, 2023
- End-semester examination is scheduled on 18-Apr-2023 (AN) and will be conducted centrally by the institute. The syllabus is everything that is covered in the course.
- March 26, 2023
- Project (Phase-3) has been released and you have already received an email with the necessary details. Please note that the submission deadline is 15-Apr-2023 (STRICT)! The submissions will be accepted (only) via CSE-Moodle.
- February 26, 2023
- Project (Phase-2) has been released and you have already received an email with the necessary details. Please note that the submission deadline is 18-Mar-2023 (STRICT)! The submissions will be accepted (only) via CSE-Moodle.
- February 24, 2023
- The graded answers scripts for Mid-semester examination will be shown on Tuesday (28-Feb-2023) from 6:00 PM - 6:30 PM in CSE-120 (Ground Floor of CSE Dept.).
- February 07, 2023
- Mid-semester examination is scheduled on 15-Feb-2023 (AN) and will be conducted centrally by the institute. The syllabus is whatever covered till today's class (i.e. upto SVM).
- January 22, 2023
- Project (Phase-1) has been released and you have already received an email with the necessary details. Please note that the submission deadline is 11-Feb-2023 (STRICT)! The submissions will be accepted (only) via CSE-Moodle.
- January 01, 2023
- Hello everyone! Wish you all a very Happy New Year!
It is time for you all to understand and learn about how machines learn, which is the primary subject matter of this course. So, here we go ...
- The first class for this course will be held on 03-Jan-2023 (Tue, 12pm) at NC443 (Nalanda).
- You need to enrol into our course from CSE-Moodle too. If you are new to CSE-Moodle (this is not the Institute-Moodle!), please register there first. Then you can login/enrol. An email has been sent already detailing the same with the enrolment-key.
- To initiate our proceedings, the registered students are requested to go through the materials/slides posted under PROLOGUE part, to get an informal overview (metadata) of the course.
- December 31, 2022 – 08:00pm
- Shortlisting has been finalized based on all the requests available until now via ERP portal. No further requests / emails will be processed / addressed. Due to operational constraints, we are unable to consider all requests! (Only) the approved / registered students will receive an introductory mail soon (preferably by tomorrow) with all the required details.
- Selection Criteria Followed for Shortlisting Students:
- Research Scholar Students (MS and PhD) : Everyone
- CS Students (UG and PG) : CGPA >= 7.00 (Exception for UG-20 Batch : CGPA >= 8.25)
- Non-CS Students (UG and PG) : CGPA >= 9.00 (Exception for Dual-18 Batch : CGPA >= 8.00)
- Declined: All students who do not satisfy the above approval criteria.
- December 22, 2022
- Interested students are requested to apply via the ERP portal. Please keep in mind the following requirements.
- The timings for this course as mentioned above are to be treated as frozen and final. These timings are chosen so as to reduce clashes with other courses. However, it is impossible to find any slot that is free for everybody. So any further change may solve a few cases but will introduce new cases of clash. An applicant must not have any clash with these timings. If so, the students are supposed to make attempts to shift the other courses/labs clashing with these.
- I shall consider requests via ERP (only) from all students till 8:00pm, 31-Dec-2022 and finalize the approvals among those requested students based on their CGPA (only) immediately after 8:00pm (if possible). Considering the first class to be held on 03-Jan-2023, the declined students are requested to switch over to other courses before the subject registration deadline expires. We shall NOT consider any further request for approval beyond that. All approvals by me in ERP will wait until that time. I may not respond to individual e-mails about the application and the application-approval processes. Please note that, I cannot take all requested students due to seat limitations. However, this selection will primarily be based/prioritized on -- (i) preferring the final year students and research scholars, and (ii) the CGPA of the students.
- Pre-requisites for this course are: Probability and Statistics, Linear Algebra (basics), Algorithms, Programming Knowledge (in C/C++/Java/Python).
Coverage
Topic Details Duration Schedule References PROLOGUE (Metadata) Course Information and Logistics.
Overview of Machine Learning – The Landscape.– Information – Introduction The Learning Problem.
Feasibility of Learning.2-hours 03-Jan-2023
09-Jan-2023Mitchell [1] (Chapter-1)
Mostafa-Ismail-Lin [2] (Chapter-1)
Alpaydin [3] (Chapter-1+2)
Duda-Hart-Stork [7] (Chapter-1)Concept Learning General-to-Specific Hypotheses Ordering.
Find-S and Candidate Elimination Algorithm.
Version Space and Inductive Bias.1.5-hour 09-Jan-2023 Mitchell [1] (Chapter-2) Decision Tree Learning Decision Tree Representation and Learning Algorithm (ID3).
Attribute Selection using Entropy Measures and Gains.
Hypotheses Space and Inductive bias.
Overfitting, Generalization and Occam's Razor.
Extensions: Continuous-valued Attributes, Alternative Attribute Selection Measures.1.5-hour 10-Jan-2023
16-Jan-2023Mitchell [1] (Chapter-3)
Alpaydin [3] (Chapter-9)
Tan-Steinbach-Kumar [4] (Chapter-4)Bayesian Learning Probability Overview.
MLE and MAP Estimates.
Naive Bayes Classifier.
Gaussian Naive Bayes Classifier.
Bayesian Networks.4-hours 16-Jan-2023
17-Jan-2023
23-Jan-2023Mitchell [1] (Chapter-6)
Tan-Steinbach-Kumar [4] (Chapter-5)
Bishop [5] (Chapter-8)
Alpaydin [3] (Chapter-14)Instance-based Learning k-Nearest Neighbour (kNN) Classifier.
Voronoi Diagram and Distance-Weighted kNN.
Distance Metrics and Curse of Dimensionality.
Computational Complexity: Condensing and High Dimensional Search (kd-tree).2-hours 23-Jan-2023
24-Jan-2023Mitchell [1] (Chapter-8)
Mostafa-Ismail-Lin [2] (Chapter-6)
Tan-Steinbach-Kumar [4] (Chapter-5)
Duda-Hart-Stork [7] (Chapter-4)Linear Models and Regression Linear Classification.
Linear Regression.
Non-linear Transformation.
Logistic Regression.2-hours 30-Jan-2023 Mostafa-Ismail-Lin [2] (Chapter-3)
Alpaydin [3] (Chapter-10)
Tan-Steinbach-Kumar [4] (Chapter-5)
Hastie-Tibshirani-Friedman [6] (Chapter-2+3)Artificial Neural Network Perceptron Learning Algorithm: Delta Rule and Gradient Descent.
Multi-layer Perceptron Learning: Backpropagation and Stochastic Gradient Descent.
Hypotheses Space, Inductive Bias and Convergence.
Variants of Neural Network Structures.2-hours 31-Jan-2023
06-Feb-2023Mitchell [1] (Chapter-4)
Mostafa-Ismail-Lin [2] (Chapter-7)
Alpaydin [3] (Chapter-11)
Tan-Steinbach-Kumar [4] (Chapter-5)
Hastie-Tibshirani-Friedman [6] (Chapter-11)Support Vector Machines Decision Boundary and Support Vector: Optimization and Primal-Dual Problem.
Extension to SVM: Soft Margin and Non-linear Decision Boundary.
Kernel Functions and Radial Basis Functions (detailed later).2-hours 06-Feb-2023
07-Feb-2023Mostafa-Ismail-Lin [2] (Chapter-8)
Hastie-Tibshirani-Friedman [6] (Chapter-12)
Alpaydin [3] (Chapter-10)
Tan-Steinbach-Kumar [4] (Chapter-5)Classifier / Hypothesis Evaluation Accuracy, Precision, Recall and F-Measures.
Scores, Sampling, Bootstraping and ROC.
Hypotheses Testing and Cross-validation.1-hour 13-Feb-2023 Mitchell [1] (Chapter-5)
Tan-Steinbach-Kumar [4] (Chapter-4+5)Computational Learning Theory Error and Noise Formalisms.
Training vs. Testing.
Theory of Generalization.
PAC Learnability and VC Dimensions.
Bias-Variance Tradeoffs.
Overfitting, Regularization and Validation.6-hours 27-Feb-2023
28-Feb-2023
06-Mar-2023
07-Mar-2023Mostafa-Ismail-Lin [2] (Chapter-2+4)
Shwartz-David [9]
Mitchell [1] (Chapter-7)Unsupervised Learning: Clustering Partitional Clustering and Hierarchical Clustering.
Cluster Types, Attributes and Salient Features.
k-Means, Hierarchical and Density-based Clustering Algorithms.
Inter and Intra Clustering Similarity, Cohesion and Separation.
MST and DBSCAN Clustering Algorithms.2-hours 13-Mar-2023 Tan-Steinbach-Kumar [4] (Chapter-8+9)
Hastie-Tibshirani-Friedman [6] (Chapter-14)
Duda-Hart-Stork [7] (Chapter-10)Ensemble Learning Bagging and Boosting.
Adaboost and Random Forest.2-hours 14-Mar-2023 Tan-Steinbach-Kumar [4] (Chapter-5)
Hastie-Tibshirani-Friedman [6] (Chapter-15)Some Leftover Topics
(Additional Details)Dimensionality Reduction and Principal Component Analysis (PCA).
Kernel Machines and Kernel Trick.
Radial Basis Functions (more details).
Core Learning Principles.
E-M Algorithm and Gaussian Mixture Model.
Hidden Markov Model.4-hours 20-Mar-2023
21-Mar-2023
27-Mar-2023Mostafa-Ismail-Lin [2] (Chapter-5+9+C)
Alpaydin [3] (Chapter-13+15)
Duda-Hart-Stork [7] (Chapter-3)
Murphy [8]Other Learning Paradigms Reinforcement Learning.
Deep Learning.
Transfer Learning.
Semi-supervised Learning.
Active Learning.5-hours 27-Mar-2023
28-Mar-2023
03-Apr-2023
10-Apr-2023Sutton-Barto [10]
Mitchell [1] (Chapter-13)
Alpaydin [3] (Chapter-18)
Goodfellow-Bengio-Courville [11]
(+ Research Survey Articles)Conclusion Explainability in Learning.
ML in Summary and Path Ahead.1-hour 10-Apr-2023 Molnar [12]
(+ Research Survey Articles)
** For Reference Slides/Materials, Visit the following Course Pages:Course by Dr. Pabitra Mitra (CSE, IIT Kharagpur)
Course by Dr. Yaser S. Abu-Mostafa (CalTech, USA)
Course by Dr. Tom Mitchell (CMU, USA)
Short Term Course by MIT
Books and References
- Tom Mitchell; Machine Learning, First Edition, McGraw Hill, 1997. [ TEXTBOOK ]
- Yaser S. Abu-Mostafa, Malik Magdon-Ismail, Hsuan-Tien Lin; Learning From Data, First Edition, AML Book, 2012. [ TEXTBOOK ]
- Ethem Alpaydin; Introduction to Machine Learning, Third Edition, The MIT Press, September 2014.
- Pang-Ning Tan, Michael Steinbach, Vipin Kumar; Introduction to Data Mining, Second Edition, Pearson Addison-Wesley, 2019.
- Christopher Bishop; Pattern Recognition and Machine Learning, First Edition, Springer-Verlag New York, 2006. [ Open-Access ]
- Trevor Hastie, Robert Tibshirani, Jerome Friedman; The Elements of Statistical Learning, Second Edition, Springer, 2009.
- Richard O. Duda, Peter E. Hart, David G. Stork; Pattern Classification, Second Edition, John Wiley & Sons, November 2000.
- Kevin P. Murphy; Machine Learning: A Probabilistic Perspective, MIT Press, 2012.
- Shai Shalev-Shwartz, Shai Ben-David; Understanding Machine Learning: From Theory to Algorithms, First Edition, Cambridge University Press, 2014.
- Richard S. Sutton and Andrew G. Barto; Reinforcement Learning: An Introduction, 2nd Edition, MIT Press, 2020.
- Ian Goodfellow, Yoshua Bengio and Aaron Courville; Deep Learning, MIT Press, 2016.
- Christoph Molnar; Interpretable Machine Learning, Leanpub Publisher, 2019.
Tests
- Mid-Semester [ Question | Solution ] (60 Marks, 15-Feb-2023, Wednesday, 2:00pm-4:00pm) [ Syllabus: upto SVM ]
- End-Semester [ Question | Solution ] (100 Marks, 18-Apr-2023, Tuesday, 2:00pm-5:00pm) [ Syllabus: FULL ]
Projects
Project Submission Page : CSE-Moodle
CS60050 : Machine Learning | Spring 2023, L-T-P: 3-0-0 |