|
COL774: Machine Learning
General Information
Semester: Sem II, 2024-25.
Instructor: Parag Singla (email: parags AT cse.iitd.ac.in)
Class Timings (Slot F):
- Tue, 11:00 am - 11:50am
- Thu, 11:00 am - 11:50am
- Fri, 11:00 am - 11:50am
Venue: LHC 318
Sign up for Piazza
Code: As (will be) announced in class.
TA Assignment: Coming soon!
Announcements
[Jan 28, 2025]: Assignment 1 is out! Due Date: Feb 14, 2025. 11:59 pm.
[Jan 27, 2025]: Course Website is finally up!
Course Objectives
(a) To familiarize with/develop the understanding of fundamental concepts of Machine
Learning (ML)
(b) To develop the understanding of working of a variety of ML algorithms (both
supervised as well as unsupervised)
(c) To learn to apply ML algorithms to real world
data/problems
(d) To update with some of the latest advances in the field
Course Content
NOTE: The exact list of topics below is tentative (until we are past that week).
We will update it as we go through the lectures in each week. So, stay tuned!
Week |
Topic |
Supplementary Notes (by Andrew Ng and Others) |
Class Notes/Other Resources |
1 | Introduction | |
Jan 2, Jan 3, Jan 7 (Refer to last year's notes on ML applications) |
2 | Supervised Learning Basics - Linear Regression, Gradient Descent |
lin-log-reg.pdf
|
Jan 9 (coming soon),
Jan 10, Jan 16 |
3 | Gradient Descent (Including Convergence Properties), Stochastic Gradient Descent |
lin-log-reg.pdf
| Jan 17, Jan 21, Jan 23,
Jan 24
| 4 | Linear Regression - alternate intepretation (probabilistic), Logistic Regression, GLMs |
lin-log-reg.pdf
|
|
5 | Gaussian Discriminant Analysis (GDA) |
gda_nb.pdf |
|
6 | Naive Bayes |
gda_nb.pdf
| |
7,8 | Support Vector Machines |
svm.pdf |
|
9 | Decision Trees, Random Forests |
Mitchell, Chapter 3.
dtrees.pdf.
Online Resources:
Random Forests,
Gradient Boosting - Wikipedia,
Paper by Friedman (2001) (up to Section 4.5)
|
|
10 | Neural Networks |
Mitchell, Chapter 4.
nnets.pdf
nnets-hw.pdf
|
|
11 | Deep Learning |
cnn.pdf
Online Resource:
Convolutional Neural Networks |
|
12 | K-Means, Gaussian Mixture Models |
kmeans.pdf
gmm.pdf
|
|
13 | Expectation Maximiation (EM), Principal Component Analysis (PCA) |
em.pdf
pca.pdf |
|
14 | Learning Theory, Model Selection |
Mitchell, Chapter 7.
theory.pdf
model.pdf |
|
For week-wise notes, see the Content Table Above.
Video Lectures:
Video Lectures from Previous offering can be acccessed here
COL 774, Sem I, 2023-24 Course Page (Search for Videos)
COL 774, Sem I, 2021-22 Course Page (Search for Videos)
References (latest)
References (older)
- Machine Learning.The Art and Science of Algorithms that Make Sense of Data
Peter Flach, Cambridge University Press, 2012.
- Machine Learning: A Probabilistic Perspective.
Kevin Murphy. MIT Press, 2012.
- Pattern Recognition and Machine Learning. Christopher Bishop. First Edition, Springer, 2006.
- Pattern Classification. Richard Duda, Peter Hart and David Stock. Second Edition, Wiley-Interscience, 2000.
- Machine Learning. Tom Mitchell. First Edition, McGraw-Hill, 1997.
Assignment Submission Instructions
- You are free to discuss the assignment problems with other students in the class. But all your code should be
produced independently without looking at/referring to anyone else's code.
- Python is the default programming
languages for the course. You should use it for programming your
assignments unless otherwise explicitly allowed.
- Code should be submitted using Moodle Page.
Make sure to include commenrs for readability.
- Create a separate directory
for each of the questions named by the question number. For instance, for question 1,
all your submissions files should be put in the directory named
Q1 (and so on for other questions). Put all the Question sub-directories in a single
top level directory. This directory should be named as "yourentrynumber_firstname_lastname".
For example, if your entry number is "2022cs19535" and your name is "Nitika Rao", your
submission directory should be named as "2022cs19535_nitika_rao". You should zip your
directory and name the resulting file as "yourentrynumber_firstname_lastname.zip" e.g. in
the above example it will be "2022cs19535_nitika_rao.zip". This single zip file should
be submitted online.
- Honor Code: Any cases of copying will be awarded a zero on the assignment and an additional penalty equal to the negative of the total
weightage of the assignment. More severe penalties may follow.
- Late Policy: You are allowed a total of 5 late (buffer) days acorss the first 3 assignments. You are free to decide how you would like to use them. The late policy (if any) for
the last assignment will be announced separately.
You will get a penalty of 10% deduction in marks (per day) for every additional late day in submission used beyond the allowed 5 buffer days (applicable to first 3 assignments only).
- Audit Policy: To get an Audit Pass in the course, you are required to get a score equivalent to getting a C grade (or more) in the course. Additionally, you are required to satisfy any
attendance requirements as stipulated by the institute, which is 75% or more attendance in the class.
Practice Questions
Assignments
- Assignment 1
Due Date: Friday February 14, 2025. 11:59 pm
Grading Policy (Tentative)
Assignments (4) | Ass1: 7%. Ass2: 9%, Ass3: 9%, Ass4: 10 %. [Total Assignment Weight: 35%] |
Minor | 25% |
Major | 40% |
|