
Machine Learning 101

Goal: Answer the question: what is machine learning, and how do you use it?

All models are wrong
Supervised Machine Learning – a fairly comprehensive but relatively easy introduction to machine learning. You don't need to memorize everything; just get a general idea of what machine learning is and what the basic techniques are.

Q&A Session – a short series of questions to “warm up” (based on the textbook):
1. What is the difference between supervised learning, unsupervised learning, and reinforcement learning?
2. What is the difference between regression and classification?
3. What is linear regression?
  1. How do we select the best linear regression model? Do you know what MSE is?
4. What is a decision tree?
5. What is a neural network?
6. What is overfitting and why is it a problem?
  1. How do we deal with this problem in linear regression? And in other models?
  2. Why do we separate the training (learning) set from the test set?
  3. What is cross-validation?
7. All models are wrong. Is this a problem?
8. Is 80% accuracy a good result?
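As a warm-up illustration for questions 3 and 6, here is a minimal sketch of fitting a one-feature linear regression by least squares and scoring it with MSE on a held-out test set. It is written in plain Python so it runs without any libraries; the synthetic data (true line y = 2x + 1 with Gaussian noise), the 80/20 split, and the helper names `fit_line` and `mse` are assumptions made for this example, not part of the course notebooks.

```python
import random

def fit_line(xs, ys):
    """Least-squares fit of y = a * x + b for a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    a = cov_xy / var_x
    b = mean_y - a * mean_x
    return a, b

def mse(xs, ys, a, b):
    """Mean squared error of the line y = a * x + b on the given points."""
    return sum((a * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Synthetic data (assumption): y = 2x + 1 plus Gaussian noise.
rng = random.Random(0)
xs = [i / 10 for i in range(100)]
ys = [2.0 * x + 1.0 + rng.gauss(0, 0.5) for x in xs]

# Shuffle the indices, then keep 80% for training and 20% for testing.
indices = list(range(len(xs)))
rng.shuffle(indices)
split = int(0.8 * len(indices))
train_idx, test_idx = indices[:split], indices[split:]
x_train = [xs[i] for i in train_idx]
y_train = [ys[i] for i in train_idx]
x_test = [xs[i] for i in test_idx]
y_test = [ys[i] for i in test_idx]

# Fit on the training set only; the test set is touched just once, for scoring.
a, b = fit_line(x_train, y_train)
train_mse = mse(x_train, y_train, a, b)
test_mse = mse(x_test, y_test, a, b)
print(f"a={a:.3f} b={b:.3f} train MSE={train_mse:.3f} test MSE={test_mse:.3f}")
```

A test MSE that is much larger than the training MSE is one common symptom of overfitting, which is why the model is scored on points it did not see during fitting.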
Practice session:
1. Today we will practice two basic models: linear regression (for regression problems) and logistic regression (for classification problems). There will also be a short bonus on neural networks. To do the tasks, go to the Jupyter Notebooks listed below.
2. ML. Regression
3. ML. Classification
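For orientation before opening the classification notebook, the following is a rough sketch of logistic regression trained with batch gradient descent on a single feature. The notebooks may well use a library instead; the synthetic two-class data, the learning rate, the number of epochs, and the helper names (`sigmoid`, `fit_logistic`, `predict`, `accuracy`) are all assumptions chosen for this illustration.

```python
import math
import random

def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def fit_logistic(xs, ys, lr=0.1, epochs=500):
    """Fit p(y=1 | x) = sigmoid(w * x + b) by batch gradient descent on log loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = 0.0
        grad_b = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(w * x + b) - y  # derivative of log loss w.r.t. the logit
            grad_w += err * x
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b

def predict(xs, w, b):
    """Predict class 1 when the estimated probability is at least 0.5."""
    return [1 if sigmoid(w * x + b) >= 0.5 else 0 for x in xs]

def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Synthetic data (assumption): class 0 centred at -1.5, class 1 centred at +1.5.
rng = random.Random(0)
xs = [rng.gauss(-1.5, 1.0) for _ in range(100)] + [rng.gauss(1.5, 1.0) for _ in range(100)]
ys = [0] * 100 + [1] * 100

w, b = fit_logistic(xs, ys)
# Accuracy is measured on the training data here, so it is an optimistic estimate.
acc = accuracy(ys, predict(xs, w, b))
print(f"w={w:.3f} b={b:.3f} training accuracy={acc:.3f}")
```

Because the two classes are balanced in this sketch, always guessing one class would score 50%, which gives the accuracy figure a baseline to be compared against.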
Advanced practice session:
1. If you want to tackle additional topics, do the optional Advanced section in both notebooks. This will give you the opportunity to generate artificial features (for linear regression) and learn about decision trees (for classification problems).
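To give a first feel for the decision trees mentioned above, here is a minimal sketch of a decision stump, that is, a tree with a single split on a single feature. It tries every midpoint between neighbouring feature values and keeps the threshold with the fewest training errors. The toy data and the function names `fit_stump` and `predict_stump` are hypothetical; full decision trees apply the same idea recursively and usually pick splits with an impurity measure rather than a raw error count.

```python
def fit_stump(xs, ys):
    """Return (threshold, left_label, right_label) with the fewest training errors.

    Assumes binary labels 0/1 and at least two distinct feature values.
    """
    sorted_x = sorted(set(xs))
    # Candidate thresholds: midpoints between neighbouring distinct values.
    candidates = [(lo + hi) / 2 for lo, hi in zip(sorted_x, sorted_x[1:])]
    best = None
    for t in candidates:
        for left_label in (0, 1):
            right_label = 1 - left_label
            errors = sum(
                (left_label if x <= t else right_label) != y
                for x, y in zip(xs, ys)
            )
            if best is None or errors < best[0]:
                best = (errors, t, left_label, right_label)
    return best[1], best[2], best[3]

def predict_stump(x, stump):
    """Classify one value with a fitted stump."""
    threshold, left_label, right_label = stump
    return left_label if x <= threshold else right_label

# Toy data (assumption): small values belong to class 0, large values to class 1.
xs = [1.0, 2.0, 3.0, 4.0, 6.0, 7.0, 8.0, 9.0]
ys = [0, 0, 0, 0, 1, 1, 1, 1]

stump = fit_stump(xs, ys)
print(stump)  # (5.0, 0, 1)
print(predict_stump(4.5, stump), predict_stump(5.5, stump))  # 0 1
```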