Skip to content

Commit 7cbb74b

Browse files
committed
class 10 updates
1 parent 00ececd commit 7cbb74b

File tree

3 files changed

+1024
-5
lines changed

3 files changed

+1024
-5
lines changed

Diff for: README.md

+5-4
Original file line numberDiff line numberDiff line change
@@ -286,7 +286,7 @@ Tuesday | Thursday
286286

287287
### Class 10: Linear Regression
288288
* Machine learning exercise ([article](http://blog.dominodatalab.com/10-interesting-uses-of-data-science/))
289-
* Linear regression
289+
* Linear regression ([notebook](http://nbviewer.ipython.org/github/justmarkham/DAT8/blob/master/notebooks/10_linear_regression.ipynb))
290290
* [Capital Bikeshare dataset](data/bikeshare.csv) used in a Kaggle competition
291291
* [Data dictionary](https://www.kaggle.com/c/bike-sharing-demand/data)
292292
* Feature engineering example: [Predicting User Engagement in Corporate Collaboration Network](http://nbviewer.ipython.org/github/mikeyea/DAT7_project/blob/master/final%20project/Class_Presention_MYea.ipynb)
@@ -330,12 +330,13 @@ Tuesday | Thursday
330330
### Class 12: Logistic Regression
331331
* Yelp votes homework due (solution)
332332
* Logistic regression ([notebook](http://nbviewer.ipython.org/github/justmarkham/DAT8/blob/master/notebooks/12_logistic_regression.ipynb))
333-
* Exercise with Titanic data ([instructions](homework/12_titanic.md), [solution](code/12_titanic.py))
334-
* Confusion matrix ([slides](slides/12_confusion_matrix.pdf))
333+
* [Glass identification dataset](https://archive.ics.uci.edu/ml/datasets/Glass+Identification)
334+
* Exercise with Titanic data ([notebook](http://nbviewer.ipython.org/github/justmarkham/DAT8/blob/master/notebooks/12_titanic_confusion.ipynb), [data](data/titanic.csv), [data dictionary](https://www.kaggle.com/c/titanic/data))
335+
* Confusion matrix ([slides](slides/12_confusion_matrix.pdf), [notebook](http://nbviewer.ipython.org/github/justmarkham/DAT8/blob/master/notebooks/12_titanic_confusion.ipynb))
335336
336337
**Homework:**
337338
* If you aren't yet comfortable with all of the confusion matrix terminology, watch Rahul Patwari's videos on [Intuitive sensitivity and specificity](https://www.youtube.com/watch?v=U4_3fditnWg) (9 minutes) and [The tradeoff between sensitivity and specificity](https://www.youtube.com/watch?v=vtYDyGGeQyo) (13 minutes).
338-
* Video assignment on [ROC curves and AUC](homework/13_roc_auc.md)
339+
* Video/reading assignment on [ROC curves and AUC](homework/13_roc_auc.md)
339340
* Video/reading assignment on [cross-validation](homework/13_cross_validation.md)
340341
341342
**Resources:**

Diff for: homework/10_yelp_votes.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -22,4 +22,4 @@ This assignment uses a small subset of the data from Kaggle's [Yelp Business Rat
2222
7. **Bonus:** Think of some new features you could create from the existing data that might be predictive of the response. Figure out how to create those features in Pandas, add them to your model, and see if the RMSE improves.
2323
8. **Bonus:** Compare your best RMSE on the testing set with the RMSE for the "null model", which is the model that ignores all features and simply predicts the mean response value in the testing set.
2424
9. **Bonus:** Instead of treating this as a regression problem, treat it as a classification problem and see what testing accuracy you can achieve with KNN.
25-
10. **Bonus:** Figure out how to use linear regression for classification, and compare its classification accuracy to KNN.
25+
10. **Bonus:** Figure out how to use linear regression for classification, and compare its classification accuracy with KNN's accuracy.

0 commit comments

Comments
 (0)