Week 4: Supervised and Unsupervised Learning

Announcements

Friday

Supervised learning

  • From Algorithms perspective

  • From Dataset perspective

Unsupervised learning

Clustering

  • Overview

  • K-means

  • DBSCAN

Dimmension Reduction

  • Decomposing signals in components

  • Principal component analysis (PCA)

  • Independent component analysis (ICA)

The Diabetes dataset

  • Regression

  • Efron, Bradley, Trevor Hastie, Iain Johnstone, and Robert Tibshirani. "Least angle regression." The Annals of statistics 32, no. 2 (2004): 407-499.

  • Cited by 11138

  • Google Scholar has the Paper PDF file

  • lab 04

Midterm

  • Research questions

  • Literature review

  • Paper format, two-page poster and full length eight-page paper

  • Timeline

  • Team, student pairs

Concept

  • Loss function

  • Accuracy

  • Train-test

  • Cross validation

  • Overfitting and underfitting

Textbooks

All textbooks are free available online, and are optinal, not required.

Professor Ernst

  • Advice for researchers and students

  • Writing a technical paper (19 August 2023)

  • How to read an engineering research paper, by William Griswold"

  • Choosing a venue for a technical paper (29 September 2022)