Homework
E N D
Presentation Transcript
Homework • Homework #1 is up • Programming Language: whatever • Write your own code • HW questions about code: Be succinct and clear • Partial credit will be given • So briefly justifying your answers may help
Look at the homework early, because… • No class next Monday • Your TA is out next week • In the meantime, we’re both available via e-mail
Project Guidelines • Project proposal due October 16 (~1 pg) • Who is in your group • Your task (and why is it interesting?) • Where did/will you get your data? • Which ML algorithms will you try first? • Final project write-up due December 8th • Web page • Report (~4 pgs, ACM format…link on course page)
Some project ideas • The “standard” problems • Handwriting, text classification, disease detection, etc.; see the UCI ML repository • Recommendations • E.g., the Netflix prize • Sports predictions • Question: how does “intransitivity” impact ML? • Multi-task learning • TinyGrams • Google n-grams corpus gives P(phrase) for up to 5-word phrases • Based on 1 trillion words: unprecedented coverage • But around 150G uncompressed – could an ML approximation fit in memory?
More “researchy” projects • A couple of project ideas related to information extraction • Example: TextRunner • If interested, drop me an e-mail (soon)