BBM406: Fundamentals of Machine Learning

Fall 2019

Course Project

An integral part of the course is the class project (30% of the grade), which gives students a chance to apply the algorithms discussed in class to a research oriented project. This semester the theme is Machine Learning for Good.

Some example project titles are as follows:

Recommended Reading

  1. Neal Jean, Marshall Burke, Michael Xie, William Davis, David Lobell, Stefano Ermon. Combining Satellite Imagery and Machine Learning to Predict Poverty. Science, 2016
  2. Hayate Iso, Shoko Wakamiya, Eiji Aramaki. Forecasting Word Model: Twitter-based Influenza Surveillance and Prediction/ COLING, 2016
  3. Dayong Wang, Aditya Khosla, Rishab Gargeya, Humayun Irshad, Andrew H. Beck. Deep learning for identifying metastatic breast cancer. arXiv Preprint:1606.05718, 2016
  4. Christos Kyrkou, Theocharis Theocharides. Deep-Learning-Based Aerial Image Classification for Emergency Response Applications Using Unmanned Aerial Vehicles. CVPR Workshops, 2019
  5. Rohan Kshirsagar, Tyrus Cukuvac, Kathleen McKeown, Susan McGregor. Predictive Embeddings for Hate Speech Detection on Twitter. EMNLP Workshops, 2018
  6. David Rolnick et al. Tackling Climate Change with Machine Learning. arXiv preprint:1906.05433, 2019

Resources

In your projects, you may use a dataset available on the web (some example datasets are listed below) or collect your own data. However, if you choose the latter option, you must you must keep in mind that data collection can be fun and exciting, but it is also time-consuming.

Software and Libraries

You are encouraged to learn and use the following machine learning and deep learning frameworks in your projects. Links to some useful NLP tools are also provided.

Deliverables

In preparing your progress and final project reports, you should use the provided LaTeX template and submit them electronically in PDF format. Late submissions will be penalized.

Collaboration Policy

Each project should be done in groups of 3 students. Of course, there may be some exceptions, depending on the enrollment. Note that students without a team will be randomly assigned to one project group.

Grading

Project Proposal

Each project group should submit a half page project proposal on their specific project idea by November 10, 2019. The proposal should provide

Blog posts/GitHub commits/Meetings with TAs

Each project group should maintain a blog sharing their steady progress, ideas, and experiments, and they must write at least one blog post per week (excluding exam weeks). NEW Moreover, they will regularly meet with TAs to discuss their progress and get feedback. Each group should maintain a GitHub repository for their project (must be viewable to the TAs and instructor). The frequency of your commits to GitHub will also be graded.

Progress Report

Due: December 22, 2019 (11:59pm)

Each student should submit a project progress report by December 4, 2017. The report should be 3-4 pages and should describe the following points as clearly as possible:

Project Presentations

Due: January 8-10, 2020 (in class)

Each project group will have ~8 mins to present their work in class. The suggested outline for the presentations are as follows:

In addition to classroom presentations, each group should also prepare an engaging video presentation of their work using online tools such as PowToon, moovly or GoAnimate. The deadline is January 12, 2020.

Final Report

Due: January 15, 2020 (11:59pm)

As the last deliverable of the course project, each group is expected to submit a project report prepared using the style files provided in the course web page. The report should be 6-8 pages and should be structured as a research paper. It will be graded based on clarity of presentation and technical content. A typical organization of a report might follow: