CSE 450 - Machine Learning & Data Mining

Module 07: Course Conclusion, Project

Overview

This module will not have a traditional team-based case study as in other modules, instead, you'll have a chance to demonstrate your skills with a small individual project of your choosing.

This project is not intended to be a large, overwhelming project, but rather a chance for you to demonstrate that you can work with a dataset from start to finish, using the tools and processes discussed in the course.

You may choose any dataset, libraries, and tools you would like, with the exception that you should select a dataset we have not already used in class this semester. You do not have to use theses resources, but the UCI Data Repository and Kaggle have many freely available datasets that you might consider.

Requirements

For this project you need to demonstrate your skills in the following areas:

  1. Data preprocessing
  2. Selection, training, and use of a machine learning model
  3. Interpretation of results (for example, using proper metrics, understanding limitations)
  4. Communication of results
  5. Ethical implications

Submission

Please submit a document (.docx or .pdf) with the above requirements as headings. Then, after each heading, briefly describe your work in that area.

At the end, make sure to include:

The standard categories are:

  1. All requirements were met and additional work was done to demonstrate creativity and excellence by going above and beyond.
  2. All requirements were met.
  3. Some attempt was made, but was slightly deficient in approach or understanding.
  4. Some attempt was made, but was significantly deficient in approach or understanding.
  5. Some attempt was made, but was extremely deficient in approach or understanding.
  6. No attempt was made.

Your entire submission should be less than two pages (not including a title page).