CSE 450 - Machine Learning & Data Mining

Module 04 — Bike Rentals, Case Study Discussion

Questions

You're at a strategy meeting with the stakeholders. They want to make sure you have the data required to answer the questions they're most interested in.

Be prepared to answer the following questions:

Network Layers and Hyperparameters

Zhao, CEO of WelcomeBike

My brother-in-law works for a big AI company in Hong Kong. We've chatted a bit about neural networks, and I was wondering how many layers you think the network should have.

If you run your model on the data and the results seem lower than you expected, which of the following hyperparameters do you feel has the most potential for model improvement?

  1. Number of neurons and number of hidden layers.
  2. Learning rate and optimizer selection.
  3. Batch size and number of epochs.
  4. Activation functions of the layers and overall loss function.

Feature Engineering

Johnny, Data Science Intern

Looking at the features we have, how do you think we should handle the temperature features?

Learning Rate

Zhao, CEO of WelcomeBike

My brother-in-law told me that I should be careful with how I handle learning rate in my network.

What approach do you think you're going to take to find the optimal learning rate?

Loss Function

Johnny, Data Science Intern

How will we know if our model has strong predictive power?

What are you planning to use for the loss function?

Predictive Risk Model

William, Investment Banker

We would like use AI to predict the likelihood of damage based on user profile data, such as name, birthday, sex, or address, so that we can add an insurance premium to the rental cost. We are concerned that there may be ethical/legal implications here, what would you recommend?

Based on your initial analysis of the data, your team feels:

  1. We can do this, as long as the users have provided that profile data and we have kept record of previous damages.
  2. Using any profile data in this way would be inappropriate.
  3. As long as we only use name and address, that would be fine.
  4. Instead of using their profile data, we could track their usage via GPS and feed those live statistics (speed, angle, direction) into the model and deactivate the bike if the model predicts they will damage the bike. (For example, if they were going extremely fast down a crowded sidewalk.)

Pandemic and Health Concerns

Zhao, CEO of WelcomeBike

Coming out of the pandemic has put a strain on resources. Our customers are much more concerned about health and safety of sharing bikes.

In your executive summary, would you add your recommendations for when we could pull bikes out of rotation to clean and disinfect them? For example, what days and times we should bring them in?

We also need to know if we have any lasting problems now that we are a few years past COVID-19. What can we expect for forcasts into the future? Are we back on track or still recovering?


  1. CEO photo by Sung Wang on Unsplash 

  2. Investment Banker photo by steffen Wienberg on Unsplash 

  3. Data Science Intern photo by Fábio Lucas on Unsplash