week-11-notes: Space-Time Prediction

Bike Share Demand Forecasting with Panel Data & Temporal Lags

1 The Space-Time Challenge

Goal: Build a system that predicts demand in space and time

Definition: Data that follows the same units over multiple time periods

Core idea: Past demand predicts future demand

The Challenge: Missing Observations Lag calculations break if rows are missing

Calculate all possible combinations - Create every possible station-hour combination - Join to actual trip counts - Fill missing with 0

Joining Station Attributes - Station location, demographics from census - Join to panel

Adding Time-Varying Features - Weather changes hourly - Create time features

You CANNOT train on the future to predict the past!

We’ll build 5 models, adding complexity:

Goal: See which features improve prediction accuracy

Evaluating Models: MAE

For a bike rebalancing system:

Prediction accuracy matters most at high-volume stations
- Running out of bikes downtown causes more complaints
- But: Is this equitable?
Temporal patterns reveal operational windows
- Rebalance during overnight hours (low demand)
- Pre-position bikes before AM rush
Spatial patterns suggest infrastructure gaps
- Persistent errors in certain neighborhoods
- Maybe add more stations? Increase capacity?

More temporal features:
- Precipitation forecast (not just current)
- Event calendars (concerts, sports games)
- School schedules
More spatial features:
- Points of interest (offices, restaurants, parks)
- Transit service frequency
- Bike lane connectivity
Better model specification:
- Interactions (e.g., weekend * hour)
- Non-linear effects (splines for time of day)
- Different models for different station types