Primary Data
| Dataset | Source | Description | Link |
|---|---|---|---|
| Eviction Filings | Eviction Lab | Weekly eviction filing counts by census tract, 2020-2025 | Eviction Lab Philadelphia Tracking |
MUSA 5080 - Final Project README
Zhiyuan Zhao & Fan Yang
December 6, 2025
An early warning system for predicting eviction filings at the census tract level in Philadelphia, developed for MUSA 5080: Public Policy Analytics.
This project develops a predictive model to forecast monthly eviction filings across Philadelphia’s 408 census tracts. The goal is to help the Philadelphia Housing Authority allocate prevention resources more effectively by identifying high-risk neighborhoods before evictions occur.
Philadelphia experiences over 1,000 eviction filings per month, ranking among the highest eviction rates in the nation. Many evictions are preventable with timely intervention—legal aid, rental assistance, and tenant counseling. This model enables proactive resource deployment rather than reactive crisis response.
| Dataset | Source | Description | Link |
|---|---|---|---|
| Eviction Filings | Eviction Lab | Weekly eviction filing counts by census tract, 2020-2025 | Eviction Lab Philadelphia Tracking |
| Dataset | Source | Description | Link |
|---|---|---|---|
| Census Demographics | U.S. Census Bureau | ACS 5-year estimates: poverty rate, median income, race/ethnicity | Census API |
| Renter Population | U.S. Census Bureau | Tenure data (renter vs. owner occupied) by tract | Census API |
| 311 Service Requests | OpenDataPhilly | Housing-related complaints (no heat, dangerous buildings, infestations) | OpenDataPhilly 311 |
| Census Tract Boundaries | U.S. Census Bureau | TIGER/Line Shapefiles for Philadelphia County | Census TIGER |
tidycensus R package (API key required: get key here)tigris R package├── data/ #Data used for this assignment
├── output/ #visualizations and tables
├── slides/ #presentation slides package
├── eviction_presentation.qmd # Presentation slides
├── final_Zhiyuan_Fan.qmd # Main analysis document
├── README.md # This file
└── .gitignore
Weekly data aggregated to monthly to address zero-inflation (68% zeros at weekly level → 34% at monthly level).
| Model | Description | McFadden’s R² |
|---|---|---|
| Baseline Poisson | Temporal lags only | 0.14 |
| Full Poisson | + Spatial lag, demographics, race | 0.23 |
| Full Negative Binomial | Same as above, allows overdispersion | 0.22 |
| Enhanced Poisson | + Hotspots, month effects, interactions | 0.24 |
Clone this repository
Obtain Census API key and store in .Renviron:
CENSUS_API_KEY=your_key_hereDownload Eviction Lab data and place in data/raw/
Download 311 data from OpenDataPhilly and place in data/raw/
Open eviction_prediction_enhanced.qmd in RStudio
Run all chunks to reproduce analysis
This project is for academic purposes (MUSA 5080, University of Pennsylvania).
Last updated: December 2025