Improving Property Tax Assessments
Sub-questions:
“Which factors are most influential in determining house price prediction outcomes?”
“To what extent are house prices predictable, and to what extent are they driven by unobservable factors?”
“Does the predictive performance of the model remain consistent across neighborhoods of varying wealth levels?”
Census ACS (American Community Survey, 2022)
OpenDataPhilly
Philadelphia Properties and Current Assessments (2023-2024)
Crime Incidents: Citywide crime incident reports
Universities: Spatial locations of educational institutions
Neighborhood Boundaries: Official neighborhood and planning district shapefiles
Build models progressively:
Model 1: Structural features only
- number of bathrooms, livable area (logged), garage spaces, house age (Quadratic Effect), exterior condition
Model 2: Census variables
- median_income, percentage of bachelor, percentage of poverty
Model 3: Spatial features
- nearest college, number of nearby crime
Model 4: Interactions and fixed effects
- neighborhood wealthy (interact with livable area)
Model Performance Improves with Each Layer
| Model | CV RMSE (log) | R² |
|---|---|---|
| Structural Only | 0.61 | 0.35 |
| + Census | 0.50 | 0.57 |
| + Spatial | 0.49 | 0.58 |
| + Interactions/FE | 0.48 | 0.59 |
Fourth Model RMSE: $154,200
Key Takeaway
The final interaction model effectively captures both structural and contextual determinants of housing prices in Philadelphia, combining property-level features with socioeconomic and spatial characteristics.
| Feature | Direction | Interpretation |
|---|---|---|
| Living area | ↑ | Strongest driver of housing price |
| Age + Age² | ↓ then ↑ | U-shaped pattern — older historic homes regain value |
| Exterior good | ↑ | Maintenance condition positively impacts price |
| Median income / Education | ↑ | Socioeconomic context drives demand |
| Poverty rate / Crime | ↓ | Negative neighborhood effects |
| Interaction: Living area × Wealthy neighborhood | ↓ | Larger homes add less |

Key Limitations
Next Steps
Risk of encoding structural inequalities from historical disinvestment
Introduce equity-weighted adjustments or localized calibration for underrepresented areas
Use residual maps to guide targeted reinvestment and housing policy
Any questions?