Week-2-notes

Published

September 15, 2025

Algorithmic Decision Making & Census Data

Inputs: Synonyms: Independent variables, “x”’s, predictors, features

Outputs: Synonyms: dependent variable

Government

-            Idea is that using data will result in more consistent and unbiased output

-            Reasons for use: Efficiency, consistency , objectivity, cost savings

1)       Data Science

2)       Data Analytics (MUSA)

3)       Machine Learning 

4)       AI

Subjectiveness

-            Data cleaning decisions

-            Data coding or classification

-            Collection

-            How you interpret results

-            What variables you use

Census Data Foundations

·       9 basic questions (age, race, sex, etc.)

·       Everyone counted every 10 years

·       Constitutional requirement

·       Determines political representation

American Community Survey (ACS)

·       3% of households surveyed annually

·       detailed questions (income, education, housing costs, etc)

o   replacement of the same “long form” in 2005

·       Areas: of only 65,000 people.

·       Pretty small sample but you get them every year. Only aggregate level

·       5-year estimates

o   Take all 1-year and combine together including census tracts (1500-1800 people). Most accurate we can get at the smallest geography and we consider them a neighborhood.

§  It is changed for redistricting

§  Frustrating for us when redistricting happens

o   Most reliable data, largest sample

Tidycensus

-            This package helps organize the census data