Week 3 Notes - Data Visualization & Exploratory Analysis

Published

September 22, 2025

Key Concepts Learned

ggplot2:
- Data is the actual datasets
- Aesthetics, variables mapped to visual properties (x ,y ,color, size )
- Geometries, how to display the data (points, bars, lines)
- Additional layers: scales, themes, facets and annotations
Aesthetics:
- x, y, are data positions
- color of the point/line
- fill, is the area color
- size, point/line size
- shape, point shape
- alpha, transparency
left_join() - keep all rows from left dataset
right_join() - keep all rows from right dataset
inner_join() - Keep matching only
full_join() - just merge the datasets

Analyzing bias within data before running analysis and providing recommendations
this is done to allow us to ensure that no group is discriminated or biased against within the recommendations.