Unfortunately I can't share the code for this project publicly, but you can click
here to access my Github repo containing code for other projects.
This report was a group assignment required for the UC Berkeley MIDS w203 Statistics for Data Science course, which I completed during Fall 2021. The goal of the project was use multivariable linear regression within an causal theory to explain how much Americans consider horsepower when purchasing a new vehicle. I significantly contributed to the introduction, EDA, model building and selection, as well as results sections of the report. All data wrangling, cleaning, and analysis were completed in R using RStudio. To complete our research, we utilized a variety of statistical concepts:
The final report submitted for this project is provided below: