Predicting Spotify Song Popularity

A statistics project using R and linear regression to explain and predict song popularity from audio features in a one-million-track Spotify dataset.

RLinear RegressionStatisticsData Analysis
Spotify audio features plotted against song popularity

A statistics project that investigates what makes a song popular, built as the STAT 420 final project at UIUC. Using the one-million-track Spotify dataset from Kaggle, it builds linear regression models to explain and predict a track’s popularity from its audio features — applying data cleaning, variable transformations, and predictor selection along the way.