Exploring Goodreads: Patterns in Ratings & Pages

This is a PDF I submitted for my final project in Data Engineering at Northwestern. The project documents my work processing a dataset of book ratings from Goodreads, including steps for building a Google Cloud data pipeline and handling new review data. The goal was to demonstrate my ability to understand and work with cloud-based workflows and data pipelines, as well as to extract insights from a dataset.

I used Tableau to create visualizations that highlight interesting patterns in the data, such as the correlation between the number of pages and average book ratings, illustrating my ability to tell stories and make recommendations with data.

Note: The focus of this project was on documenting the workflow and insights, rather than on the aesthetic design of the PDF.

View Project PDF