IMDb
2024
Analyzing a Movie Dataset with SQL (BigQuery)

Project Details
Year: 2024
Link: GitHub Repository
IMDb is one of the most popular and comprehensive movie databases, with extensive data on films, TV shows, and related content. For this project, I set out to showcase my SQL (BigQuery) skills using a dataset available on Kaggle. I tested a few hypotheses and questions of my own to uncover trends in ratings, genres, and director performance.

The Tools
Because the goal was to showcase my SQL skills, I ran the whole analysis of movie ratings, genres, and director performance in SQL (BigQuery). For visualization, I used Looker.

Finalising
Through this project, I not only learned more advanced SQL techniques such as CTEs and JOINs, but also uncovered insights about the most popular movie genres during the pandemic era (2019–2024), the correlation between the number of movies a director has made and their average movie rating, and the top-rated movies overall. I also finally got the hang of Looker, building visualizations by connecting SQL query results to it directly.
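
To give a feel for the kind of CTE-plus-JOIN query mentioned above, here is a minimal sketch of the director question (number of movies versus average rating). It is not the exact query from the project: the table names (`movies`, `directors`), the column names, and the four toy rows are all invented for illustration, and Python's built-in sqlite3 stands in for BigQuery so the example runs anywhere.

```python
import sqlite3

# Hypothetical schema and toy rows, invented purely for illustration.
# The real Kaggle dataset's tables and columns may look different.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE directors (director_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE movies (
        movie_id INTEGER PRIMARY KEY,
        title TEXT,
        director_id INTEGER,
        rating REAL
    );
    INSERT INTO directors VALUES (1, 'Director A'), (2, 'Director B');
    INSERT INTO movies VALUES
        (1, 'Movie 1', 1, 8.0),
        (2, 'Movie 2', 1, 7.0),
        (3, 'Movie 3', 1, 6.0),
        (4, 'Movie 4', 2, 9.0);
""")

# The CTE aggregates per director first; the JOIN then attaches the name.
QUERY = """
WITH director_stats AS (
    SELECT director_id,
           COUNT(*)    AS movie_count,
           AVG(rating) AS avg_rating
    FROM movies
    GROUP BY director_id
)
SELECT d.name, s.movie_count, ROUND(s.avg_rating, 2) AS avg_rating
FROM director_stats AS s
JOIN directors AS d ON d.director_id = s.director_id
ORDER BY s.movie_count DESC, d.name
"""

rows = conn.execute(QUERY).fetchall()
for name, movie_count, avg_rating in rows:
    print(name, movie_count, avg_rating)
```

The SQL inside `QUERY` uses only standard constructs (`WITH`, `COUNT`, `AVG`, `ROUND`, `JOIN`), so it should carry over to BigQuery with the table references swapped for fully qualified ones. From there, BigQuery's `CORR` aggregate could be applied to `movie_count` and `avg_rating` to get a single correlation figure.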