IT 415 Final Project

Final Project visualization : A closer look at movies and their budgets, revenue, and popularity from the years 2008-2018

Lealani Saulo
My finalized topic choice is of the "Movies Dataset" consisting of metadata on over 45,000 movies. This dataset was found on Kaggle and contains data on movies released either on or before July 2017 with the last update being a year ago. This data was originally captured by MovieLens
For this page, I embedded my visualization I created using Public Tableau.

One other part of my final project is a timeline visualization of the Highest and Lowest grossing films for each year from 2008-2017 using the tool Time Toast.


The data has files of around 26 million ratings from 270,000 users for all 45,000 movies with the rating scale being 1-5 and captured from the official GroupLens website. The data set listed on Kaggle includes the following files: movies_metadata.csv, keywords.csv, credits.csv, links.csv, links_small.csv, and ratings_small.csv.
For my final project, I have used the movies_metadata.csv and keywords.csv files and will be focusing on movies released in the years 2008-2018.

Instructional Use and Description:
    Tabs 1 & 2 display a treemap diagram where the size of the box depends on the total revenue generated from the movie and the intensity of the color depends on the movie budget.

    Tab 1 - Movies with budgets less than 190 million

    Tab 2 - Movies with budgets more than 190 million
  • For Tabs 1 & 2, hover over a specific block to display a tooltip presenting information on the movie. Movie Title, Production Company, Release Date, Budget, Revenue
  • On the right, you can filter the movies by Production company. Hovering over the top "Production Company" line displays a search ability
    For example, you can deselect the box [All] and select only specific Production Companies you would like to see fit in the range of movies (less than or greater than 190 Million)-- like if you want to compare Disney movies against Dreamworks.
    Tabs 3 & 4 display the Top 10 Movies in terms of Budget, Revenue, Popularity, and Runtime.

    Tab 3 - Top 10 Movies (2008-2018) and their budget, revenue, popularity and runtime
  • You can see the comparisons of each variable between the movies presented by the bars on the right.
  • The bars are color coded according to their measurement values
  • Movies are sectioned by Year in chronological order.

  • Tab 4 - Top 10 movies and production companies (2008-2018) and their budget and revenue
  • The chart displays information of the Top 10 movies; Production Company, Year of Release, Title, Movie Tagline, and Budget & Revenue
  • Movies are listed in Alphabetical order by Production Company
    Tab 5 - Movies Dashboard
  • The movie dashboard tab holds each worksheet tab
  • Going through the visualization tab by tab instead of through the dashboard is preferred as it gives more room for each worksheet to b displayed.


  • (Note: there is a Production Company name in Tabs 1 & 2s filters titled " * ", I am not sure why the dataset had this but there are specific movies set with " * " as the Production Company)