IT 415 Final Project
Final Project Visualization: A closer look at movies and their budgets, revenue, and popularity from the years 2008-2018
Lealani Saulo
My finalized topic choice is the "Movies Dataset," which consists of metadata on over 45,000 movies. I found this dataset on Kaggle; it contains data on movies released on or before July 2017 and was last updated a year ago. The data was originally captured by MovieLens.
For this page, I embedded the visualization I created using Tableau Public.
Another part of my final project is a timeline visualization of the highest- and lowest-grossing films for each year from 2008-2017, built with the tool Timetoast.
- The data points in this dataset include:
- movie keywords
- budget
- revenue
- posters
- release dates
- languages
- production companies
- countries
- TMDB vote counts and vote averages
The dataset also includes files with around 26 million ratings from 270,000 users for all 45,000 movies. Ratings are on a scale of 1-5 and were captured from the official GroupLens website. The dataset listed on Kaggle includes the following files: movies_metadata.csv, keywords.csv, credits.csv, links.csv, links_small.csv, and ratings_small.csv.
For my final project, I used the movies_metadata.csv and keywords.csv files and focused on movies released in the years 2008-2018.
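The visualization itself was built in Tableau, but the year filter described above could be sketched outside Tableau roughly as follows. This is only an illustration: it assumes movies_metadata.csv has a `release_date` column formatted as YYYY-MM-DD, the helper name `filter_by_year` is hypothetical, and the inline sample rows are made up rather than taken from the dataset.

```python
import csv
import io


def filter_by_year(rows, first_year=2008, last_year=2018):
    """Keep rows whose release_date (assumed YYYY-MM-DD) falls in the year range."""
    kept = []
    for row in rows:
        date = (row.get("release_date") or "").strip()
        year = date[:4]
        # Skip rows with a missing or malformed release date.
        if len(date) < 4 or not year.isdigit():
            continue
        if first_year <= int(year) <= last_year:
            kept.append(row)
    return kept


# Tiny inline sample standing in for movies_metadata.csv (values are made up).
SAMPLE_CSV = """title,release_date
Example A,2007-12-31
Example B,2008-01-01
Example C,2017-07-01
Example D,
"""

sample_rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
selected = filter_by_year(sample_rows)
print([row["title"] for row in selected])
```

With the real file, the same function could be fed from `csv.DictReader(open("movies_metadata.csv", encoding="utf-8", newline=""))` instead of the inline sample.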
- Tabs 1 & 2 display a treemap in which the size of each box depends on the total revenue the movie generated and the intensity of the color depends on the movie's budget.
- For Tabs 1 & 2, hover over a specific block to display a tooltip with information on the movie: Movie Title, Production Company, Release Date, Budget, and Revenue.
- On the right, you can filter the movies by production company. Hovering over the top "Production Company" line reveals a search option.
For example, you can deselect the [All] box and select only the production companies you would like to see within that tab's budget range (less than or greater than 190 million), such as when you want to compare Disney movies against DreamWorks.
Tab 1 - Movies with budgets less than 190 million
Tab 2 - Movies with budgets more than 190 million
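As a rough sketch of how the two budget groups behind Tabs 1 & 2 could be reproduced outside Tableau, the snippet below splits rows at the 190 million mark. It assumes a `budget` column holding plain numeric strings; `split_by_budget` and `parse_amount` are hypothetical helper names, and the sample rows are invented for illustration.

```python
BUDGET_THRESHOLD = 190_000_000  # the 190 million cutoff described for Tabs 1 & 2


def parse_amount(value):
    """Return a numeric string as a float, or None if it cannot be parsed."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def split_by_budget(rows, threshold=BUDGET_THRESHOLD):
    """Split rows into (below threshold, above threshold) by their budget."""
    below, above = [], []
    for row in rows:
        budget = parse_amount(row.get("budget"))
        if budget is None:
            continue  # missing or non-numeric budget
        if budget < threshold:
            below.append(row)
        elif budget > threshold:
            above.append(row)
        # A budget exactly equal to the threshold matches neither tab description.
    return below, above


# Made-up rows for illustration only.
sample = [
    {"title": "Small", "budget": "50000000"},
    {"title": "Large", "budget": "250000000"},
    {"title": "Unknown", "budget": ""},
]
below, above = split_by_budget(sample)
print([row["title"] for row in below], [row["title"] for row in above])
```

Note that, read literally, "less than" and "more than" leave a movie with a budget of exactly 190 million in neither group; the sketch keeps that behavior rather than guessing which tab it belongs to.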
- Tabs 3 & 4 display the Top 10 Movies in terms of Budget, Revenue, Popularity, and Runtime.
- The bars on the right let you compare each variable across the movies.
- The bars are color coded according to their measurement values.
- Movies are sectioned by year in chronological order.
- The chart displays the following information on the Top 10 movies: Production Company, Year of Release, Title, Movie Tagline, and Budget & Revenue.
- Movies are listed in alphabetical order by production company.
Tab 3 - Top 10 Movies (2008-2018) and their budget, revenue, popularity and runtime
Tab 4 - Top 10 movies and production companies (2008-2018) and their budget and revenue
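The Top 10 selection shown in Tabs 3 & 4 was done in Tableau; a minimal sketch of a comparable ranking step in plain Python might look like this. The function name `top_n` is hypothetical, the measure names (such as `revenue`) are assumed to match columns in movies_metadata.csv, and the sample values are made up.

```python
def top_n(rows, measure, n=10):
    """Return the n rows with the largest numeric value for the given measure."""

    def value(row):
        try:
            return float(row.get(measure))
        except (TypeError, ValueError):
            return None  # missing or non-numeric values are ignored

    scored = [(value(row), row) for row in rows]
    scored = [(v, row) for v, row in scored if v is not None]
    # Sort from largest to smallest; ties keep their original order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [row for _, row in scored[:n]]


# Made-up rows for illustration only.
sample_movies = [
    {"title": "Low", "revenue": "1000"},
    {"title": "High", "revenue": "9000"},
    {"title": "Mid", "revenue": "5000"},
    {"title": "Missing", "revenue": ""},
]
top_two = top_n(sample_movies, "revenue", n=2)
print([row["title"] for row in top_two])
```

Calling the same function with a different measure, for example `top_n(rows, "budget")`, would give the ranking for that variable, assuming such a column exists.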
- Tab 5 - Movies Dashboard
- The movie dashboard tab brings all of the worksheet tabs together.
- Going through the visualization tab by tab instead of through the dashboard is preferred, as it gives each worksheet more room to be displayed.
(Note: the filters in Tabs 1 & 2 include a Production Company named " * ". I am not sure why the dataset has this, but specific movies are listed with " * " as the Production Company.)