This project showcases my proficiency in relational database design, ER modeling, and SQL. It demonstrates my ability to translate complex data into well-structured database systems with secure access and optimized querying.
I combined four streaming platform datasets (Netflix, Prime, Hulu, Disney) into one unified structure containing titles, directors, genres, countries, release dates, ratings, and descriptions.
Appendix A: Sample from merged base dataset
Appendix C: Original flat ER model
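The merge step can be sketched with Python's built-in sqlite3 module. This is only an illustration under stated assumptions: the table names, the column layout, and the placeholder rows are hypothetical and do not come from the original files. The idea is to stack the four sources with UNION ALL and tag each row with the platform it came from.

```python
import sqlite3

# Hypothetical column layout; the real source files may differ.
COLUMNS = "title, director, genre, country, release_date, rating, description"
PLATFORMS = ["netflix", "prime", "hulu", "disney"]

conn = sqlite3.connect(":memory:")
for name in PLATFORMS:
    conn.execute(f"CREATE TABLE {name} ({COLUMNS})")
    # One placeholder row per platform (made-up values, not real data).
    conn.execute(
        f"INSERT INTO {name} VALUES (?, ?, ?, ?, ?, ?, ?)",
        (f"Title on {name}", "Director X", "Drama", "Country Y",
         "2020-01-01", "PG", "placeholder"),
    )

# Stack the four tables and tag each row with its source platform.
union_sql = " UNION ALL ".join(
    f"SELECT '{name}' AS platform, {COLUMNS} FROM {name}" for name in PLATFORMS
)
conn.execute(f"CREATE TABLE merged AS {union_sql}")

merged_rows = conn.execute(
    "SELECT platform, title FROM merged ORDER BY platform"
).fetchall()
```

Keeping the platform as an explicit column means no information is lost in the merge, and it later becomes the natural key for a separate host table.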
The original dataset was not normalized. I restructured it to third normal form (3NF) and modeled the cardinalities, primary keys (PKs), and foreign keys (FKs) of the resulting tables.
Appendix G: Final normalized ER diagram
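As a rough sketch of what a normalized layout of this kind might look like, the snippet below defines a few tables with primary and foreign keys in SQLite. The table and column names (host, director, content, content_director) are assumptions chosen for illustration and are not taken from the final ER diagram. A junction table models the many-to-many relationship between titles and directors.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces FKs only when enabled
conn.executescript("""
CREATE TABLE host (
    host_id INTEGER PRIMARY KEY,
    name    TEXT NOT NULL UNIQUE
);
CREATE TABLE director (
    director_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL UNIQUE
);
CREATE TABLE content (
    content_id   INTEGER PRIMARY KEY,
    title        TEXT NOT NULL,
    release_date TEXT,
    host_id      INTEGER NOT NULL REFERENCES host(host_id)
);
-- Many-to-many: a title can have several directors and vice versa.
CREATE TABLE content_director (
    content_id  INTEGER NOT NULL REFERENCES content(content_id),
    director_id INTEGER NOT NULL REFERENCES director(director_id),
    PRIMARY KEY (content_id, director_id)
);
""")

# Placeholder rows (made up) to show the foreign key doing its job.
conn.execute("INSERT INTO host (host_id, name) VALUES (1, 'Host A')")
conn.execute("INSERT INTO content (title, host_id) VALUES ('Title A', 1)")
try:
    # host_id 99 does not exist, so this insert should be rejected.
    conn.execute("INSERT INTO content (title, host_id) VALUES ('Orphan', 99)")
    fk_enforced = False
except sqlite3.IntegrityError:
    fk_enforced = True

content_count = conn.execute("SELECT COUNT(*) FROM content").fetchone()[0]
```

Each non-key column depends only on its own table's key, which is the property 3NF is after; repeated strings such as host names live in one place and are referenced by ID.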
Below are steps for creating the database schema, inserting data, setting privileges, and optimizing the system.
Find which host aired the most content by a specific director, within a given date range and country.
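A query of this kind might be written as follows. This is a minimal sketch: the schema, the placeholder rows, and the choice of release_date as the date being filtered are all assumptions made for the example, not the project's actual definitions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE host (host_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE director (director_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE content (
    content_id INTEGER PRIMARY KEY, title TEXT, release_date TEXT,
    country TEXT, host_id INTEGER REFERENCES host(host_id)
);
CREATE TABLE content_director (
    content_id INTEGER REFERENCES content(content_id),
    director_id INTEGER REFERENCES director(director_id),
    PRIMARY KEY (content_id, director_id)
);
-- Placeholder rows, made up for the demonstration.
INSERT INTO host VALUES (1, 'Host A'), (2, 'Host B');
INSERT INTO director VALUES (1, 'Director X');
INSERT INTO content VALUES
    (1, 'T1', '2019-05-01', 'Country Y', 1),
    (2, 'T2', '2020-03-15', 'Country Y', 1),
    (3, 'T3', '2020-07-20', 'Country Y', 2),
    (4, 'T4', '2015-01-01', 'Country Y', 2);
INSERT INTO content_director VALUES (1, 1), (2, 1), (3, 1), (4, 1);
""")

# Count titles per host for one director, country, and date window,
# then keep the host with the highest count.
QUERY = """
SELECT h.name, COUNT(*) AS n_titles
FROM content AS c
JOIN host AS h ON h.host_id = c.host_id
JOIN content_director AS cd ON cd.content_id = c.content_id
JOIN director AS d ON d.director_id = cd.director_id
WHERE d.name = :director
  AND c.country = :country
  AND c.release_date BETWEEN :date_from AND :date_to
GROUP BY h.host_id, h.name
ORDER BY n_titles DESC
LIMIT 1
"""
top_host = conn.execute(QUERY, {
    "director": "Director X", "country": "Country Y",
    "date_from": "2018-01-01", "date_to": "2020-12-31",
}).fetchone()
```

Dates are stored as ISO 8601 strings here so that BETWEEN compares them correctly as text. With LIMIT 1, a tie between hosts returns only one of them.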
Determine top genres by host in a specific country and time range.
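One way to express this, again as a sketch over a hypothetical schema with made-up placeholder rows, is to group by host and genre and sort each host's genres by title count. The genre and content_genre tables are assumed names.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE host (host_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE genre (genre_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE content (
    content_id INTEGER PRIMARY KEY, release_date TEXT, country TEXT,
    host_id INTEGER REFERENCES host(host_id)
);
CREATE TABLE content_genre (
    content_id INTEGER REFERENCES content(content_id),
    genre_id INTEGER REFERENCES genre(genre_id),
    PRIMARY KEY (content_id, genre_id)
);
-- Placeholder rows, made up for the demonstration.
INSERT INTO host VALUES (1, 'Host A'), (2, 'Host B');
INSERT INTO genre VALUES (1, 'Drama'), (2, 'Comedy');
INSERT INTO content VALUES
    (1, '2020-01-01', 'Country Y', 1),
    (2, '2020-06-01', 'Country Y', 1),
    (3, '2021-02-01', 'Country Y', 1),
    (4, '2020-03-01', 'Country Y', 2),
    (5, '2020-04-01', 'Country Z', 1);
INSERT INTO content_genre VALUES (1, 1), (2, 1), (2, 2), (3, 2), (4, 2), (5, 1);
""")

# Titles per (host, genre) within one country and date window,
# most frequent genre first within each host.
QUERY = """
SELECT h.name AS host, g.name AS genre, COUNT(*) AS n_titles
FROM content AS c
JOIN host AS h ON h.host_id = c.host_id
JOIN content_genre AS cg ON cg.content_id = c.content_id
JOIN genre AS g ON g.genre_id = cg.genre_id
WHERE c.country = :country
  AND c.release_date BETWEEN :date_from AND :date_to
GROUP BY h.name, g.name
ORDER BY h.name, n_titles DESC, g.name
"""
genre_ranking = conn.execute(QUERY, {
    "country": "Country Y",
    "date_from": "2020-01-01", "date_to": "2020-12-31",
}).fetchall()
```

A title with several genres counts once per genre, which is usually what a "top genres" report wants.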
Get the latest hosted content by a provider in a given region.
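This lookup could be sketched as a filtered, date-ordered query. The schema, the placeholder rows, the index name, and the decision to treat release_date as the measure of "latest" are assumptions for illustration. The composite index is one plausible way to support this access pattern, not necessarily the optimization used in the project.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE host (host_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE content (
    content_id INTEGER PRIMARY KEY, title TEXT, release_date TEXT,
    country TEXT, host_id INTEGER REFERENCES host(host_id)
);
-- Hypothetical index matching the filter columns and the sort column.
CREATE INDEX idx_content_host_country_date
    ON content (host_id, country, release_date);
-- Placeholder rows, made up for the demonstration.
INSERT INTO host VALUES (1, 'Host A'), (2, 'Host B');
INSERT INTO content VALUES
    (1, 'T1', '2019-05-01', 'Country Y', 1),
    (2, 'T2', '2020-03-15', 'Country Y', 1),
    (3, 'T3', '2021-07-20', 'Country Y', 1),
    (4, 'T4', '2022-01-01', 'Country Y', 2);
""")

# Newest titles first for one host in one country, capped at :n rows.
QUERY = """
SELECT c.title, c.release_date
FROM content AS c
JOIN host AS h ON h.host_id = c.host_id
WHERE h.name = :host AND c.country = :country
ORDER BY c.release_date DESC
LIMIT :n
"""
latest = conn.execute(
    QUERY, {"host": "Host A", "country": "Country Y", "n": 2}
).fetchall()
```

Because the index ends with release_date, the database can read the newest matching rows directly instead of sorting every title for that host and country.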