michaelcintolo.info

michaelcintolo.infomichaelcintolo.infomichaelcintolo.info

michaelcintolo.info

michaelcintolo.infomichaelcintolo.infomichaelcintolo.info
  • Home
  • About
  • Project
  • Visualizations
  • More
    • Home
    • About
    • Project
    • Visualizations
  • Home
  • About
  • Project
  • Visualizations

Welcome to My Project

Video and Audio Collection

After earning my Google Data Analytics certification, I was eager to embark on a personal project to showcase my skills and align with my interests. The journey began with a simple goal: to engage my brother's passion for Film Noir movies and introduce him to the convenience of digital movie collections. Little did I know that this endeavor would grow into a multifaceted exploration of entertainment data.


As my brother enthusiastically embraced his newfound digital movie library, my mother joined the journey, reminiscing about the old-time radio shows she cherished as a young girl. Our collection expanded, encompassing a rich tapestry of movies, radio shows, music, and more. Yet, the eclectic mix lacked structure and completeness.


My background in State Reporting, where I transformed a chaotic department into an efficient operation, provided valuable insights. I realized that organization, standardization, and meticulous documentation were crucial to success. In the same way, I set out to bring order to our entertainment collection.


I established standard naming conventions for each category within our vast collection. Retrieving the required information for these conventions was a challenging but enlightening journey. I delved into research, often delving into the nuances of various sources to ensure accuracy. Python programs became my trusted companions, assisting in data retrieval and cleaning.


With naming conventions in hand, the next step was to create a structured database for our data. CSV files served as temporary repositories, but I soon realized the need for a more robust solution. I built a MySQL database with dedicated tables for each file type and developed Python scripts to process and populate the database.


My GitHub repository  houses the Python programs that drove this project. It represents my commitment to data organization, problem-solving, and technical proficiency.


This journey has honed my data analytics skills and deepened my appreciation for the power of data in creating order from chaos. I'm excited to leverage these experiences and skills in future data science endeavors.


Project Technical Specifications

Naming Conventions

There are six collections: Audiobooks, Cartoons, Movies, Music, Radio Shows, and TV Shows. 


  1. Audiobooks: The folder is "Book_Title by Author." The file name is "Book_Title (Chapter_Number) - Chapter_Title.File_Ext".
  2. Cartoons: The Folder is "Show_Title." The file is "Show_title (Production_Year) S00E00 - Episode_Title.File_Ext". "S" is for Season Number, and "E" is for Episode Number. Not all files have a Season Number and Episode Number. Some will have neither. Cartoon movies only have Show_Title and Production_Year and possibly a tag [Animated] after Production_Year and before File_Ext.
  3. Movies: All Movie files are in one main folder, so there is no folder-specific naming convention. The file name is "Movie_Title (Prod_Year) Actor_Name, Actor_Name [Genre, Genre].File_Ext". There may be one or many actors separated by a comma. There may be one or many genres separated by a comma.
  4. Music: The Folder is "Artist_Name." The file name is "Artist_Name (Release_Year) Album_Title - Song_Title.File_Ext" for albums. "Artist_Name (Release_Year) - Song_Title.File_Ext" for a single.
  5. Radio Shows: The Folder is "Show_Title." The file is "Show_title (Production_Year) S00E00 - Episode_Title.File_Ext". "S" is for Season Number, and "E" is for Episode Number. Not all files have a Season Number and Episode Number. Some will have neither.
  6. TV Shows: The Folder is "Show_Title." The file is "Show_Title (Production_Year) S00E00 - Episode_Title.Ext". "S" is for Season Number, and "E" is for Episode Number. Not all files have a Season Number. All files will have an Episode Number, except the Pilot episode, designated by zero.


Database Design

Tools

Batch Renaming: CoreRenamer 4.5.0 and KRename 5.0.2.

Python Programming: Python 3.11, Spyder IDE 5.4.3.

R Programming: R 4.3.2, RStudio IDE 2023.09.1.

Visualization: Tableau Public.

Backup: FreeFileSync 13.0.

Database: MySQL Workbench 8.0.

Video Editing: FFmpeg 6.0 and VidCutter 6.0.5.1.

MetaData: MetaData Cleaner 2.5.4 and VLC Media Player 3.0.19.


Copyright © 2025 michaelcintolo.info - All Rights Reserved.

Powered by

This website uses cookies.

We use cookies to analyze website traffic and optimize your website experience. By accepting our use of cookies, your data will be aggregated with all other user data.

Accept