YouTube Sentiment Analysis Platform

Python, Streamlit, Transformers, VADER, PyTorch, Plotly, YouTube Data API, Audience analytics

Business context

Digital teams often have plenty of audience comments and very little structured insight. This project was built to show the full path from data collection to scalable sentiment analysis, with deployment choices shaped by the constraints of lightweight hosting.

Outcome

Collected and processed 114,109 comments from a high-profile YouTube video.
Produced a sentiment split of 78.8% negative and 21.2% positive in the analyzed dataset.
Combined VADER for large-scale preprocessing with transformer models for richer interactive inference.
Delivered a multi-page Streamlit dashboard for prediction, exploration, and research views.
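A sentiment split like the one reported above can be derived from per-comment scores in a few lines. The sketch below is illustrative only: it assumes each comment already has a VADER-style compound score in [-1, 1], and it assumes a binary cut at a hypothetical threshold of 0.0. The project does not state how neutral scores were handled, so the function names and the threshold are assumptions, not the project's actual rule.

```python
from typing import Dict, Iterable


def label_from_compound(compound: float, threshold: float = 0.0) -> str:
    """Map a compound score to a binary label.

    The threshold is an assumption for illustration; scores at or above it
    count as positive, everything else as negative.
    """
    return "positive" if compound >= threshold else "negative"


def sentiment_split(compounds: Iterable[float]) -> Dict[str, float]:
    """Return the percentage of negative and positive labels, rounded to 0.1."""
    counts = {"negative": 0, "positive": 0}
    for score in compounds:
        counts[label_from_compound(score)] += 1
    total = sum(counts.values())
    if total == 0:
        # No comments scored: report an empty split rather than dividing by zero.
        return {"negative": 0.0, "positive": 0.0}
    return {label: round(100 * n / total, 1) for label, n in counts.items()}
```

Keeping the labeling rule in its own function makes the threshold easy to change if a neutral band or a different cut-off turns out to fit the data better.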

Key decisions

Built the full data pipeline from API collection through preprocessing and inference.
Used lighter methods for bulk processing and transformer models where deeper inference mattered.
Added environment detection to switch between model sizes for local versus cloud deployment.
Optimized the product around Streamlit Community Cloud constraints instead of pretending every environment supports a full local stack.
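The environment-based model switch described above could look roughly like the following. This is a minimal sketch under stated assumptions: the `APP_ENV` variable is hypothetical (the project's actual detection signal is not given), and the two Hugging Face model names are illustrative stand-ins for "smaller" and "larger" checkpoints, not the models the project uses.

```python
import os
from typing import Mapping, Optional

# Illustrative checkpoints only; the project's real model choices are not stated.
SMALL_MODEL = "distilbert-base-uncased-finetuned-sst-2-english"
LARGE_MODEL = "siebert/sentiment-roberta-large-english"


def is_cloud(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when the hypothetical APP_ENV variable is set to 'cloud'."""
    env = os.environ if env is None else env
    return env.get("APP_ENV", "local").lower() == "cloud"


def pick_model(env: Optional[Mapping[str, str]] = None) -> str:
    """Choose the smaller model for constrained cloud hosting, the larger one locally."""
    return SMALL_MODEL if is_cloud(env) else LARGE_MODEL
```

The returned name could then be passed to `transformers.pipeline("sentiment-analysis", model=pick_model())`, so the rest of the dashboard code does not need to know which environment it is running in.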

System design

The system collects comments through the YouTube API, cleans and structures the dataset, runs scalable preprocessing and sentiment layers, and then exposes the outputs through interactive dashboard pages for exploration and targeted inference.
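The collection and cleaning steps can be sketched as a paginated loop over the YouTube Data API v3 `commentThreads.list` endpoint, which returns an `items` list and a `nextPageToken` while more pages remain. The helper names below are hypothetical, and `fetch_page` is an injected callable (an assumption made so the loop can run without network access); the cleaning shown is only whitespace normalization, not the project's full preprocessing.

```python
from typing import Callable, Iterator, Optional
from urllib.parse import urlencode

API_URL = "https://www.googleapis.com/youtube/v3/commentThreads"


def build_url(video_id: str, api_key: str, page_token: Optional[str] = None) -> str:
    """Build a commentThreads.list request URL for one page of top-level comments."""
    params = {
        "part": "snippet",
        "videoId": video_id,
        "maxResults": 100,  # the API's per-page maximum
        "textFormat": "plainText",
        "key": api_key,
    }
    if page_token:
        params["pageToken"] = page_token
    return f"{API_URL}?{urlencode(params)}"


def iter_comments(fetch_page: Callable[[Optional[str]], dict]) -> Iterator[str]:
    """Yield cleaned top-level comment texts, following nextPageToken until exhausted.

    fetch_page takes a page token (None for the first page) and returns the
    decoded JSON response as a dict.
    """
    token: Optional[str] = None
    while True:
        page = fetch_page(token)
        for item in page.get("items", []):
            text = item["snippet"]["topLevelComment"]["snippet"]["textDisplay"]
            cleaned = " ".join(text.split())  # collapse runs of whitespace
            if cleaned:
                yield cleaned
        token = page.get("nextPageToken")
        if not token:
            break
```

In a real run, `fetch_page` would request `build_url(video_id, api_key, token)` and decode the JSON body; injecting it keeps the pagination logic separate from HTTP and quota handling.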

Stack

Python, YouTube Data API, VADER, transformers, and PyTorch
Streamlit and Plotly for the product surface
Data collection, preprocessing, sentiment inference, and deployment-aware model routing

Wes Lee