ProjectsFebruary 1, 2024COVID-19 Clustering Analysis
This data mining project compares different clustering algorithms to analyze COVID-19 transmission patterns. By applying spectral clustering, K-means, and KNN clustering techniques, the analysis identifies relationships between peaks in transmission across different regions.
- Algorithm Comparison: Evaluate performance of different clustering methods
- Pattern Discovery: Identify clusters of similar transmission patterns
- Peak Analysis: Understand the timing and relationships between transmission peaks
- Methodology Insights: Determine which clustering approach works best for epidemiological data
- Data Collection: COVID-19 case data from multiple regions over time
- Preprocessing: Time series normalization and feature extraction
- Clustering Algorithms: Implementation of K-means, KNN, and Spectral Clustering
- Evaluation: Silhouette scores and cluster quality metrics
- K-means Clustering: Traditional centroid-based clustering for pattern grouping
- KNN Clustering: Density-based approach using nearest neighbor relationships
- Spectral Clustering: Graph-based method capturing complex cluster structures
- Python: Primary programming language
- Scikit-learn: Clustering algorithm implementations
- NumPy & Pandas: Data manipulation
- Visualization: Cluster visualization and comparison charts
Spectral clustering proved most effective at capturing non-linear relationships between transmission patterns, revealing geographic and temporal clusters that traditional methods missed. Related projects
Airline Revenue Optimization System
Designed a predictive ML system for airline passenger no-shows to maximize overbooking revenue using cost-sensitive learning and Monte Carlo simulationsF1 AI Race Predictor
Built an end-to-end race prediction platform using historical race data, weather, driver performance, and qualifying results, achieving 68.5% accuracyPortfolio Optimization Dashboard
Designed a full-stack investment optimization system supporting strategies like Markowitz, Black-Litterman, and Risk Parity, with real-time analytics dashboardsNashville Airbnb Data Analysis
Enhancing InsideAirbnb.com with Predictive Analytics on Nashville Listings: A Data-Driven Approach to Price and Rating PredictionsRestaurant Review Data Dive
Uncovering Customer Satisfaction Drivers through Sentiment Analysis and Predictive Modeling of Restaurant Reviews across Multiple StatesCar Sales Data Dive
Predicting Car Sale Prices through Advanced Data Cleaning, Feature Engineering, and Regression ModelingLyft Market Analysis
Lyft Market Expansion Strategies and Optimization in Washington D.C. using Tableau visualizationBird Data Analysis
Exploratory data analysis and pattern recognition in bird observation datasets