Projects

✍️ Writing: Shipping Enterprise RAG (4-part series)

Lessons from building a production RAG assistant for a regulated enterprise

A deep-dive series on what actually breaks in production RAG systems — and how to engineer around it. Drawn from shipping a RAG assistant over thousands of internal documents with real compliance stakes.

Part 1 — Stop optimizing your RAG prompt. The bug is two layers up. Why most wrong RAG answers are retrieval failures, not prompt failures.
Part 2 — Chunking strategies that beat the default (coming soon)
Part 3 — Hybrid retrieval: BM25 + vector + reranking (coming soon)
Part 4 — Production reality check: evals, observability, cost (coming soon)
Read Part 1 on this site Read on Substack & subscribe

Full-Stack Lead Assignment System

Production lead-routing platform — owned end to end, from data model to UI

A company-wide platform that automates how inbound leads are scored, routed, and tracked across sales teams — replacing a manual, error-prone hand-off process. I owned the full stack: the SQL Server data model, the FastAPI services, the business-rules engine, and the React dashboard the team uses every day.

The problem. Leads were distributed by hand, which meant slow response times, uneven workloads, and no visibility into where deals stalled. The business needed routing that was fast, fair, and configurable without a developer in the loop.

What I built.

A configurable business-rules engine so operations can change routing logic (territory, capacity, priority, round-robin) without code changes
Real-time dashboards for assignment status, team workload, and response-time analytics
A FastAPI backend exposing RESTful services over an MS SQL Server data model, with a React front end
Deployed and operated on Azure

Architecture: React (UI) → FastAPI (REST services + rules engine) → MS SQL Server, hosted on Azure.

Tech Stack: Python, FastAPI, React, JavaScript, MS SQL Server, RESTful APIs, Azure
Role: Sole full-stack engineer — data model, backend, rules engine, frontend, deployment
Impact: Automated lead distribution, faster and more even response times, and live visibility into the pipeline for the sales team

RAG Systems & SQL AI Agents

Intelligent AI systems for enhanced information retrieval and database queries

Developed production-ready RAG (Retrieval-Augmented Generation) systems and SQL AI agents that enable natural language interaction with databases and document repositories.

Tech Stack: Python, FastAPI, LangChain, LangGraph, Vector Databases, OpenAI API, MS SQL Server
Key Features: Semantic search, natural language to SQL translation, context-aware responses
Impact: Enabled non-technical users to query complex databases, improved information accessibility
Techniques: Embeddings, vector similarity search, prompt engineering

Spotify Music Recommendation API

Machine learning-powered music discovery and recommendation system

Built a comprehensive Spotify API wrapper with custom ML-based song recommendations. Implemented K-Nearest Neighbors algorithm to suggest similar songs based on audio features like danceability, energy, and acousticness.

Tech Stack: Python, Flask, Spotify API, scikit-learn, Heroku
Features: Track search, audio feature analysis, ML-powered recommendations
Results: Successfully deployed 3 production APIs serving recommendations
GitHub Blog Post

Climate Change Visualization Dashboard

Interactive global temperature visualization platform

Created an interactive web application to visualize historical climate data across countries and time periods. Features animated choropleth maps showing temperature changes from 1750 to 2013.

Tech Stack: Python, Plotly, Pandas, Flask, Heroku
Features: Interactive time-series animations, country-level analysis, Celsius/Fahrenheit conversion
Impact: Made complex climate data accessible and engaging
GitHub Blog Post

Heart Disease Prediction Model

Machine learning model with feature importance visualization

Developed a classification model to predict heart disease using patient health metrics. Implemented comprehensive feature importance analysis to identify key health indicators.

Tech Stack: Python, scikit-learn, XGBoost, Flask, Plotly
Results: Achieved 95%+ prediction accuracy
Features: Interactive web interface, real-time predictions, feature importance visualization
GitHub Medium Article

K-Nearest Neighbors from Scratch

Educational implementation of KNN algorithm

Built KNN classification algorithm from scratch using only NumPy, including custom StandardScaler implementation. Demonstrated on breast cancer dataset classification.

Tech Stack: Python, NumPy
Achievement: 97.6% accuracy on breast cancer classification
Purpose: Deep understanding of ML fundamentals, educational resource
GitHub Blog Post

Data Storytelling with Python

Visual analytics and exploratory data analysis

Series of data analysis projects using Miami-Dade County open data, focusing on meaningful insights and compelling data visualizations.

Tech Stack: Python, Pandas, Matplotlib, Seaborn, Plotly
Focus: EDA, statistical analysis, data visualization, storytelling
Medium Article

More Projects

Check out my GitHub profile for more projects and contributions, or read my blog posts for detailed technical writeups and tutorials.

Last updated: June 2026