Sai Yashwanth Dasari

01 · About

I'm a 2025 B.Tech graduate in AI/ML with a focus on Python backend engineering and data infrastructure. I like understanding how things actually work, not just at the surface, but the tradeoffs, the failure modes, and why certain decisions were made.

The intersection of AI and backend systems is where I want to be. Multi-agent pipelines, async infrastructure, systems where intelligence meets scale. Looking for roles where there's always something deeper to figure out.

Projects shipped

Open source PRs

300+

LeetCode problems

Internships

02 · Projects

Featured

Mini Web Crawler

May 2026

Production-grade async BFS web crawler with dual fetch strategy — aiohttp for static sites, automatic Playwright fallback for JS-heavy sites. robots.txt compliance with async per-domain caching. 3-table PostgreSQL schema with background job tracking and duplicate detection. Containerized and published on Docker Hub.

FastAPI Playwright PostgreSQL aiohttp Docker BeautifulSoup4 AsyncIO

GitHub Docker Hub

Insurance Claim Processing Pipeline

Apr 2026

Multi-agent LangGraph pipeline processing insurance PDFs end-to-end — classifies pages into 9 document types via Gemini vision, routes to specialized extraction agents. Exponential backoff retry logic, graceful JSON fallback, keyword-based mock layer for testing without API quota consumption.

LangGraph FastAPI Gemini API PyMuPDF Multi-agent

GitHub

Finance Access Control Backend

Apr 2026

Role-based finance dashboard backend with 3-tier RBAC (Admin, Analyst, Viewer) across 15+ REST endpoints. JWT authentication, bcrypt hashing, soft delete, dynamic filtering, and pagination. 33 passing pytest tests with dependency injection overrides and isolated test database.

FastAPI PostgreSQL SQLAlchemy JWT pytest RBAC

GitHub

Hotel Booking Analytics & RAG Q&A

Mid 2025

RAG pipeline over 119K+ hotel booking records — chunked and embedded domain documents with sentence-transformers (MiniLM-L6-v2), indexed via FAISS cosine similarity, Mistral-7B for natural language Q&A. Hybrid retrieval combining vector search with deterministic analytics for revenue, cancellation rate, and lead-time queries. 4 FastAPI endpoints with background embedding on startup; analytical reports returned as base64 charts.

FastAPI LangChain FAISS Mistral-7B sentence-transformers Matplotlib

GitHub

GhostDrop — Anonymous File Sharing

Jul 2025

Horizontally scalable file sharing platform with NGINX load balancing across 3 FastAPI instances. Supports 2GB chunked uploads. Automated self-cleanup via asyncio background tasks. One-command deployment with Docker Compose.

FastAPI NGINX Docker Compose AsyncIO SQLite

GitHub

Smart Task Analyzer

Nov 2025

Multi-criteria task prioritization system using Weighted Sum Model from MCDM theory with 4 prioritization strategies, dependency management with forward/backward handling and penalty multipliers. RESTful API with 100% pytest coverage.

Django DRF pytest JavaScript MCDM

GitHub ↗ Live Demo

02b · Early Projects

Not maintained · Early learning project

Multi-Modal AI Chatbot

Sep – Nov 2024

First exploration of multimodal API integration — Cohere for NLP, Imagga for image recognition. Built async middleware and a responsive JS frontend. May be broken on free-tier hosting.

Flask Cohere API Imagga API JavaScript

↗ Live Demo

Not maintained · Early learning project

Medical AI Chatbot

May 2024

First attempt at RAG — LangChain, vector database for semantic search, LLM APIs with response streaming. Where I learned what retrieval-augmented generation actually means in practice.

Python LangChain Vector DB Streamlit

↗ Live Demo

03 · Experience

Software Engineer Intern

Workcohol · Chennai, Tamil Nadu

Dec 2024 – Apr 2025

Built 15+ RESTful APIs using Django REST Framework for survey creation, response collection, and analytics in a team of 4
Optimized MySQL database queries achieving 40% faster response times with proper indexing and schema design
Implemented JWT-based authentication and RBAC securing 3 user roles across the platform

Deep Learning Research Intern

University of Hyderabad · Hyderabad, Telangana

Sep 2023 – Jan 2024

Authored research paper presented at IHCI 2023 international conference on Telugu song pattern recognition using GANs
Extracted audio features (MFCC, Chroma, Spectral Contrast) using Librosa and Praat; processed datasets with Pandas and NumPy
Implemented SVM classifier and GAN architecture for raga classification and emotion recognition

04 · Skills

Backend & APIs

Python, FastAPI, Django, DRF, Flask, Pydantic, RESTful API Design

Data & Crawling

Web Crawling, BFS Graph Traversal, aiohttp, BeautifulSoup4, Playwright, Data Pipelines, Pandas, NumPy

Databases

PostgreSQL, MySQL, SQLAlchemy ORM, Django ORM, Query Optimization, Schema Design, Indexing

Async & Concurrency

AsyncIO, aiohttp, Background Jobs, Concurrent Request Handling, Event-Driven Architecture

DevOps & Infrastructure

Docker, Docker Compose, NGINX Load Balancing, Git/GitHub, Linux, CI/CD concepts

AI & LLMs

LangChain, LangGraph, RAG Pipelines, FAISS, Gemini, Claude 3.5 Sonnet, Mistral-7B, Prompt Engineering

Testing & Security

pytest (90%+ coverage), Unit/Integration Testing, JWT, OAuth2, RBAC, API Security

Other

300+ LeetCode, DSA, Open Source (4 PRs to fastapi-boilerplate), Agile, Code Reviews