Scalable LLM Chatbot Backend

By Irfan Muhammad GhaniPublished On: 30 May 2025

Developed a Retrieval-Augmented Generation (RAG) chatbot backend using LangChain, LangGraph, FastAPI.
Integrated tool calling, database access, and agentic LLMs.
Supported both local (Ollama, vLLM) and API-based (OpenAI, Gemini) models.
Deployed on GCP with response times under 5 seconds.