Scalable LLM Chatbot Backend
Developed a Retrieval-Augmented Generation (RAG) chatbot backend using LangChain, LangGraph, and FastAPI.
Integrated tool calling, database access, and agentic LLMs.
Supported both local (Ollama, vLLM) and API-based (OpenAI, Gemini) models.
Deployed on GCP with response times under 5 seconds.
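
Illustrative sketch (not the project's actual code): one way a backend can retrieve context and then route the prompt to either a local or an API-based model behind a single interface. All names here (ModelRouter, retrieve, answer) are hypothetical, the retriever is a toy word-overlap ranker rather than a vector store, and the model backends are stubbed as plain functions instead of real Ollama, vLLM, OpenAI, or Gemini clients.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# A backend is any function that maps a prompt to a completion.
# In a real system these would wrap local or hosted model clients (assumption).
Generator = Callable[[str], str]


@dataclass
class ModelRouter:
    """Dispatches prompts to a named backend (hypothetical helper)."""
    backends: Dict[str, Generator]

    def generate(self, backend: str, prompt: str) -> str:
        if backend not in self.backends:
            raise KeyError(f"unknown backend: {backend}")
        return self.backends[backend](prompt)


def retrieve(query: str, documents: List[str], k: int = 2) -> List[str]:
    """Toy retriever: rank documents by how many words they share with the query."""
    query_words = set(query.lower().split())
    ranked = sorted(
        documents,
        key=lambda doc: len(query_words & set(doc.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def answer(router: ModelRouter, backend: str, query: str, documents: List[str]) -> str:
    """Build a context-augmented prompt and send it to the chosen backend."""
    context = "\n".join(retrieve(query, documents))
    prompt = f"Context:\n{context}\n\nQuestion: {query}"
    return router.generate(backend, prompt)


# Stub backends standing in for a local model and a hosted API model.
router = ModelRouter(
    backends={
        "local": lambda prompt: "local:" + prompt,
        "api": lambda prompt: "api:" + prompt,
    }
)
docs = [
    "FastAPI serves the chat endpoint",
    "Ollama runs local models",
    "GCP hosts the deployment",
]
reply = answer(router, "local", "which local models", docs)
```

Keeping every backend behind the same callable signature means the retrieval and prompt-building steps stay unchanged when the model provider is switched.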