Scalable LLM Chatbot Backend

Published on: 30 May 2025
  • Developed a Retrieval-Augmented Generation (RAG) chatbot backend using LangChain, LangGraph, and FastAPI.

  • Integrated tool calling, database access, and agentic LLMs.

  • Supported both local (Ollama, vLLM) and API-based (OpenAI, Gemini) models.

  • Deployed on GCP with response times under 5 seconds.
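
The retrieve-then-generate flow and the switch between local and API-based models described above can be illustrated with a minimal sketch. It uses only the Python standard library as a stand-in and is not the project's actual LangChain or LangGraph code. Every name here (`Document`, `retrieve`, `build_prompt`, `ModelRegistry`, `answer`) is hypothetical, the word-overlap ranking is a placeholder for real vector search, and the registered backends are assumed to be plain functions that wrap whichever model client is in use.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical type: a backend takes a prompt string and returns generated text.
Generate = Callable[[str], str]


@dataclass
class Document:
    doc_id: str
    text: str


def retrieve(query: str, docs: List[Document], k: int = 2) -> List[Document]:
    """Rank documents by word overlap with the query.

    A placeholder for embedding-based vector search; documents with
    no overlapping words are dropped.
    """
    query_words = set(query.lower().split())
    scored = [(len(query_words & set(d.text.lower().split())), d) for d in docs]
    scored = [pair for pair in scored if pair[0] > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored[:k]]


def build_prompt(query: str, context: List[Document]) -> str:
    """Place the retrieved passages ahead of the user's question."""
    joined = "\n".join(f"- {d.text}" for d in context) or "- (no context found)"
    return f"Context:\n{joined}\n\nQuestion: {query}"


class ModelRegistry:
    """Maps a backend name (e.g. a local or hosted model) to a generate function."""

    def __init__(self) -> None:
        self._backends: Dict[str, Generate] = {}

    def register(self, name: str, fn: Generate) -> None:
        self._backends[name] = fn

    def get(self, name: str) -> Generate:
        if name not in self._backends:
            raise KeyError(f"unknown backend: {name}")
        return self._backends[name]


def answer(query: str, docs: List[Document], registry: ModelRegistry, backend: str) -> str:
    """Retrieve context, build the prompt, and call the chosen backend."""
    prompt = build_prompt(query, retrieve(query, docs))
    return registry.get(backend)(prompt)


if __name__ == "__main__":
    corpus = [
        Document("a", "FastAPI serves the chat endpoint"),
        Document("b", "Ollama runs local models"),
    ]
    registry = ModelRegistry()
    # Stub backend that returns the prompt unchanged; a real one would call a model.
    registry.register("stub", lambda prompt: prompt)
    print(answer("which local models run", corpus, registry, "stub"))
```

Keeping each backend behind the same one-argument callable means the retrieval and prompt-building steps do not need to know whether the model runs locally or behind a hosted API.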