2025-05-30T07:29:12+07:00

Scalable LLM Chatbot Backend

  • Developed a Retrieval-Augmented Generation (RAG) chatbot backend using LangChain, LangGraph, and FastAPI.

  • Integrated tool calling, database access, and agentic LLMs.

  • Supported both local (Ollama, vLLM) and API-based (OpenAI, Gemini) models.

  • Deployed on GCP with response times under 5 seconds.
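
One way the local and API-based model support listed above could be organized is a small provider registry that resolves a "provider:model" string to a backend. This is a minimal sketch under stated assumptions, not the project's actual code: `ModelBackend`, `BACKENDS`, and `resolve_backend` are hypothetical names, the "provider:model" convention is invented for illustration, and the base URLs are commonly used default OpenAI-compatible endpoints assumed here rather than taken from the project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelBackend:
    """Describes where a model is served (hypothetical structure)."""
    name: str
    base_url: str
    is_local: bool


# Hypothetical registry. The URLs are assumed defaults, not project settings.
BACKENDS = {
    "ollama": ModelBackend("ollama", "http://localhost:11434/v1", True),
    "vllm": ModelBackend("vllm", "http://localhost:8000/v1", True),
    "openai": ModelBackend("openai", "https://api.openai.com/v1", False),
    "gemini": ModelBackend(
        "gemini",
        "https://generativelanguage.googleapis.com/v1beta/openai/",
        False,
    ),
}


def resolve_backend(model_id: str) -> tuple[ModelBackend, str]:
    """Split 'provider:model' into a backend entry and a model name.

    Only the first colon separates the provider, so model names that
    contain colons themselves (e.g. 'llama3:8b') are preserved.
    """
    provider, sep, model = model_id.partition(":")
    if not sep or not model:
        raise ValueError(f"expected 'provider:model', got {model_id!r}")
    try:
        backend = BACKENDS[provider]
    except KeyError:
        raise ValueError(f"unknown provider: {provider!r}") from None
    return backend, model
```

With this shape, request-handling code only deals with a backend entry and a model name, so switching between a local and a hosted model is a configuration change rather than a code change.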