Semantic discovery at streaming scale
A RAG-based content discovery architecture pairing vector retrieval with tuned generation for more relevant, explainable recommendations.
From retrieval systems to production ML infrastructure, I turn emerging AI capability into dependable products at scale.
Representative production work. Client-sensitive implementation details are intentionally abstracted.
A RAG-based content discovery architecture pairing vector retrieval with tuned generation for more relevant, explainable recommendations.
A production fraud detection system combining BERT language signals with XGBoost decisioning for high-volume banking workflows.
A transformer-powered support experience built to resolve common requests quickly and hand off complex conversations cleanly.

I’m an AI/LLM engineer with 5+ years across intelligent systems, scalable backends, and full-stack products. My work spans model selection, retrieval design, inference optimization, and production operations.
I’m most useful where the problem is still a little messy: when accuracy, latency, cost, and user experience all need a seat at the same table.
Architecting recommendation intelligence, optimizing LLM inference, and building resilient ML services that operate at global scale.
Delivered applied NLP, fraud detection, conversational AI, and document intelligence for enterprise workflows.
M.S. Computer Science · Machine Learning & AI
Missouri University of Science and Technology · 2024
B.E. Electronics & Communication
Sathyabama University · 2021
I’m open to conversations about AI/ML engineering, applied LLM systems, and product-minded technical roles.
sairambodapothula0990@gmail.com ↗