AI / LLM ENGINEER · USA

I build
intelligencethat holds
up inthe real world.

From retrieval systems to production ML infrastructure, I turn emerging AI capability into dependable products at scale.

Explore
selected work

RETRIEVAL SYSTEMS✦MODEL OPTIMIZATION✦AI PLATFORMS✦RETRIEVAL SYSTEMS✦MODEL OPTIMIZATION✦AI PLATFORMS✦

5+Years engineering
production systems

35%Lower inference
latency

200M+Users served by
semantic systems

25%Performance lift from
pipeline optimization

01SELECTED SYSTEMS

Proof over
promises.

Representative production work. Client-sensitive implementation details are intentionally abstracted.

Recommendation intelligence

Semantic discovery at streaming scale

A RAG-based content discovery architecture pairing vector retrieval with tuned generation for more relevant, explainable recommendations.

DECISION NOTESelected RAG over end-to-end fine-tuning to balance retrieval quality, operating cost, and the pace of catalog change.

+30%discovery accuracy

RAGPineconeFAISSFastAPIKubernetes

NLP · financial systems

Real-time fraud intelligence

A production fraud detection system combining BERT language signals with XGBoost decisioning for high-volume banking workflows.

DECISION NOTETurned unstructured transaction context into actionable risk signals while preserving a path for review and iteration.

$2M+annual savings

BERTXGBoostPythonAWSMLOps

Applied language AI

Conversational support engine

A transformer-powered support experience built to resolve common requests quickly and hand off complex conversations cleanly.

DECISION NOTEDesigned around resolution—not novelty—with intent routing, reliable fallbacks, and measurable customer outcomes.

50K+daily interactions

RasaTransformersNLPREST APIsMonitoring

BASED IN THE U.S. · WORKING GLOBALLY

02ABOUT

I care about the space between a promising model and a product people can trust.

I’m an AI/LLM engineer with 5+ years across intelligent systems, scalable backends, and full-stack products. My work spans model selection, retrieval design, inference optimization, and production operations.

I’m most useful where the problem is still a little messy: when accuracy, latency, cost, and user experience all need a seat at the same table.

Download résumé ↘

03CAPABILITIES

Built across the
whole system.

Intelligence

PyTorch
TensorFlow
Hugging Face
LangChain
LlamaIndex
RAG
Fine-tuning
Prompt engineering

Platforms

Python
FastAPI
Spring Boot
Node.js
Microservices
GraphQL
Kafka
Redis

Operations

Docker
Kubernetes
AWS
SageMaker
Azure ML
MLflow
Terraform
CI/CD

Data & interface

MySQL
PostgreSQL
MongoDB
Pinecone
FAISS
React
Angular
TypeScript

04EXPERIENCE

Where I’ve made
the work count.

2024 — NOW

Netflix

AI / LLM Engineer

Architecting recommendation intelligence, optimizing LLM inference, and building resilient ML services that operate at global scale.

10M+ daily requests
99.9% uptime
<200ms response

2019 — 2022

Cognizant

Software Engineer · AI/ML

Delivered applied NLP, fraud detection, conversational AI, and document intelligence for enterprise workflows.

40% less fraud
85% resolution rate
94% document accuracy

EDUCATION

M.S. Computer Science · Machine Learning & AI
Missouri University of Science and Technology · 2024

B.E. Electronics & Communication
Sathyabama University · 2021

05CONTACT

Have a hard
problem? Good.

I’m open to conversations about AI/ML engineering, applied LLM systems, and product-minded technical roles.

sairambodapothula0990@gmail.com ↗

I build intelligencethat holds up inthe real world.

Proof overpromises.