AI / LLM ENGINEER · USA

I build intelligence that holds up in the real world.

From retrieval systems to production ML infrastructure, I turn emerging AI capability into dependable products at scale.

RETRIEVAL SYSTEMS · MODEL OPTIMIZATION · AI PLATFORMS
5+ Years engineering production systems
35% Lower inference latency
200M+ Users served by semantic systems
25% Performance lift from pipeline optimization
01 SELECTED SYSTEMS

Proof over promises.

Representative production work. Client-sensitive implementation details are intentionally abstracted.

01
Recommendation intelligence

Semantic discovery at streaming scale

A RAG-based content discovery architecture pairing vector retrieval with tuned generation for more relevant, explainable recommendations.

DECISION NOTE: Selected RAG over end-to-end fine-tuning to balance retrieval quality, operating cost, and the pace of catalog change.
+30% discovery accuracy
RAG · Pinecone · FAISS · FastAPI · Kubernetes
02
NLP · financial systems

Real-time fraud intelligence

A production fraud detection system combining BERT language signals with XGBoost decisioning for high-volume banking workflows.

DECISION NOTE: Turned unstructured transaction context into actionable risk signals while preserving a path for review and iteration.
$2M+ annual savings
BERT · XGBoost · Python · AWS · MLOps
03
Applied language AI

Conversational support engine

A transformer-powered support experience built to resolve common requests quickly and hand off complex conversations cleanly.

DECISION NOTE: Designed around resolution, not novelty, with intent routing, reliable fallbacks, and measurable customer outcomes.
50K+ daily interactions
Rasa · Transformers · NLP · REST APIs · Monitoring
Sairam Bodapothula
BASED IN THE U.S. · WORKING GLOBALLY
02 ABOUT

I care about the space between a promising model and a product people can trust.

I’m an AI/LLM engineer with 5+ years across intelligent systems, scalable backends, and full-stack products. My work spans model selection, retrieval design, inference optimization, and production operations.

I’m most useful where the problem is still a little messy: when accuracy, latency, cost, and user experience all need a seat at the same table.

03 CAPABILITIES

Built across the whole system.

01

Intelligence

  • PyTorch
  • TensorFlow
  • Hugging Face
  • LangChain
  • LlamaIndex
  • RAG
  • Fine-tuning
  • Prompt engineering
02

Platforms

  • Python
  • FastAPI
  • Spring Boot
  • Node.js
  • Microservices
  • GraphQL
  • Kafka
  • Redis
03

Operations

  • Docker
  • Kubernetes
  • AWS
  • SageMaker
  • Azure ML
  • MLflow
  • Terraform
  • CI/CD
04

Data & interface

  • MySQL
  • PostgreSQL
  • MongoDB
  • Pinecone
  • FAISS
  • React
  • Angular
  • TypeScript
04 EXPERIENCE

Where I’ve made the work count.

2024 — NOW

Netflix

AI / LLM Engineer

Architecting recommendation intelligence, optimizing LLM inference, and building resilient ML services that operate at global scale.

  • 10M+ daily requests
  • 99.9% uptime
  • <200 ms response
2019 — 2022

Cognizant

Software Engineer · AI/ML

Delivered applied NLP, fraud detection, conversational AI, and document intelligence for enterprise workflows.

  • 40% less fraud
  • 85% resolution rate
  • 94% document accuracy
EDUCATION

M.S. Computer Science · Machine Learning & AI
Missouri University of Science and Technology · 2024

B.E. Electronics & Communication
Sathyabama University · 2021

05 CONTACT

Have a hard problem? Good.

I’m open to conversations about AI/ML engineering, applied LLM systems, and product-minded technical roles.

sairambodapothula0990@gmail.com