Under Construction — Launching Soon

Your AI-Powered SRE Companion

Sharpen your skills with real-world challenges across Site Reliability Engineering, MLOps, and Generative AI — built by practitioners, for practitioners.

3 Challenge Tracks
Learning Paths
Free Forever

Master Modern Engineering

Three focused tracks designed for the modern engineer. Each challenge mirrors real-world scenarios you'll face on the job.

Coming Soon

GenAI Challenges

Navigate the complexities of LLMs in production — from prompt engineering to RAG pipelines and responsible AI at scale.

  • LLM Reliability & Observability
  • Prompt Engineering at Scale
  • RAG Pipeline Debugging
  • AI Cost Optimization
  • Guardrails & Safety Patterns

Coming Soon

MLOps Challenges

Bridge the gap between ML development and production. Tackle real challenges in model deployment, monitoring, and the full ML lifecycle.

  • Model Deployment & Serving
  • Feature Store Management
  • Training Pipeline Reliability
  • Data Drift Detection
  • CI/CD for ML Systems

Coming Soon

SRE Challenges

Classic and modern SRE scenarios — from incident management to capacity planning. Build the muscle memory needed for five-nines reliability.

  • Incident Response Simulations
  • SLO / SLI Design
  • Capacity Planning Exercises
  • On-Call Runbook Optimization
  • Chaos Engineering Scenarios

Built for the Trenches

SRE Buddy is being built by engineers who have dealt with production fires, 3 a.m. pages, and the unique challenges of running AI systems at scale.

Our goal is simple: create a platform where you can practise the scenarios that matter most — before you face them on the job.

Scenario-Based

Real incidents, real decisions, real tradeoffs.

Community-Driven

Challenges contributed by working practitioners.

Always Evolving

Content keeps pace with the industry.

Stay in the Loop

Be the first to know when SRE Buddy launches. No spam — just signal.

We'll send one email when we launch. That's it.