SafeOps Academy

Production-Grade Kubernetes with Guardrails and AI-Assisted SRE

A practical course for shipping faster without increasing production risk. Every chapter is built around a real failure mode, a safe operating path, and hands-on labs.

Core track: 14 chapters Advanced track: 4 modules Format: chapter + lab + runbook + quiz Labs & quizzes: members only

Start Here Open Curriculum Prerequisites Get Full Access

Watch the Intro

Five minutes that set the foundation: the SafeOps mental model, the pledge, and a preview of every chapter in the course.

Watch Chapter 00: AI as a Very Well-Read Junior Engineer

Prerequisites

You'll need accounts (GitHub, optionally Hetzner Cloud and Cloudflare), a few CLI tools (Terraform, kubectl, kind, flux, SOPS, age), and Docker running locally.

Two paths are supported: a free Kind path (Docker only, no cloud costs) and a production-like Hetzner Cloud path.

Read the full prerequisites & environment setup →

Core Track

Foundation-first path for platform, CI/CD, GitOps, observability, reliability, and on-call operations.

Core

Advanced Track

Coming soon — SafeOps Advanced: Production AI, Under Control.

Policy, supply-chain trust, progressive delivery, and rollback/data migration safety patterns.

Advanced

Production-Grade Kubernetes with Guardrails and AI-Assisted SRE

Watch the Intro

Prerequisites

Core Track

Chapter 01: Blast Radius & the Shape of Safety

Chapter 02: Infrastructure as Code (IaC) with Kind

Chapter 03: Secrets Management (SOPS + Age)

Chapter 04: GitOps & Version Promotion

Chapter 05: CI/CD & Developer Guardrails

Chapter 06: Network Policies (Production Isolation)

Chapter 07: Security Context & Pod Hardening

Chapter 08: Resource Management & QoS

Chapter 09: Availability Engineering (HPA + PDB)

Chapter 10: Observability (Metrics, Logs, Traces)

Chapter 11: Backup & Restore Basics

Chapter 12: Controlled Chaos

Chapter 13: AI-Assisted SRE Guardian

Chapter 14: 24/7 Production SRE

Advanced Track

Chapter 15: Supply Chain Security

Chapter 16: Admission Policy Guardrails

Chapter 17: Rollback & Data Migrations

Module: Progressive Delivery