From the Blog

Kubernetes Infrastructure Guides

Written by engineers who run production clusters. Real patterns, real kubectl commands, real decisions. No fluff.

GitOps Mar 2026
ArgoCD App-of-Apps Pattern: How We Structure Helm Deployments Across Multiple Environments
The exact repo structure, ApplicationSet config, and Helm value override pattern we use for multi-environment production deployments.
Read the guide →
Observability Feb 2026
The 22 Prometheus Alerting Rules Every Production Kubernetes Cluster Needs
The complete PromQL alert rule set we deploy on every engagement: node pressure, OOMKill risk, pod restarts, PVC near-full, and API server latency.
Read the guide →
Terraform Feb 2026
The Terraform Module Structure We Use for Every EKS Cluster
Our standard layout: VPC module, EKS module, Karpenter node pool config, and Terragrunt environment structure for dev, staging, and production.
Read the guide →
Reliability Jan 2026
Kubernetes Resource Requests and Limits: How to Tune Them Without Breaking Production
Why most teams set limits wrong, how OOMKill actually happens, and a data-driven approach using VPA recommendations and Prometheus p95 data.
Read the guide →
Migration Jan 2026
Migrating 18 Microservices to Kubernetes: The Phased Approach That Kept Production Stable
How we migrated an entire microservice platform with zero downtime: containerization order, Helm chart patterns, ArgoCD setup, and the DNS cutover strategy.
Read the guide →
Helm Dec 2025
Writing Production Helm Charts: The Patterns We Use and the Mistakes to Avoid
Lifecycle hooks, pre-upgrade jobs, optional sidecar injection, and how to structure values.yaml so operators don't have to read the source to configure a chart.
Read the guide →

