Blog
Technical articles on platform engineering, DevOps, cloud architecture, and engineering leadership.
Building Production RAG Pipelines: Beyond the Hello World Tutorial
Most RAG tutorials stop at the demo. This guide covers chunking strategies, embedding models, retrieval evaluation, and the infrastructure decisions that make RAG reliable in production.
LLM Inference at Scale: Kubernetes, GPUs, and Keeping Costs Sane
Running LLMs in production on your own infrastructure is genuinely hard. This is what we've learned deploying and operating self-hosted models at scale.
AI Agents in Production: What Nobody Tells You
Agentic AI systems are powerful and notoriously hard to operate reliably. Here's what we've learned shipping agents to production.
MLOps in 2025: What's Changed and What Still Hasn't
The MLOps landscape has matured significantly. Some problems are solved. Others are as painful as ever. Here's an honest assessment.
AI Strategy for CTOs: Build, Buy, or API?
The most important AI decision most CTOs will make in 2025 is not which model to use. It's how to integrate AI into your product and engineering organisation without creating a mess.
Platform Engineering for AI Teams: What's Different
AI and ML workloads break the assumptions that traditional developer platforms are built on. Here's how to extend your IDP to support data scientists and ML engineers.
Building an Internal Developer Platform: From Zero to Production
A practical guide to designing and implementing an IDP that genuinely increases developer velocity without adding complexity.
GitOps in Production: ArgoCD Patterns That Actually Work
Practical ArgoCD configuration patterns for multi-environment deployments, with real-world examples from production Kubernetes clusters.
What Does a Fractional CTO Actually Do? A Founder's Guide
What to expect when engaging a Fractional CTO, how to work with one effectively, and when it makes more sense than a full-time hire.