Blog

Lessons from building and operating infrastructure at scale.

2026-03-25-10 minvictoriametricsobservabilitykubernetes

Running VictoriaMetrics at 50M+ Time Series

What actually breaks when you run VictoriaMetrics at scale, and the specific tweaks that stabilized a 26-node cluster handling 50M+ time series across 100+ Kubernetes clusters.

2026-03-10-8 minprometheusvictoriametricspythonopen-source

Turning Prometheus Label Values Into Metrics You Can Alert On

PromQL can't convert label values to metric values. I built a YAML-driven exporter that bridges this gap, with hot-reload and stale metric cleanup for dynamic clusters.

2026-03-05-11 mingitopsfleetjenkinsvmalertalerting

Deploying Alert Rules at Scale with Fleet and Jenkins

Fleet doesn't have pre-sync hooks like ArgoCD. Here's how I built a Jenkins pipeline that transforms custom alert templates into vmalert rules, validates them with dry-run, and deploys across 100+ clusters via GitOps.