Thanks for the explanation! This definitely reminds me of CrowdStrike outages la...

cocoa19 · 2025-11-19T15:41:36 1763566896

It might remind you of Crowdstrike because of the scale.

Outages are in a large majority of cases caused by change, either deployments of new versions or configuration changes.

harivyom · 2025-11-20T04:56:47 1763614607

zone your deployments first -blue/green. Have a small blue zone, and test it out. If it works, then expand to green deployments.

A configuration file should not grow! design failure here, I want to understand