An Azure service that provides serverless Kubernetes, an integrated continuous integration and continuous delivery experience, and enterprise-grade security and governance.
Hello Himanshu Shekhar,
Thank you for your response regarding the maintenance window. We have to deal with the problem that the system is used in public transportation inftrastructure. The outage directly impacted passenger services. Given the 4-hour minimum requirement, how would you recommend we handle this situation?
Moreover, my primary concern issue 1 was not addressed: The AKS-managed eraser-controller-manager (v1.4.0) uses the deprecated API eraser.sh/v1alpha1, which Azure's own diagnostics flags as deprecated. This caused a cascading failure:
- Ereaser created imagejobs via deprecated API
- Gatekeeper crashed trying to validate them (OOMKill)
- 17 Eraser pods stuck with UnexpectedAdmissionError
- Resource exhaustion → 2.5 hour production outage
How can we prevent this Ereaser issue when we re-enable autmatic Node OS or Kubernetes Upgrade updates?
Greetings, Sophie Yavuz