
This senior leadership role serves as the primary escalation owner for AI and machine learning model-serving pipelines within the Department of War's War Data Platform. The position requires coordinating incident response, diagnosing serving failures using Kubernetes and observability tools, and leading cross-service collaboration to ensure uninterrupted AI performance across classified and unclassified environments. Key responsibilities include managing Tier-4 escalation workflows, conducting post-incident analysis, and maintaining operational readiness for mission-critical defense initiatives. The role offers the opportunity to work on cutting-edge AI technology directly supporting national security missions within a culture defined by innovation and excellence. Candidates must hold a Secret clearance with eligibility for Top Secret and possess extensive experience in enterprise incident management and cloud-native operations.






















