
location_onNYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States
At Synchrony, the Digital Servicing Engineering domain is dedicated to maintaining the stability, observability, and operational efficiency of critical applications. Our Reliability Engineering function sets the standard for service health, incident response, and operational readiness, ensuring that our systems support millions of customers with resilience and speed.
We foster a culture of operational ownership and blameless learning. Our teams work to reduce manual toil, improve system availability, and accelerate safe delivery through automation-first practices and modern architecture.
The VP, Reliability and Automation Engineering Manager leads the Reliability Engineering function, defining how reliability is practiced across the organization. This role is a hybrid of strategic leadership and hands-on technical expertise, responsible for setting standards for monitoring, alerting, and problem management while directly managing and developing a team of reliability engineers.
You will lead by example, influencing cross-functional engineering and product teams to embed reliability requirements into design and delivery processes. Your day-to-day involves driving an automation roadmap to reduce operational work, establishing repeatable engineering practices, and ensuring 24x7 operational coverage. You will partner with architects, security teams, and vendors to build resilient customer communication applications that align with Synchrony's technology strategy.
We are proud to offer flexibility in how you work. At Synchrony, you have the option to work from home near one of our Hubs or come into one of our offices. You will be required to commute to your nearest Hub for in-person engagement activities, such as regular business or team meetings, training, and culture events.
To apply, please submit your resume through our careers portal. We are committed to a fair and inclusive hiring process. If you require a reasonable accommodation to apply for a job or to perform your job, please contact our Career Support Line at 1-866-301-5627 (8am – 5pm CST, Monday to Friday). Representatives are available to discuss your specific situation regarding the application process or work procedures.
We are building a future where everyone can belong, connect, and turn ideals into action. When you join us, you become part of an inclusive culture where your individual skills, experience, and voice are not only heard but valued. More than 50% of our workforce is engaged in our Employee Resource Groups (ERGs), offering a safe space to learn and grow.
Synchrony is an equal opportunity employer. We consider qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status. Legal authorization to work in the U.S. is required for this position; we will not sponsor individuals for employment visas.
Skills: Solution Architecture, Application Development, Reliability Engineering, Architecture, Ci/cd, Incident Management, Operational Excellence, Problem-Solving, Root Cause Analysis, Monitoring.
Education: Bachelor's degree required with minimum 6 years of experience in solution architecture; High school diploma or equivalent required for eligibility.
Work model: Hybrid
NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States
New York, New York
Experience defining and managing SLIs/SLOs, error budgets, operational readiness reviews, and reliability KPIs. Strong background in automation-first operations. Experience establishing standards for monitoring/alerts quality. Deep expertise with production telemetry and tooling like Splunk and New Relic. Experience implementing log/metric/trace practices and dashboards that support fast triage and service recovery. Strong understanding of modern application architecture and runtime environments like PCF, AWS, microservices, REST APIs, ReactJS applications, iOS/Android native applications. Ability to influence design decisions to improve reliability, scalability and supportability. Experience delivering automation that uses AI/ML techniques for reliability outcomes. Excellent interpersonal skills and proven track record influencing across a matrixed organization. Desire to work in a dynamic, fast paced environment. Experience developing and supporting financial/banking applications. Strong attention to detail in a team environment.
Recrutus helps candidates discover roles that match their skills and helps teams reach qualified applicants faster. Browse by metro, discipline, or work style — from internships to senior leadership.