
NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States
At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, we have been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent position us to be at the forefront of enterprises leveraging AI.
The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with partners across the company to advance the state of the art in science and AI engineering. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI in responsible and scalable ways for the highest leverage impact.
We are building an enterprise Generative AI Platform that lets dozens of product teams compose powerful, safe, and explainable AI capabilities without wrestling with model minutiae or infrastructure plumbing. As a Distinguished AI Engineer, you will design the agentic workflow framework and shared services—such as memory, guardrails, vector search, SDKs, and blueprints—that translate foundation model power into production-grade applications used by millions of users across multiple lines of business.
In this role, you will partner with a cross-functional team of engineers, research scientists, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One. You will contribute to the north star platform architecture, continuously publishing living diagrams and canonical APIs that cover agent orchestration, RAG pipelines, prompt libraries, and multi-tenant policy enforcement. A major emphasis will be placed on standardizing and automating agentic workflows, evaluating frameworks like LangGraph and AutoGen, and hardening patterns that best meet enterprise SLAs.
Developer experience is a cornerstone of this work. You will craft an end-to-end GenAI SDK, CLI, and starter kits that let AI engineers spin up secure, observable agentic workflows in minutes, shrinking prototype-to-production timelines. Trust and safety remain paramount; you will help bring together a vision of central guardrail services to ensure zero Sev4 incidents. Additionally, you will collaborate with cross-organization architects to drive end-to-end performance by optimizing orchestration and reducing per-token spend.
Finally, you will coach and evangelize, hosting architecture office hours, mentoring Staff and Principal engineers, authoring technical design documents, and representing Capital One at Tier 1 AI conferences to amplify our platform vision across internal and external communities.
Capital One is committed to a fair and inclusive hiring process. We encourage all qualified candidates to apply. If you require an accommodation during the application or interview process, please contact Capital One Recruiting at 1-800-304-9102 or via email at RecruitingAccommodation@capitalone.com. All information provided will be kept confidential and used only to the extent required to provide needed reasonable accommodations.
Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. We promote a drug-free workplace and consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws. Capital One values diversity and believes that a diverse workforce drives innovation and better serves our customers.
Work model: On-site
New York, New York
8+ years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
2+ years of experience supporting Agentic Frameworks (LangChain, CrewAI, Semantic Kernel, or AutoGen)
2+ years of experience with LLMOps (Google Cloud Vertex AI, Amazon SageMaker, Azure Machine Learning)
8+ years of experience designing mission-critical machine learning platforms
2+ years of experience architecting, designing, developing, integrating, delivering, and supporting complex AI systems
Demonstrated ability to lead and mentor multiple engineering teams and influence cross-functional stakeholders up to the VP level
Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
Master's degree in Computer Science, Computer Engineering, or relevant technical field
Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers
Experience leading GenAI or LLM-Powered application architectures in production
Deep understanding of Responsible AI, data privacy and multi-tenant security patterns
Experience as a Staff-plus or Distinguished IC engineer influencing 50+ engineers and C-suite stakeholders
K8s mastery (multi-region clusters, service mesh)
Experience staying abreast of the latest AI research and AI systems and applying novel techniques in production
Capital One • New York, New York
Skills: Machine Learning, Python, Go, Scala, Java, AWS, Google Cloud, Azure, LangChain, CrewAI.
Education: Bachelor's degree in Computer Science, Engineering, or AI required; Master's degree in Computer Science, Engineering, or relevant technical field preferred.