
This lead AI engineering role within Capital One's Intelligent Foundations and Experiences team focuses on building scalable infrastructure for foundation model hosting and large language model inference. The position involves designing and deploying AI software components, including model training, similarity search, and guardrails, while leveraging technologies like PyTorch and AWS. Key responsibilities include inventing state-of-the-art optimization techniques to enhance system performance, latency, and cost, as well as contributing to the long-term technical vision for foundational AI systems. The role appeals to candidates passionate about applying cutting-edge research to transform banking, offering the opportunity to work on high-impact problems in a collaborative environment that values responsible AI. The position is based in major US hubs including Cambridge, McLean, New York, and San Jose.

