
NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States
At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, we have been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent, along with our deep experience in machine learning, position us to be at the forefront of enterprises leveraging AI.
The Capital One machine learning platform organization manages our cloud-based enterprise AI+ML system, delivering the high-scale developer and runtime environments required to build, orchestrate, and deploy compute- and data-intensive AI systems across real-time and batch workloads. We are committed to continuing to build world-class applied science and engineering teams to deliver industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure.
As a Senior Distinguished Engineer, you will be a hands-on technical leader, passionate about distributed systems, who engineers and scales foundational compute capabilities for our platform. You will use your experience in building large-scale, highly available, and high-performance systems to develop our common compute infrastructure on top of CPU and GPU substrates.
Your contributions will power everything from developer notebooks to ML/DL model training, model inference, and feature generation pipelines, including pre-training and fine-tuning Transformer-based models as well as generative AI inference and agentic applications. In this role, you will architect and build control and data plane implementations to realize a highly available, multi-tenant, large-scale, and secure machine learning platform. You will direct the technical execution of a diverse project portfolio, collaborating with developers specializing in everything from distributed microservices to running large foundation models.
Beyond technical execution, you will help elevate the Capital One Distinguished Engineering community, establish yourself as a go-to resource on given technologies, and lead the way in developing next-generation talent by mentoring internal engineers and actively recruiting external candidates to strengthen the tech talent pool.
Capital One is open to hiring a remote employee for this opportunity. We are committed to a fair and inclusive recruiting process. If you require an accommodation during the application or interview process, please contact Capital One Recruiting at 1-800-304-9102 or via email at RecruitingAccommodation@capitalone.com. All information provided will be kept confidential and used only to the extent required to provide needed reasonable accommodations.
Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. We promote a drug-free workplace and consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws.
We are dedicated to building a diverse and inclusive environment where you can bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses. Join us to help bring humanity and simplicity to banking through our applications of AI & ML.
Work model: Remote
New York, New York
Master's Degree in Computer Science or Software Engineering. Hands-on experience in the internals of Ray (Actors/GCS/Scheduling) or Spark (Query Optimizer/Memory Management). Experience building platforms that support LLM training, fine-tuning, or high-throughput inference. Hands-on experience with AWS-specific compute primitives (EKS, EC2 UltraClusters, Graviton) and cost-optimization strategies. History of upstream contributions to major distributed systems projects.
Capital One • New York, New York
Skills: Machine Learning, AI, Golang, Python, Scala, Java, Spark, Dask, Ray, Flink.
Education: Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields; Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields; Master's Degree in Computer Science or Software Engineering.