
location_onNYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States
At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. For years, we have been leading the industry in using machine learning to create real-time, intelligent, automated customer experiences. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking.
The AI Foundations team is at the center of bringing our vision for AI at Capital One to life. Our work touches every aspect of the research life cycle, from partnering with Academia to building production systems. We work with product, technology, and business leaders to apply the state of the art in AI to our business, committed to building world-class applied science and engineering teams with breakthrough product experiences and scalable, high-performance AI infrastructure.
Work model: On-site
NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States
New York, New York
PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, or Electrical Engineering. LLM experience. PhD focus on NLP or Masters with 10 years of industrial NLP research experience. Core contributor to a team that has trained a large language model from scratch (10B+ parameters, 500B+ tokens) or through continued pre-training, post-training pipeline for alignment and reasoning, LLM optimizations, or complex reasoning with multi-agentic LLMs. Numerous publications at ACL, NAACL, EMNLP, Neurips, ICML, or ICLR on topics related to the pre-training of large language models. Experience working on an LLM (open source or commercial) that is currently available for use. Demonstrated ability to guide the technical direction of a large-scale model training team. Experience with common training optimization frameworks (DeepSpeed, NeMo).
Capital One • New York, New York
Capital One • New York, New York
Capital One • New York, New York
Skills: Machine Learning, Ai, Pytorch, Aws, Huggingface, Lightning, Vectordbs, Deep Learning, Training Optimization, Self-Supervised Learning.
Education: PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics or related fields; Master's in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics or related fields; PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields.