Applied Researcher II (AI Foundations, LLM Core and Agentic AI)

Not Disclosed•Full-TimeOn-site

location_onNYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States

Apply Now

About the Team

At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. For years, Capital One has been leading the industry in using machine learning to create real-time, intelligent, automated customer experiences. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking.

The AI Foundations team sits at the center of bringing this vision to life. Our work touches every aspect of the research life cycle, from partnering with academia to building production systems. We collaborate with product, technology, and business leaders to apply the state of the art in AI to our business, helping to reimagine how we serve the customers and businesses who have come to love the products and services we build.

Work location

Work model: On-site

NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States

New York, New York

Key Responsibilities

check_circlePartner with cross-functional teams to deliver AI-powered products
check_circleBuild AI foundation models through design, training, evaluation, and implementation
check_circleTranslate complex technical work into tangible business goals
check_circleLeverage PyTorch, AWS, and Huggingface to extract insights from large data volumes
check_circleOwn and pursue a research agenda including problem selection and project execution
check_circleConduct high-impact applied research to advance customer experiences

Requirements

verifiedPhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research
verifiedM.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 4 years of experience in Applied Research
verifiedExperience building large deep learning models
verifiedExperience delivering libraries, platform level code or solution level code
verifiedFirst author publications or projects in machine learning
verifiedPhD focus on NLP or Masters with 5 years of industrial NLP research experience
verifiedMultiple publications on topics related to pre-training of large language models
verifiedMember of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
verifiedPublications in deep learning theory
verifiedPublications at ACL, NAACL, EMNLP, Neurips, ICML or ICLR
verifiedPhD focused on topics related to optimizing training of very large deep learning models
verifiedExperience optimizing training for a 10B+ model
verifiedDeep knowledge of deep learning algorithmic and/or optimizer design
verifiedExperience with compiler design
verifiedPhD focused on topics related to guiding LLMs with further tasks

Nice to Have

PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, or Electrical Engineering. LLM focus on NLP or Masters with 5 years of industrial NLP research experience. Multiple publications on pre-training of large language models (e.g., technical reports, SSL techniques, model pre-training optimization). Membership in a team that has trained a large language model from scratch (10B+ parameters, 500B+ tokens). Publications in deep learning theory. Publications at ACL, NAACL, EMNLP, Neurips, ICML, or ICLR. PhD focused on optimizing training of very large deep learning models. Multiple years of experience and/or publications on Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, or Model Compression. Experience optimizing training for a 10B+ model. Deep knowledge of deep learning algorithmic and/or optimizer design. Experience with compiler design. PhD focused on guiding LLMs with further tasks (Supervised Finetuning, Instruction-Tuning, Dialogue-Finetuning, Parameter Tuning). Demonstrated knowledge of principles of transfer learning, model adaptation, and model guidance. Experience deploying a fine-tuned large language model.

Benefits & Perks

check_circleEligible for performance-based incentive compensation including cash bonuses and/or long-term incentivescheck_circleComprehensive, competitive, and inclusive set of health, financial, and other benefits

Similar Job Opportunities

Applied Researcher II

Capital One • New York, New York

$263k-327karrow_forward

Applied Researcher I

Capital One • New York, New York

$219k-272karrow_forward

Applied Researcher I

Capital One • New York, New York

$219k-272karrow_forward

Skills, education and keywords

Skills: Machine Learning, Pytorch, Aws, Huggingface, Lightning, Vector DBS, Ai Foundation Models, Deep Learning, LLM, NLP.

Education: PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields required; Master's in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields with 4 years experience.

Frequently asked questions about Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One

What does a Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One do?expand_more

A Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One is responsible for the following: Partner with cross-functional teams to deliver AI-powered products; Build AI foundation models through design, training, evaluation, and implementation; Translate complex technical work into tangible business goals; and Leverage PyTorch, AWS, and Huggingface to extract insights from large data volumes.

What are the requirements for this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role?expand_more

Capital One is looking for candidates who meet the following requirements: PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research; M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 4 years of experience in Applied Research; Experience building large deep learning models; Experience delivering libraries, platform level code or solution level code; First author publications or projects in machine learning; and PhD focus on NLP or Masters with 5 years of industrial NLP research experience.

Where is the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role at Capital One located?expand_more

Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One is based in NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States. This is a on-site role.

Is this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) job remote, hybrid, or on-site?expand_more

Capital One has listed this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role as on-site.

How much experience is required for this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role?expand_more

Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One typically requires 2–4 years of relevant experience at the mid level level.

What skills do you need for the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role at Capital One?expand_more

Key skills for Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One include Machine Learning; Pytorch; Aws; Huggingface; Lightning; Vector DBS; Ai Foundation Models; and Deep Learning.

What education is required for Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One?expand_more

Educational requirements for this role: PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields required; and Master's in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields with 4 years experience.

What category does the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role belong to?expand_more

Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One is part of the it job category on Recrutus.

About the Team

Frequently asked questions about Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One