Applied Researcher II (AI Foundations, LLM Core and Agentic AI)

Not Disclosed•Full-TimeOn-site

location_onNYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States

Apply Now

About Capital One and AI

At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. For years, we have been leading the industry in using machine learning to create real-time, intelligent, automated customer experiences. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to building world-class applied science and engineering teams and continue our industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure.

About the Team

The AI Foundations team is at the center of bringing our vision for AI at Capital One to life. Our work touches every aspect of the research life cycle, from partnering with academia to building production systems. We work with product, technology, and business leaders to apply the state of the art in AI to our business.

About the Role

In this role, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses. You will partner with a cross-functional team of data scientists, software engineers, machine learning engineers, and product managers to deliver AI-powered products that change how customers interact with their money. You will leverage a broad stack of technologies to reveal insights hidden within huge volumes of numeric and textual data, building AI foundation models through all phases of development.

You will engage in high-impact applied research to take the latest AI developments and push them into the next generation of customer experiences. A key part of this role involves flexing your interpersonal skills to translate the complexity of your work into tangible business goals. We are looking for individuals who love the process of analyzing and creating, share a passion for doing the right thing, and know that it is ultimately about making the right decision for our customers.

Hiring Process

Candidates hired to work in other locations will be subject to the pay range associated with that location. This role is expected to accept applications for a minimum of 5 business days. Please note that no agencies are needed for this position.

Equal Opportunity and Culture

Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. We promote a drug-free workplace and will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws. We are committed to building a diverse and inclusive environment where everyone can thrive.

Work location

Work model: On-site

NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States

New York, New York

Key Responsibilities

check_circlePartner with cross-functional teams to deliver AI-powered products
check_circleBuild AI foundation models through design, training, evaluation, and implementation
check_circleDeliver scalable models and libraries to existing products
check_circleTranslate complex research concepts into tangible business goals
check_circleLeverage technologies like PyTorch and AWS to extract insights from large data volumes
check_circleOwn and pursue a research agenda including problem selection and project execution
check_circleConduct high-impact applied research to advance customer experiences

Requirements

verifiedPhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research
verifiedM.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 4 years of experience in Applied Research
verifiedPhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields
verifiedLLM: PhD focus on NLP or Masters with 5 years of industrial NLP research experience
verifiedLLM: Multiple publications on topics related to the pre-training of large language models
verifiedLLM: Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
verifiedLLM: Publications in deep learning theory
verifiedLLM: Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR

Nice to Have

PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, or Electrical Engineering. LLM focus on NLP (PhD) or Masters with 5 years of industrial NLP research experience. Multiple publications on pre-training of large language models (e.g., technical reports, SSL techniques, optimization). Membership in a team that trained a large language model from scratch (10B+ parameters, 500B+ tokens). Publications in deep learning theory. Publications at ACL, NAACL, EMNLP, Neurips, ICML, or ICLR. PhD focus on geometric deep learning (Graph Neural Networks, Sequential Models, Multivariate Time Series). Multiple papers on training models on graph and sequential data structures at KDD, ICML, NeurIPs, or ICLR. Experience scaling graph models to greater than 50m nodes. Experience with large-scale deep learning-based recommender systems. Experience with production real-time and streaming environments. Contributions to open-source frameworks (pytorch-geometric, DGL). Proposed new methods for inference or representation learning on graphs or sequences. Experience with datasets of 100m+ users. PhD focused on optimizing training of very large deep learning models. Multiple years of experience and/or publications on model sparsification, quantization, training parallelism/partitioning design, gradient checkpointing, or model compression. Experience optimizing training for a 10B+ model. Deep knowledge of deep learning algorithmic and/or optimizer design. Experience with compiler design. PhD focused on guiding LLMs with further tasks (Supervised Finetuning, Instruction-Tuning, Dialogue-Finetuning, Parameter Tuning). Demonstrated knowledge of transfer learning, model adaptation, and model guidance. Experience deploying a fine-tuned large language model.

Benefits & Perks

check_circleComprehensive health, financial, and other benefits for total well-beingcheck_circlePerformance-based incentive compensation including cash bonuses and long-term incentivescheck_circleEmployment authorization sponsorship for qualified applicants

Similar Job Opportunities

Applied Researcher II (AI Foundations, LLM Core and Agentic AI)

Capital One • New York, New York

$263k-327karrow_forward

Applied Researcher I (AI Foundations, LLM Core and Agentic AI)

Capital One • New York, New York

$219k-272karrow_forward

Applied Researcher II (AI Foundations, LLM Core and Agentic AI)

Capital One • New York, New York

$263k-327karrow_forward

Skills, education and keywords

Skills: Machine Learning, Pytorch, Aws, Huggingface, Lightning, Vectordbs, LLM, NLP, Deep Learning, Graph Neural Networks.

Education: PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields required; Master's in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields with 4 years experience.

Frequently asked questions about Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One

What does a Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One do?expand_more

Day-to-day, the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One will partner with cross-functional teams to deliver ai-powered products; build ai foundation models through design, training, evaluation, and implementation; deliver scalable models and libraries to existing products; and translate complex research concepts into tangible business goals.

What are the requirements for this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role?expand_more

To qualify for the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One position, applicants should have: PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research; M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 4 years of experience in Applied Research; PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields; LLM: PhD focus on NLP or Masters with 5 years of industrial NLP research experience; LLM: Multiple publications on topics related to the pre-training of large language models; and LLM: Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens).

Where is the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role at Capital One located?expand_more

Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One is based in NYU Paulson Center, 181, Mercer Street, University Village, Manhattan, New York County, New York, 10012, United States. This is a on-site role.

Is this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) job remote, hybrid, or on-site?expand_more

Capital One has listed this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role as on-site.

How much experience is required for this Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role?expand_more

Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One typically requires 2–4 years of relevant experience at the mid level level.

What skills do you need for the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role at Capital One?expand_more

Key skills for Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One include Machine Learning; Pytorch; Aws; Huggingface; Lightning; Vectordbs; LLM; and NLP.

What education is required for Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One?expand_more

Educational requirements for this role: PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields required; and Master's in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields with 4 years experience.

What category does the Applied Researcher II (AI Foundations, LLM Core and Agentic AI) role belong to?expand_more

Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One is part of the it job category on Recrutus.

About Capital One and AI

About the Team

About the Role

Equal Opportunity and Culture

Frequently asked questions about Applied Researcher II (AI Foundations, LLM Core and Agentic AI) at Capital One