IBM

Research Scientist - Large Scale Language Models Intern: 2025, Multiple Cities

Job Description

Introduction

Research Scientists are charting the future of Artificial Intelligence, creating breakthroughs in quantum computing, discovering how blockchain will reshape the enterprise, and much more. Join a team that is dedicated to applying science to some of today's most complex challenges, whether it’s discovering a new way for doctors to help patients, teaming with environmentalists to clean up our waterways, or enabling retailers to personalize customer service.

Your Role and Responsibilities

This is a 2025 summer internship with the following start dates: May - August, or June - September for quarter-system schools.

IBM Research is looking for strong BS, MS and PhD level interns to join our team in 2025 to work in the area of large-scale language models. Our team directly contributes to the construction of IBM's largest-scale AI models as well as the software supporting them. We have a range of possible internship projects in the general areas of model architecture, analysis, training and alignment techniques, reasoning and planning, tool use, multilinguality, curation of challenging benchmarks and human preference data, structured input and output, and programming models for LLM applications.

In this role, you are expected to conduct novel research, which requires the ability to research and understand state-of-the-art techniques, datasets and results in the specific area of the internship as described above. A desirable outcome of the research is a high-quality submission to a conference.

Required Technical and Professional Expertise

Applicants should be PhD or MS students.
Design, validation, and characterization of algorithms and/or systems using deep learning frameworks.
Familiarity with state-of-the-art technologies in one or more of the domains of research for the internship projects above.
Basic coursework on machine learning and deep learning.

Preferred Technical and Professional Expertise

Programming languages: Python, Java, C/C++, JavaScript, R, etc.
Experience in training large-scale machine learning models.
Experience analyzing large-scale data from a variety of sources.
2 years of experience in NLP, machine learning, or computational linguistics, and strong programming skills.
We prefer candidates with a strong publication record in conferences such as NeurIPS, AAAI, IJCAI, ICML, ICLR, ACL, EMNLP, and ICASSP.