Jobs
Locationsexpand_more
All locations
Jobs in TexasJobs in New YorkJobs in CaliforniaJobs in FloridaJobs in ArizonaJobs in Massachusetts
Categoriesexpand_more
All categories
Healthcare & NursingLogistics & WarehouseEngineeringITHospitality & CateringTravel
SkillsCompaniesCareer GuidesBlogSalary
JobsLocationsCategoriesCompaniesCareer GuidesBlogSalary

Top states

TexasNew YorkCaliforniaFloridaArizonaMassachusetts

Top categories

Healthcare & NursingLogistics & WarehouseEngineeringITHospitality & CateringTravel
Recrutus

Curating the world's most innovative career opportunities. We bridge the gap between visionary talent and industry-leading companies.

Search roles by city, category, skill, or job type — explore verified US employers, salary benchmarks, and remote-friendly teams hiring nationwide.

publiclanguageshare
Job seekers
Browse jobsCompanies hiringRemote jobsJobs by locationJobs by cityJobs by categoryJobs by skillCareer guidesCareer blogSalary insights
Job types
Contractor jobsFull-Time jobsIntern jobsOther jobsPart-Time jobsPer-Diem jobsTemporary jobs
Top states
Jobs in TexasJobs in New YorkJobs in CaliforniaJobs in FloridaJobs in ArizonaJobs in MassachusettsAll states →
Top categories
Healthcare & Nursing jobsLogistics & Warehouse jobsEngineering jobsIT jobsHospitality & Catering jobsTravel jobsSales jobs
Popular skills
CDL A jobsRegistered Nurse jobsBLS jobsAcls jobs
Featured employers
Company
About usFAQContactPrivacy policyUS privacy noticeAccessibility

Recrutus helps candidates discover roles that match their skills and helps teams reach qualified applicants faster. Browse by metro, discipline, or work style — from internships to senior leadership.

© 2026 Recrutus. All rights reserved.
Terms of serviceCookie policyAcceptable useDMCA policyEmployer termsCandidate terms
Jobs
Locationsexpand_more
All locations
Jobs in TexasJobs in New YorkJobs in CaliforniaJobs in FloridaJobs in ArizonaJobs in Massachusetts
Categoriesexpand_more
All categories
Healthcare & NursingLogistics & WarehouseEngineeringITHospitality & CateringTravel
SkillsCompaniesCareer GuidesBlogSalary
JobsLocationsCategoriesCompaniesCareer GuidesBlogSalary

Top states

TexasNew YorkCaliforniaFloridaArizonaMassachusetts

Top categories

Healthcare & NursingLogistics & WarehouseEngineeringITHospitality & CateringTravel
Recrutus

Curating the world's most innovative career opportunities. We bridge the gap between visionary talent and industry-leading companies.

Search roles by city, category, skill, or job type — explore verified US employers, salary benchmarks, and remote-friendly teams hiring nationwide.

publiclanguageshare
Job seekers
Browse jobsCompanies hiringRemote jobsJobs by locationJobs by cityJobs by categoryJobs by skillCareer guidesCareer blogSalary insights
Job types
Contractor jobsFull-Time jobsIntern jobsOther jobsPart-Time jobsPer-Diem jobsTemporary jobs
Top states
Jobs in TexasJobs in New YorkJobs in CaliforniaJobs in FloridaJobs in ArizonaJobs in MassachusettsAll states →
Top categories
Healthcare & Nursing jobsLogistics & Warehouse jobsEngineering jobsIT jobsHospitality & Catering jobsTravel jobsSales jobs
Popular skills
CDL A jobsRegistered Nurse jobsBLS jobsAcls jobs
Featured employers
Company
About usFAQContactPrivacy policyUS privacy noticeAccessibility

Recrutus helps candidates discover roles that match their skills and helps teams reach qualified applicants faster. Browse by metro, discipline, or work style — from internships to senior leadership.

© 2026 Recrutus. All rights reserved.
Terms of serviceCookie policyAcceptable useDMCA policyEmployer termsCandidate terms
Jobs
Locationsexpand_more
All locations
Jobs in TexasJobs in New YorkJobs in CaliforniaJobs in FloridaJobs in ArizonaJobs in Massachusetts
Categoriesexpand_more
All categories
Healthcare & NursingLogistics & WarehouseEngineeringITHospitality & CateringTravel
SkillsCompaniesCareer GuidesBlogSalary
JobsLocationsCategoriesCompaniesCareer GuidesBlogSalary

Top states

TexasNew YorkCaliforniaFloridaArizonaMassachusetts

Top categories

Healthcare & NursingLogistics & WarehouseEngineeringITHospitality & CateringTravel
  1. Home
  2. chevron_right
  3. scientific & qa
  4. chevron_right
  5. Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language)
Tencent logo

Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language)

Not Disclosed•InternOn-site

location_onBellevue Presbyterian Church, 1717, Bellevue Way Northeast, Bellevue, King County, Washington, 98004, United States

Apply Now

About the Team

Tencent AI Lab at the Seattle Area, established in May 2017, strives to continuously improve AI's capability in perception, cognition, and creativity. The lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The long-term ambition is to drive the development of Artificial General Intelligence (AGI) and, ultimately, Artificial Superintelligence (ASI). Researchers there aim at solving challenging real-world problems with advanced technologies and publish extensively at top conferences and journals.

The Technology Engineering Group (TEG) supports the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. As the operator of the largest networking, devices, and data center in Asia, TEG leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.

About the Role

We are seeking research interns interested in developing novel speech, music, audio, vision, and language processing techniques and large multimodal models for our Seattle area office located in Bellevue, WA. Every research intern will work directly with researchers on a project aimed at attacking one of the core problems by inventing cutting-edge techniques. We encourage discussions and collaborations between researchers and interns, and interns are encouraged to publish the results from their internship.

Our projects span a wide range of areas, including developing more effective multimodal pretraining and post-training strategies for audio, speech, music, image, and video understanding and generation. We aim to enable fully duplex conversations, design more efficient large-model architectures, enhance multimodal memory and reasoning capabilities, and advance novel audio, speech, music, image, and video processing techniques—such as encoding, tokenization, and representation learning—with a focus on multimodal applications and end-to-end large models.

Equal Opportunity

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Work location

Work model: On-site

location_on

Similar Job Opportunities

Amazon logo

Senior Applied Scientist, Leo Satellite Build Intelligence

Amazon • Bellevue, Washington

$167k-226karrow_forward
Ochsner Clinic Foundation logo

Senior Data Scientist

Ochsner Clinic Foundation • New Orleans, Louisiana

Not Disclosedarrow_forward
Microsoft logo

Senior Data and Applied Scientist

Microsoft • Mountain View, California

Skills, education and keywords

Skills: Python, C++, Natural Language Processing, Speech Processing, Audio Processing, Music Processing, Computer Vision, Dialog System, Machine Learning, Deep Learning.

Education: Ph.D. students in computer science, electrical engineering, mathematics or related field required.

Frequently asked questions about Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent

What does a Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent do?expand_more
In this Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent role, you will develop novel speech, music, audio, vision, and language processing techniques; enhance multimodal memory and reasoning capabilities in large foundation models; design efficient large-model architectures for audio, speech, and video understanding; and invent cutting-edge techniques to solve core problems in ai perception and cognition.
What are the requirements for this Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) role?expand_more
Tencent is looking for candidates who meet the following requirements: Ph.D. student in computer science, electrical engineering, mathematics or related field; Research experience in NLP, speech, audio, music processing, computer vision, dialog system, or machine learning; Publication track record; Proficiency in Python and/or C++; and Experience with leading deep learning toolkits.
Where is the Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) role at Tencent located?expand_more
Recrutus

Curating the world's most innovative career opportunities. We bridge the gap between visionary talent and industry-leading companies.

Search roles by city, category, skill, or job type — explore verified US employers, salary benchmarks, and remote-friendly teams hiring nationwide.

publiclanguageshare
Job seekers
Browse jobsCompanies hiringRemote jobsJobs by location

Bellevue Presbyterian Church, 1717, Bellevue Way Northeast, Bellevue, King County, Washington, 98004, United States

Bellevue, Washington

Key Responsibilities

  • check_circleDevelop novel speech, music, audio, vision, and language processing techniques
  • check_circleEnhance multimodal memory and reasoning capabilities in large foundation models
  • check_circleDesign efficient large-model architectures for audio, speech, and video understanding
  • check_circleInvent cutting-edge techniques to solve core problems in AI perception and cognition
  • check_circleDesign and implement large multimodal models for end-to-end applications
  • check_circleConduct research on multimodal pretraining and post-training strategies
  • check_circleCollaborate with researchers to publish results at top conferences and journals

Requirements

  • verifiedPh.D. student in computer science, electrical engineering, mathematics or related field
  • verifiedResearch experience in NLP, speech, audio, music processing, computer vision, dialog system, or machine learning
  • verifiedPublication track record
  • verifiedProficiency in Python and/or C++
  • verifiedExperience with leading deep learning toolkits

Benefits & Perks

check_circle1 hour of paid sick leave for every 30 hours workedcheck_circle13 paid holidays throughout the calendar yearcheck_circleEligibility to enroll in company-sponsored medical plan for full-time interns
Tencent logo
Company

Tencent

Industry

scientific & qa

View company profilearrow_forwardlanguageWebsite
Quick Overview

Experience

Intern

Education

Ph.D. students in computer science, electrical engineering, mathematics or related field required

Job Type

Intern

Skills Required

PythonC++Natural Language ProcessingSpeech ProcessingAudio ProcessingMusic Processing
$120k-235k
arrow_forward
Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent is based in Bellevue Presbyterian Church, 1717, Bellevue Way Northeast, Bellevue, King County, Washington, 98004, United States. This is a on-site role.
Is this Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) job remote, hybrid, or on-site?expand_more
Tencent has listed this Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) role as on-site.
How much experience is required for this Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) role?expand_more
Candidates for Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent should have intern.
What skills do you need for the Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) role at Tencent?expand_more
Key skills for Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent include Python; C++; Natural Language Processing; Speech Processing; Audio Processing; Music Processing; Computer Vision; and Dialog System.
What education is required for Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent?expand_more
Educational requirements for this role: Ph.D. students in computer science, electrical engineering, mathematics or related field required.
What category does the Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) role belong to?expand_more
Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) at Tencent is part of the scientific & qa job category on Recrutus.
Jobs by city
Jobs by category
Jobs by skill
Career guides
Career blog
Salary insights
Job types
Contractor jobsFull-Time jobsIntern jobsOther jobsPart-Time jobsPer-Diem jobsTemporary jobs
Top states
Jobs in TexasJobs in New YorkJobs in CaliforniaJobs in FloridaJobs in ArizonaJobs in MassachusettsAll states →
Top categories
Healthcare & Nursing jobsLogistics & Warehouse jobsEngineering jobsIT jobsHospitality & Catering jobsTravel jobsSales jobs
Popular skills
CDL A jobsRegistered Nurse jobsBLS jobsAcls jobs
Featured employers
Company
About usFAQContactPrivacy policyUS privacy noticeAccessibility

Recrutus helps candidates discover roles that match their skills and helps teams reach qualified applicants faster. Browse by metro, discipline, or work style — from internships to senior leadership.

© 2026 Recrutus. All rights reserved.
Terms of serviceCookie policyAcceptable useDMCA policyEmployer termsCandidate terms
Computer Vision
Dialog System
Machine Learning
Deep Learning