
Nevada State Capitol, East Musser Street, Carson City, Nevada, 89703, United States
OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building cutting-edge, ultra-high-performance GPU platforms designed to support AI, ML, and HPC workloads. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, testing, triage automation, and diagnostic services. These capabilities are essential for running distributed workloads across thousands of GPUs, leveraging technologies like RoCE and InfiniBand.
The role resides within the Compute AI Infrastructure In-Band Engineering team, which owns the critical infrastructure responsible for automated testing of new platform shapes (AMD, Intel, Arm, Nvidia), hardware bring-up, configuration, benchmarking, and debugging. The services operate at the unique intersection of bare metal hardware and full-stack orchestration frameworks. The team interfaces directly with OCI APIs, NICs, SmartNICs, ILOMs, and GPUs to build high-performance, scalable services and tooling that launch, configure, test, and validate server platforms across OCI's massive fleet.
This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. As a software developer, you will own the software design and development for major components of Oracle's Cloud Infrastructure. You will partner closely with teams in Compute, Networking, Security, Data Center Engineering, and Hardware Development to ensure OCI can launch, scale, and maintain new server platforms with minimal operational overhead and high reliability.
You will work directly with cutting-edge GPU hardware and see the direct impact of your work on the business. This is a dynamic and flexible workplace where you will be part of a team of smart, motivated, and diverse people, given the autonomy and support to do your best work. We strive for equity, inclusion, and respect for all, and we are constantly learning and taking opportunities to grow our careers and ourselves.
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
We are committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, please let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.
Work model: On-site
Carson City, Nevada
Strong background in Linux systems. Familiarity with system-level architecture, data synchronization, fault tolerance, and state management. General enterprise storage, networking, or computing experience. Experience with server and GPU hardware architecture and system management. Experience with InfiniBand or RoCE networking technologies. Hands-on experience designing, developing, and operating public cloud service data planes.