
First Drive West, San Francisco, California, 94129, United States
Our Safety Systems team is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency. Within Safety Systems, the Model Policy team aligns model behavior with desired human values and norms. We co-design policy with models and for models by driving rapid policy taxonomy iteration based on data and defining evaluation criteria for foundational models' ability to reason about safety.
Frontier AI systems are rapidly expanding what is possible in cybersecurity and software engineering. These capabilities create major defensive opportunities, but they also raise serious dual-use and misuse risks across areas such as malware development, exploit discovery, vulnerability chaining, credential abuse, cyber intrusion, and autonomous offensive operations.
In this role, you will help define how OpenAI's models should behave in high-risk cybersecurity contexts. You will develop policy frameworks, threat models, taxonomies, evaluations, and behavioral specifications that guide model behavior across training, deployment, and monitoring systems. This role sits at the intersection of cybersecurity, AI safety, threat modeling, evaluation science, and policy implementation.
You will work closely with research, engineering, safety training, preparedness, and product teams to build policies that are technically grounded, measurable, enforceable, and responsive to real-world cyber risk.
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
Our approach is informed by key publications including "Accelerating the cyber defense ecosystem that protects us all," "Safety at every step," "Safety evaluations hub," the "GPT‑5.5 System Card," and the "OpenAI Model Spec."
This role is based in our San Francisco office. We encourage you to apply even if you prefer a different work location, as factors may change over time. We offer relocation support to new employees and use a hybrid model: three days in the office per week with optional work from home on Thursdays and Fridays.
Our open-plan offices feature height-adjustable desks, conference rooms, phone booths, well-stocked kitchens with three in-house prepared meals daily, a private outdoor space, nap rooms, and private bike storage.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or any other applicable legally protected characteristic. We are committed to providing reasonable accommodations to applicants with disabilities.
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Work model: Hybrid