Software Engineer, Data Infrastructure at OpenAI
Description
About The Team
Data Platform at OpenAI owns the foundational data stack powering critical product, research, and analytics workflows.
We operate some of the largest Spark compute fleets in production; design and build data lakes and metadata systems on Iceberg and Delta with a vision toward exabyte-scale architecture; run high-throughput streaming platforms on Kafka and Flink; provide orchestration with Airflow; and support ML feature engineering tooling such as Chronon.
Our mission is to deliver reliable, secure, and efficient data access at scale and to accelerate intelligent, AI-assisted data workflows.
Join us to build and operate these core platforms that underpin OpenAI products, research, and analytics.
We’re not just scaling infrastructure – we’re redefining how people interact with data.
Our vision includes intelligent interfaces and AI-assisted workflows that make working with data faster, more reliable, and more intuitive.
About The Role
This role focuses on building and operating data infrastructure that supports massive compute fleets and storage systems, designed for high performance and scalability.
You’ll help design, build, and operate the next generation of data infrastructure at OpenAI.
- Role: Software Engineer, Data Infrastructure
- Company: OpenAI
- Location: San Francisco, CA
- Job found on: 16th of May, 2026


