- Designing and operating large-scale batch data systems using Spark and Ray Data
- Scaling data ingestion pipelines as we grow to billions of rows per day
- Re-architecting prompt and model interaction data storage with a focus on cost, performance, and usability, primarily on S3
- Building and maintaining streaming data infrastructure (Kafka, Flink, or similar)
- Working across data warehouses and lakehouse formats, including Iceberg and Delta Lake (or lower-level storage abstractions)
- Improving data developer experience, especially for Python-heavy workflows
- Supporting database replication and change data capture pipelines (DMS, Debezium, or similar)
- Deep experience with Spark (Databricks or open-source Spark both count)
- Production experience with Ray Data
- Hands-on ownership of large data pipelines and storage systems
- Comfort debugging performance issues across compute, storage, and networking layers
- Clear thinking about data modeling and long-term maintainability
- Experience running or scaling ClickHouse
- Familiarity with dbt, Dagster, or similar orchestration and modeling tools
-
The Bot Company · We're building a helpful robot for every home. · We're a small team of engineers, designers, and operators based in San Francisco. Our team comes from Tesla, Cruise, OpenAI, Google, Pixar, and many other great companies. In the past we've shipped to hundreds of ...
San Francisco $95,000 - $185,000 (USD) per year2 hours ago
-
About the team: Our infrastructure team helps deliver OpenAI's most capable models and products to the world by scaling infrastructure and turning demand into useful FLOPS. · We collaborate across research, engineering, design, and business to turn cutting-edge AI advancements in ...
San Francisco $255,000 - $405,000 (USD)1 month ago
-
This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. · She will pick the best candidates from Jack's network · The next step is to speak to Jack. · Job Title: Data Infrastructure Engineer · Company Description: - High-performance AI dat ...
San Francisco Full time4 days ago
-
We build infrastructure for Agent Behavior Monitoring (ABM): surfacing silent behavioral issues, · understanding how agents behave in production, · and turning interaction data into actionable signals.Experience building and tuning high-throughput Petabyte-scale data pipelines · ...
San Francisco, CA1 month ago
-
About The Job · Alignerr · connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. We work on real production systems and high-impact research workflows across data, tooling, and infrastructure. · *Position · Senior Rust Full-St ...
San Francisco, CA1 week ago
-
About The JobAlignerr connects top technical experts with leading AI labs to build evaluate and improve next-generation models. We work on real production systems and high-impact research workflows across data tooling and infrastructure. · ...
San Francisco $50 - $75 (USD)2 weeks ago
-
About the Team · Our infrastructure team helps deliver OpenAI's most capable models and products to the world by scaling infrastructure and turning demand into useful FLOPS. We collaborate across research, engineering, design, and business to turn cutting-edge AI advancements int ...
San Francisco1 week ago
-
· About the Role · Together AI is hiring a Infrastructure engineer to own and operate the data platform that powers our rapidly scaling data platforms. In this role, you will be the primary engineer responsible for defining, building, and maintaining the AWS infrastructure that ...
San Francisco1 week ago
-
About Together AI is hiring an Infrastructure engineer for data platforms at San Francisco, CA. · This role involves building AWS infrastructure for data engineering systems and collaborating with multiple teams. · ...
San Francisco, CA2 weeks ago
-
Build the data infrastructure for robots operating in the real world. · Robotics is moving from research labs into production across factories, warehouses, vehicles, and field deployments. When robots fail, behave unexpectedly, or need to be improved, engineers rely on data to un ...
San Francisco20 hours ago
-
Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). While traditional observability focuses on logging exceptions and latency, our ABM surfaces behavioral anomalies such as instruction drifts and context retrieval loss in scaled production environments. · Hun ...
San Francisco20 hours ago
-
Figma's platform helps teams bring ideas to life—whether you're brainstorming, creating a prototype, translating designs into code, or iterating with AI. · The Data Infrastructure team at Figma builds and operates the foundational platforms that power analytics, AI, and data-driv ...
San Francisco $149,000 - $350,000 (USD)1 month ago
-
This role focuses on building and operating data infrastructure that supports massive compute fleets and storage systems designed for high performance and scalability. · You will scale and harden big data compute and storage platforms build and support high-throughput streaming s ...
San Francisco, CA1 month ago
-
Help customers integrate Foxglove seamlessly into their large-scale data ecosystems. · ...
San Francisco $175,000 - $240,000 (USD)1 month ago
-
+ Design build and maintain data infrastructure systems such as distributed compute data orchestration distributed storage streaming infrastructure machine learning infrastructure while ensuring scalability reliability and security + Ensure our data platform can scale by orders o ...
San Francisco $210,000 - $405,000 (USD)1 month ago
-
About the Team · Data Platform at OpenAI owns the foundational data stack powering critical product, research, and analytics workflows. We operate some of the largest Spark compute fleets in production; design, and build data lakes and metadata systems on Iceberg and Delta with a ...
San Francisco $130,000 - $220,000 (USD) per year1 week ago
-
· Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals. · We are scientists, engineers, and buil ...
San Francisco $350,000 - $475,000 (USD) per year1 week ago
-
About Fluidstack · At Fluidstack, we're building the infrastructure for abundant intelligence. We partner with top AI labs, governments, and enterprises - including Mistral, Poolside, Black Forest Labs, Meta, and more - to unlock compute at the speed of light. · We're working wit ...
San Francisco2 hours ago
-
The Workload team is responsible for designing and running OpenAI's LLM training and inference infrastructure that powers frontier models at massive scale. · Have strong engineering fundamentals with experience in distributed systems, data pipelines or infrastructure. · ...
San Francisco $250,000 - $380,000 (USD)1 month ago
-
About the Team · The Workload team is responsible for designing and running OpenAI's LLM training and inference infrastructure that powers frontier models at massive scale. Our systems unify how researchers train and serve models, abstracting away the complexity of performance, p ...
San Francisco $130,000 - $220,000 (USD) per year1 week ago
-
About Persona · Persona is the configurable identity platform built for businesses in a digital-first world. Verifying individuals and organizations is harder — but more important — than ever, with AI enabling fraudsters to launch sophisticated accounts at scale and regulations e ...
San Francisco $130,000 - $220,000 (USD) per year1 week ago
Software Engineer, Data Infrastructure - San Francisco - Anysphere
Description
Software Engineer, Data Infrastructure
You'll build and evolve the data systems that power Cursor's product and internal decision-making. The work is hands-on and high-impact.
This includes:
We're looking for someone who has built real systems at scale and cares about correctness, cost, and ergonomics.
Strong signals include:
Nice to have:
Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering.
Our organization is very flat, and our team is small and talent dense. We particularly like people who are truthseeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.
We're in-person with cozy offices in North Beach, San Francisco and Manhattan, New York, replete with well-stocked libraries.
-
Data Infrastructure
Only for registered members San Francisco
-
Data Scientist, Infrastructure
Only for registered members San Francisco
-
Data Infrastructure Engineer
Full time Only for registered members San Francisco
-
Senior Data Infrastructure
Only for registered members San Francisco, CA
-
Data Infrastructure Engineer
Only for registered members San Francisco, CA
-
Data Infrastructure Engineer
Only for registered members San Francisco
-
Data Scientist, Infrastructure
Only for registered members San Francisco
-
Infrastructure Engineer, Data Platform
Only for registered members San Francisco
-
Infrastructure Engineer, Data Platform
Only for registered members San Francisco, CA
-
Solutions Engineer, Data Infrastructure
Only for registered members San Francisco
-
Senior Data Infrastructure Engineer
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco, CA
-
Solutions Engineer, Data Infrastructure
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco
-
Data Center Infrastructure Architect
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco
-
Software Engineer, Data Infrastructure
Only for registered members San Francisco