- Training open-source models that outperform GPT-5 for a leading digital insurer
- Specialized, continually learning models as the future of AI
- Training docs overview
- Research we've done
- Design, build, and maintain distributed training infrastructure for large-scale foundation models
- Implement scalable pipelines for fine-tuning and training across heterogeneous GPU/accelerator clusters
- Optimize training performance through techniques like FSDP, DDP, ZeRO, and mixed precision training
- Contribute to frameworks and tooling that make training workflows efficient, reproducible, and developer-friendly
- Collaborate with cross-functional teams (Product, Forward Deployed Engineering, Inference Infra) to ensure training systems meet real-world requirements
- Research and apply emerging techniques in training efficiency and model adaptation, and productionize them in the Baseten platform
- Participate in code reviews, system design discussions, and technical deep dives to maintain a high engineering bar
- Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience
- 5+ years of experience in ML infrastructure, distributed systems, or ML platform engineering, including 2+ years in a tech lead or manager role
- Strong expertise in distributed training frameworks and orchestration (FSDP, DDP, ZeRO, Ray, Kubernetes, Slurm, or similar)
- Hands-on experience building or scaling training infrastructure for LLMs or other foundation models
- Deep understanding of GPU/accelerator hardware utilization, mixed precision training, and scaling efficiency
- Proven ability to lead and mentor technical teams while delivering complex infrastructure projects
- Excellent communication skills, with the ability to bridge technical depth and business needs
- Experience building APIs, SDKs, or developer tools for ML workflows
- Familiarity with cluster management and scheduling (Kubernetes, Ray, Slurm, etc.)
- Knowledge of parameter-efficient fine-tuning methods (LoRA, QLoRA) and evaluation pipelines
- Contributions to open-source distributed training or ML infra projects
- Experience with cloud environments (AWS, GCP, Azure) and container orchestration
- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employee and dependents
- Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day)
- Paid parental leave
- Company-facilitated 401(k)
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
-
Lawrence Harvey has been engaged by a technology organization building the software infrastructure behind complex, real-world hardware systems, supporting advanced robotics, autonomous platforms and high-performance industrial environments. · They are looking for a Software Engin ...
New York2 weeks ago
-
+Principal Software Engineer - (Data Engineering / Java / Python) · This role involves providing engineering excellence within the Corporate Risk Technology team to build secure and scalable technology products. · +ResponsibilitiesArchitect and implement complex, scalable data en ...
New York3 weeks ago
-
The NetApp Keystone team develops cutting-edge technologies that power NetApps pay-as-you-go subscription services, enabling customers to manage data seamlessly on-premises or in the cloud. · ...
New York1 month ago
-
This role will influence the technical direction and optimize customer satisfaction for a collection of internally developed products in addition to architecting and delivering new software solutions with broad impact across the IT and Product organizations.You will work closely ...
New York1 month ago
-
+h2>Job summary · The company has raised more than $150m from top venture capital companies and founders of the most successful companies. · +ul>Engineers own their work end-to-end and are trusted to make a real impact. · ...
New York1 month ago
-
We are seeking a Senior Software Engineer with deep expertise in system and software design to join our team. · ...
New York1 month ago
-
We are seeking an experienced software engineer to join our team at Purple Drive Technologies LLC. · Design and develop scalable systems using Java or Python. · ...
New York1 week ago
-
Job summaryWe are working with a high-performance global trading and technology firm seeking Software Engineer to join their engineering team in New York. · ...
New York1 month ago
-
We're partnering with a profitable climate tech startup that's building software to accelerate the transition to a 100% fossil-free power grid. · ...
New York1 month ago
-
We're seeking elite C++ engineers to join a world-class quantitative systems team focused on building next-generation, low-latency trading infrastructure.This is a rare opportunity to contribute to the greenfield development of systems that redefine speed, scale, and performance ...
New York1 week ago
-
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. · ...
New York $90,000 - $95,000 (USD)2 weeks ago
-
We're looking for a Founding Engineer to help build the core operating system for private neurology practices. · This is a rare opportunity to join at the ground floor and shape both the product and the engineering culture from day one. · ...
New York1 month ago
-
You will join a small team working directly with the founders on a platform with real-world impact. · ...
New York Full time1 month ago
-
+Build an AI-native product enabled by recent LLM advances+ · +Ship fast and see immediate customer impact+ · +Work on a complex, high-impact problem in a $20B+ supply chain+ Caddie is hiring on behalf of an early-stage startup building an AI-powered procurement platform for the ...
New York1 month ago
-
We are excited to meet software engineers who are interested in financial markets and have real impact in a fast-paced environment. · ...
New York1 month ago
-
Pangram Labs is hiring an AI Tooling and Evaluation Engineer to join our AI research team. You'll work directly with AI research scientists and engineers to build internal tools, platforms and systems that power the model development lifecycle. · ...
New York3 weeks ago
-
We're always hiring software engineers for all of our offices. Technology is at the core of how we approach our work, · We also believe in the value of open source software. · ...
New York, New York, United States6 days ago
-
We are hiring Software Engineers to join our team.This is an opportunity to join us in-person as an early employee and make a large impact at a high growth start-up. · ...
New York1 week ago
-
Join a small team working directly with the founders on a platform with real-world impact. · ...
New York1 month ago
-
Handshake seeks experienced Software Engineers for flexible hourly contract work to evaluate AI-generated code. · ...
New York, NY4 days ago
-
We are building the frontier platform that enables a billion non engineers to create software with AI. · ...
New York Full time2 weeks ago
Senior Software Engineer - New York - Baseten
Description
ABOUT BASETENBaseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.
THE ROLE
As a Senior Software Engineer - Model Training at Baseten, you'll be at the forefront of building the infrastructure that powers large-scale training and fine-tuning of foundation models. You'll design and implement distributed training systems, optimize GPU utilization, and create scalable pipelines that enable Baseten and our customers to adapt models efficiently and reliably. This role is highly technical and hands-on: you'll own key components of our training stack, collaborate with product and infra teams to surface customer needs, and push forward the state of the art in scalable training infrastructure.
EXAMPLE WORK:
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Full time Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York, New York, United States
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Software Engineer
Only for registered members New York, NY
-
Software Engineer
Full time Only for registered members New York