Staff Hardware Systems Engineer - San Francisco, CA - US
9 hours ago

Job description
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role:
We are seeking a Hardware Production / Sustaining Engineer to strengthen Crusoe's Hardware Systems Engineering team and close critical skill gaps in debugging, validation, and production support of high-performance compute systems. In this role, you will take ownership of the full hardware lifecycle—from prototype bring-up to large-scale production—while driving automation, deep issue resolution, and reliability across Crusoe Cloud's GPU- and CPU-based infrastructure.
You will work closely with cross-functional teams to support, debug, and improve hardware platforms at scale, with a particular focus on PCIe, InfiniBand, and NVMe/storage, which have been identified as essential areas for deeper expertise. Your work will directly impact Crusoe's ability to deploy and operate sustainable, AI-first compute systems with world-class performance and reliability.
What You'll Be Working On:
Drive the full hardware development and sustaining lifecycle, including feasibility, bring-up, validation, deployment, and ongoing production support.
Develop and maintain scripting and automation frameworks for hardware testing, diagnostics, and continuous reliability improvements.
Lead deep troubleshooting and debugging across PCIe (link training, topology, performance issues), InfiniBand (fabric debugging, throughput, connectivity issues), and NVMe/storage (performance bottlenecks, firmware interactions, failure analysis).
Conduct rigorous system validation and characterization for GPU, CPU, and high-performance compute platforms.
Support end-to-end (E2E) integration and solution testing to ensure Crusoe Cloud products meet performance, reliability, and scalability expectations.
Collaborate with mechanical, thermal, firmware, software, and manufacturing teams to resolve system-level issues and enable stable production operation.
Drive prototyping, qualification, and readiness for high-volume manufacturing with both internal teams and external vendors.
Identify opportunities for new hardware technologies, testing methods, and sustainability improvements aligned with Crusoe's long-term objectives.
Provide data-driven insights to influence Crusoe's hardware roadmap and reliability strategy.
What You'll Bring to the Team:
8–10+ years of experience in hardware development, validation, sustaining engineering, or production engineering.
Strong hands-on expertise in PCIe, InfiniBand, and NVMe/storage debugging and development.
Deep proficiency in hardware bring-up, board-level debugging, and system-level validation.
Ability to design and implement automation frameworks for hardware testing (Python, Shell, or similar).
Technical background in digital and analog design, server architecture, and high-performance compute hardware.
Experience working across thermal, mechanical, firmware, and software functions in multidisciplinary environments.
Strong analytical and problem-solving skills with a data-driven approach.
Excellent communication and collaboration skills for working with internal teams and external partners.
Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, or equivalent experience.
Bonus Points:
Experience designing or optimizing GPU-to-GPU communication architectures for AI/ML workloads.
Direct experience integrating NVLink or other next-generation GPU interconnect technologies.
Familiarity with cutting-edge GPU architectures and how to leverage them in AI/HPC environments.
Expertise supporting or designing systems across both ARM and x86 server architectures.
Background in sustainable or energy-efficient hardware design practices.
Advanced certifications or coursework in AI/HPC hardware systems.
Benefits:
Industry-competitive pay
Restricted Stock Units in a fast-growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSAs
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company-paid commuter benefit: $300 per month
Compensation:
Compensation will be paid in the range of $208,000 - $253,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.