Member of Technical Staff, Software Co-Design AI HPC Systems - Mountain View
4 days ago

Job summary
We operate at the intersection of models, systems software, networking storage artificial intelligence hardware optimizing end-to-end performance efficiency reliability cost.We pursue this mission through deep hardware–software co-design combining rigorous systems thinking with hands-on engineering.
This role sits at the boundary between exploration production working closely internal infrastructure hardware compiler product teams as well as external partners across hardware systems ecosystem.Our operating model emphasizes rapid ideation prototyping followed by disciplined execution drive high-leverage ideas into production systems operate massive scale.
- Lead co-design AI systems across hardware software boundaries spanning accelerators interconnects memory storage runtimes distributed training inference frameworks
- Drive architectural decisions analyzing real workloads identifying bottlenecks compute communication data movement translating findings actionable system hardware requirements
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
+Job summary · The High-Performance Computing (HPC) System Administrator is an expert role responsible for designing configuring optimizing and operating HPC infrastructure. · +ResponsibilitiesManages entire lifecycle of compute nodes including procuring installing configuring ma ...
2 weeks ago
The High-Performance Computing (HPC) System Administrator is an expert role responsible for designing · ...
2 weeks ago
The High-Performance Computing (HPC) System Administrator is an expert role responsible for designing configuring optimizing operating high-performance computing infrastructure. · ...
1 week ago
KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. · HPC server systems are increasingly an essential and enabling component to the performance of advanced equipment supporting the next-generation of semiconductor manufacturing. · As- ...
4 weeks ago
We are looking for an HPC Systems Engineer to join our team. The ideal candidate will have experience in architecting and deploying HPC cluster products used in IC fabs and mask-shops around the world. Responsibilities include identifying and assessing developer requirements, dev ...
3 weeks ago
We are seeking a High Performance Computing Systems Engineer to design and optimize server-class compute clusters for high-performance systems. · ...
1 week ago
We're looking for innovative minds to join our mission of shaping the future of technology. · Design next-generation AI and HPC infrastructure that utilizes advanced memory solutions to optimize performance and power efficiency. · ...
3 weeks ago
· Collaborate with hardware and software teams to optimize end-to-end communication pathways for large-scale distributed training workloads, · ...
3 days ago
We are seeking an HPC Systems Engineer to join our AMD IT compute platforms engineering team. · The Role · Develop, implement, and maintain GPU-based clusters ensuring optimal performance Administer ML/AI platforms Distributed ML services LLMs and AI inferencing by managing deplo ...
1 month ago
+Job summary · Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI. · +ResponsibilitiesCollaborate with hardware and software teams to optimize end-to-end communication pathways for large-scale distributed training ...
2 days ago
We are seeking a full-time HPC/Linux Systems Engineer for a position in San Jose, California or Austin, · Texas. · Required Solid understanding and proven operational experience with compute farms, · job submission/management technologies, cloud, and associated management tools. ...
3 weeks ago
We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. In addition, we need to ensure that the network is running smoothly and meets stringent performance and availability requirements of RDMA workloads. · ...
5 days ago
We are seeking an HPC Systems Engineer to join our AMD IT compute platforms engineering team. · Develop, implement, and maintain GPU-based clusters, ensuring optimal performance · Administer ML/AI platforms - Distributed ML services, LLMs and AI inferencing, by managing deploymen ...
1 month ago
+We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. · ...
1 month ago
We are seeking an HPC Systems Engineer to join our AMD IT compute platforms engineering team. · ...
1 month ago
Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to production in the field? · We are looking for a compute and networking savvy Solution Architect to join the NVIDIA Solution Architecture Engineering (SA) team focus ...
3 weeks ago
NVIDIA is looking for an experienced systems and network infrastructure Solutions Architect. Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to production in the field? We are looking for a compute and networking sa ...
3 weeks ago
NVIDIA is looking for an experienced systems and network infrastructure Solutions Architect to join the NVIDIA Solution Architecture Engineering (SA) team focused on supporting accelerated computing applications. · ...
3 weeks ago
NVIDIA is at the forefront of revolutionizing healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as genomics and personalized medicine. · ...
1 week ago
+ · ++Active member of a multi-disciplinary team to develop solutions for large scale training systems · Responsible for the overall performance of the communication system, including performance benchmarking, monitoring and troubleshooting production issues · Identify potential ...
5 days ago
NVIDIA is at the forefront of revolutionizing healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as genomics and personalized medicine. · ...
1 week ago