Kernel Engineer- GPU - San Francisco

Only for registered members San Francisco, United States

14 hours ago

Default job background
ABOUT BASETEN · Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies op ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    GPU Kernel Engineer

    Only for registered members

    Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from AMD engineers the team is scaling rapid ...

    San Francisco

    14 hours ago

  • Work in company

    GPU Optimization Engineer

    Only for registered members

    We are looking for a GPU Optimization Engineer to join our core engineering team. · The ideal candidate will optimize GPU-accelerated workloads for performance, efficiency and scalability. · ...

    San Francisco

    1 month ago

  • Work in company

    GPU Performance Engineer

    Only for registered members

    We are Genmo a research lab dedicated to building open state-of-the-art models for video generation towards unlocking the right brain of AGI.Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation.We're seeking a GPU Performance Engin ...

    San Francisco

    1 month ago

  • Work in company

    GPU Infrastructure Engineer

    Only for registered members

    Scion Technology has been engaged to conduct a search for an experienced GPU Infrastructure Engineer for our client, an innovative startup leading the future of AI infrastructure. · ...

    San Francisco

    1 month ago

  • Work in company

    GPU Optimization Engineer

    Only for registered members

    Goliath Partners has exclusively partnered with a fast-growing, venture-backed AI company operating at serious scale. · ...

    San Francisco

    1 month ago

  • Work in company

    GPU Optimization Engineer

    Only for registered members

    · Job Description · GPU Optimisation Engineer — Real-Time Inference · Want to push GPU performance to its limits — not in theory, but in production systems handling real-time speech and multimodal workloads? · This team is building low-latency AI systems where milliseconds actua ...

    San Francisco, CA

    14 hours ago

  • Work in company

    GPU Kernel Engineer

    Only for registered members

    We are building next-generation multimodal foundation models and a highly optimized training and serving platform. The team is looking for a GPU Kernel Engineer to push the limits of performance on modern accelerators and help power large-scale AI systems. · Custom Kernel Develop ...

    San Francisco, CA

    1 month ago

  • Work in company

    Engineer - GPU Optimisation & Inference

    Only for registered members

    We are building low-latency AI systems where milliseconds actually matter. · ...

    San Francisco, CA

    3 weeks ago

  • Work in company

    Engineer - GPU Optimisation & Inference

    Only for registered members

    Job summary · Want to push GPU performance to its limits — not in theory, but in production systems handling real-time speech and multimodal workloads? · This team is building low-latency AI systems where milliseconds actually matter.The target isn't "faster than baseline." It's ...

    San Francisco

    1 month ago

  • Work in company

    Senior GPU Optimisation Engineer

    Only for registered members

    We're hiring a GPU Optimization Engineer who understands GPUs at a deep architectural level — someone who knows exactly how to squeeze every last millisecond out of a model, · Optimize model architectures (ASR, TTS, SLMs) for maximum performance on specific GPU hardware · Benchma ...

    San Francisco, CA

    1 month ago

  • Work in company

    System Engineer, GPU Fleet

    Only for registered members

    About Fluidstack · At Fluidstack, we're building the infrastructure for abundant intelligence. We partner with top AI labs, governments, and enterprises - including Mistral, Poolside, Black Forest Labs, Meta, and more - to unlock compute at the speed of light. · We're working wit ...

    San Francisco, CA

    14 hours ago

  • Work in company

    Senior GPU Optimisation Engineer

    Only for registered members

    We're hiring a GPU Optimization Engineer who understands GPUs at a deep architectural level — someone who knows exactly how to squeeze every last millisecond out of a model, · Optimize model architectures (ASR, TTS, SLMs) for maximum performance on specific GPU hardware · Profile ...

    San Francisco $200,000 - $300,000 (USD)

    1 month ago

  • Work in company

    Systems/GPU Research Engineer

    Only for registered members

    +We seek engineers/researchers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of excellent research, coding, and communication skills. · +Develop or extend parallel generic GPU libraries and kernelsHelp design and deploy market-based res ...

    San Francisco, CA

    1 month ago

  • Work in company

    Systems/GPU Research Engineer

    Only for registered members

    About Us' cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity. · We seek engineers/researchers with strong intrinsic drive, a true passion for advancing the state of the ...

    San Francisco $160,000 - $320,000 (USD)

    1 month ago

  • Work in company

    System Engineer, GPU Fleet

    Only for registered members

    + Operate and maintain large-scale GPU server fleet (H100, B200, GB200) supporting AI/ML workloads; · + Perform hands-on troubleshooting and root cause analysis of complex hardware, firmware, OS, and application issues across GPU clusters; · + Develop and maintain automation scri ...

    San Francisco $200,000 - $300,000 (USD)

    1 month ago

  • Work in company

    Software Engineer — GPU Networking

    Only for registered members

    About Baseten · Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies op ...

    San Francisco $150,000 - $250,000 (USD)

    2 weeks ago

  • Work in company

    Senior GPU Optimisation Engineer

    Only for registered members

    We're hiring a · GPU Optimization Engineer who understands GPUs at a deep, · architectural level — someone who knows exactly how to squeeze · every last millisecond out of a model, · what GPU constraints matter,and how to restructure models for real-world inference performance.S ...

    San Francisco Full time

    1 month ago

  • Work in company

    Software Engineer — GPU Networking

    Only for registered members

    ABOUT BASETEN · Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies op ...

    San Francisco $150,000 - $250,000 (USD) Full time

    2 weeks ago

  • Work in company

    GPU Systems Engineer – HPC

    Only for registered members

    We're looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. · You'll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI. · Tech Stack: CUDA/C++, GPGPU, Python, Linux. · ...

    San Francisco, CA

    1 month ago

  • Work in company

    Software Engineer — GPU Networking

    Only for registered members

    · ABOUT BASETEN · Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies ...

    San Francisco

    2 weeks ago