Software Engineer, Triton Compiler - San Francisco, CA

Only for registered members San Francisco, CA, United States

1 month ago

Default job background

Job summary

We are an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products.

Responsibilities

  • Designing and optimizing core ML systems
  • writing highly reliable low-level code
  • advancing the algorithms and infrastructure that power our models

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Only for registered members San Francisco

    As a Software Engineer, you will help build AI systems that achieve levels of performance that were previously impossible. · ...

  • Only for registered members San Francisco

    We are looking for engineers who are self-directed, comfortable in ambiguous technical spaces, and excited to identify and solve the highest-leverage problems. · We partner closely with the hardware team to unlock new capabilities and ensure our custom silicon can support the nex ...

  • Only for registered members California City, California, United States

    Implement and optimize compiler for vector-related components in AI chips. · Implement and optimize AI heterogeneous computing framework. · ...

  • Only for registered members San Francisco $275,000 - $315,000 (USD)

    +We are building a Kernel Optimizer that automatically transforms code into its most efficient form, combined with Tensor Cloud for adaptive, cross-cloud compute and Emma Lang, a new programming language for high-performance, hardware-aware computation. · +Design and implement RL ...

  • Only for registered members San Francisco $285,000 - $315,000 (USD)

    We are building a Kernel Optimizer that automatically transforms code into its most efficient form combined with Tensor Cloud for adaptive cross-cloud compute and Emma Lang a new programming language for high-performance hardware-aware computation together these technologies rein ...

  • Only for registered members San Francisco, California, US

    We are building a Kernel Optimizer that automatically transforms code into its most efficient form. · ...

  • Only for registered members San Francisco Full time $120,000 - $250,000 (USD)

    We are seeking a highly skilled AI Optimization Engineer to optimize training and inference workloads on compute accelerators and clusters through the Lightning Thunder compiler and the broader PyTorch Lightning ecosystem. · We have offices in New York City , San Francisco ,and L ...

  • Only for registered members San Francisco, CA

    We're looking for a performance engineer to squeeze every FLOP out of modern accelerators. You'll write the kernels and low-level optimizations that make vLLM the fastest inference engine in the world. · Design and implement high-performance kernels for attention, GEMM, sampling, ...

  • Only for registered members San Francisco, California, United States

    We are seeking a highly skilled AI Optimization Leader to work on optimizing training and inference workloads on compute accelerators and clusters, through the Lightning stack, including the Thunder compiler and the broader PyTorch Lightning ecosystem. · ...

  • Only for registered members San Francisco

    We are seeking a highly skilled GPU Kernel Engineer who is passionate about pushing the limits of performance on modern accelerators. · In this role, you will design and optimize custom GPU kernels that power next-generation large-scale AI systems. · ...

  • Only for registered members San Francisco $200,000 - $400,000 (USD)

    We're looking for a performance engineer to squeeze every FLOP out of modern accelerators. · You'll write the kernels and low-level optimizations that make vLLM the fastest inference engine in the world. Your code will run on hundreds of accelerator types, · from NVIDIA GPUs to e ...

  • Only for registered members San Francisco Full time $200,000 - $400,000 (USD)

    We're looking for a performance engineer to squeeze every FLOP out of modern accelerators. · You'll write kernels and low-level optimizations that make vLLM the fastest inference engine in the world. · Your code will run on hundreds of accelerator types from NVIDIA GPUs to emergi ...

  • Only for registered members San Francisco

    fal is pioneering the next generation of generative-media infrastructure. We're pushing the boundaries of model inference performance to power seamless creative experiences at unprecedented scale. · We're looking for a Staff Technical Lead for Inference & ML Performance · ...

  • Only for registered members California, San Francisco, United States Of America

    We are looking for a Machine Learning Infrastructure Engineer to join our AI Infrastructure team. As a key member of this team, you will develop technologies to accelerate the training and inference of computer vision models that power smart spaces and cities. · Optimizing Core M ...

  • Only for registered members San Francisco

    The AI Infrastructure team at Zensors builds the engine that powers our visual sensing platform.We provide the tools to automate the lifecycle of our AI workflow, including model development, evaluation, optimization, deployment, and monitoring across thousands of video streams. ...

  • Only for registered members San Francisco, California, US

    The AI Infrastructure team at Zensors builds the engine that powers our visual sensing platform. We provide the tools to automate the lifecycle of our AI workflow, · We are looking for a Machine Learning Engineer in ML Runtime & Optimization to develop technologies to accelerate ...

  • Runtime Engineer

    4 weeks ago

    Only for registered members San Francisco

    We are building a high-performance, portable compiler that lets developers 'build once, deploy anywhere.' · We care deeply about the impact AI has on our society and planet. · We're talking about seamless cross-platform compatibility, · ...

  • Only for registered members San Francisco

    Weare an AI platform engineering team building large-scale end-to-end AI production pipelines covering model training optimization deployment and real-world applications. · We are seeking an experienced AI model optimization engineer specializing in large model inference accelera ...

  • Only for registered members San Francisco

    +Job summary · We are seeking an experienced AI model optimization engineer specializing in large model inference acceleration. · +Design and optimize large model inference pipelines for low-latency and high-throughput production deployments. · Benchmark and profile deep learning ...

  • Only for registered members San Francisco, CA

    The AI Infrastructure team at Zensors builds the engine that powers our visual sensing platform. We provide the tools to automate the lifecycle of our AI workflow, · Optimizing Core ML Pipelines: · Cross-Stack Collaboration: · Model Acceleration: · Building Efficient Operators: · ...