Principal Machine Learning Engineer, Distributed vLLM Inference - Boston, MA

Only for registered members Boston, MA, United States

1 month ago

Default job background
+ Principal Machine Learning Engineer focused on distributed vLLM infrastructure in the llm-d project
+ Develop and maintain distributed inference infrastructure leveraging Kubernetes APIs, operators, and the Gateway Inference Extension API for scalable LLM deployments
+ Create system components in Go and/or Rust to integrate with the vLLM project and manage distributed inference workloads
+, Benefits include comprehensive medical, dental, vision coverage flexible spending account healthcare dependent care health savings account high deductible medical plan retirement 401(k) employer match paid time off holidays paid parental leave plans new parents leave benefits disability paid family medical leave military leave additional benefits employee stock purchase plan family planning reimbursement tuition reimbursement transportation expense account employee assistance program
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Only for registered members Boston Full time $206,600 - $351,050 (USD)

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. We accelerate AI for the enterprise and brings operational simplicity to GenAI deployments. · You would be joining the core team behind 2025's most pop ...

  • Only for registered members Boston

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · At Red Hat we accelerate AI for the enterprise and brings operational simplicity to GenAI deployments. · As leading developers maintainers of state- ...

  • Only for registered members Boston

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. · ...

  • Only for registered members Boston $206,600 - $351,050 (USD)

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · ...

  • Only for registered members Boston Full time $206,600 - $351,050 (USD)

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. As leading developers, maintainers of the vLLM and LLM-D projects, and inventors of state-of-the-art techniques for model quantization and s ...

  • Only for registered members Boston $206,600 - $351,050 (USD)

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · ...

  • Only for registered members Boston, MA

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. As leading developers, maintainers of the vLLM and LLM-D projects, and inventors of state-of-the-art techniques for model quantization and sparsificati ...

  • Only for registered members Boston

    Red Hat is seeking a Principal Machine Learning Engineer to join our AI Inference team. · ...

  • Only for registered members Boston $189,600 - $312,730 (USD)

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D, and vLLM) and ...

  • Only for registered members Boston

    Red Hat is looking for a customer-obsessed developer to join their team as a Forward Deployed Engineer. · ...

  • Only for registered members Boston $170,770 - $281,770 (USD)

    +At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · +The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. · As ...

  • Only for registered members Boston $133,650 - $220,680 (USD)

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. · As l ...

  • Only for registered members Boston $189,600 - $312,730 (USD)

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments.As lead ...

  • Only for registered members Boston, MA

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. · ...

  • Only for registered members Boston, MA

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. · You ...

  • Only for registered members Boston Full time $133,650 - $220,680 (USD)

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · ...

  • Only for registered members Boston $206,600 - $351,050 (USD)

    At Red Hat we believe the future of AI is open. The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. · We provide a stable platform for enterprises to build optimize scale LLM deployments. · ...

  • Only for registered members Boston $189,600 - $312,730 (USD)

    +Job summary · At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise.The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deploym ...

  • Only for registered members Boston Full time $170,770 - $281,770 (USD)

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · The Red Hat AI Inference Engineering team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. · As leading deve ...

  • Only for registered members Boston Full time $206,600 - $351,050 (USD)

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · This role will involve architecting new features for Red Hat AI Inference. · ...