Member of Technical Staff, Inference - San Francisco

Only for registered members San Francisco, United States

1 month ago

Default job background
Full time $200,000 - $400,000 (USD)
Inferact's mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster. Founded by the creators and core maintainers of vLLM, we sit at the intersection of models and hardware—a position that took years to build. · ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Staff Technical Lead for Inference

    Only for registered members

    · fal is pioneering the next generation of generative-media infrastructure. We're pushing the boundaries of model inference performance to power seamless creative experiences at unprecedented scale. We're looking for a Staff Technical Lead for Inference & ML Performance, someone ...

    San Francisco $150,000 - $200,000 (USD) per year

    5 days ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    Magic's mission is to build safe AGI that accelerates humanity's progress on the world's most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alo ...

    San Francisco $65,000 - $115,000 (USD) per year

    4 days ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    We're looking for an inference runtime engineer to push the boundaries of what's possible in LLM and diffusion model serving. · Optimizing how models execute across diverse hardware and architectures. · ...

    San Francisco $200,000 - $400,000 (USD)

    1 month ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    Magic's mission is to build safe AGI that accelerates humanity's progress on the world's most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alo ...

    San Francisco $225,000 - $550,000 (USD)

    1 week ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    Magic's mission is to build safe AGI that accelerates humanity's progress on the world's most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alo ...

    San Francisco, CA

    1 week ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    Description · Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today's LLMs, with best-in-class quality. · We ar ...

    San Francisco Full time

    2 weeks ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    The Role · We're looking for engineers and scientists to design, optimize, and scale the systems that power our diffusion LLMs in production. Your work will make inference faster, more cost-effective, and more reliable. · Key ResponsibilitiesBuild and optimize high-performance mo ...

    San Francisco $65,000 - $115,000 (USD) per year Full time

    17 hours ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    Magic's mission is to build safe AGI that accelerates humanity's progress on the world's most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alo ...

    San Francisco $225,000 - $550,000 (USD) Full time

    1 week ago

  • Work in company

    Staff Product Manager, Managed Inference

    Only for registered members

    Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens — to power the world's most ambitious AI work ...

    San Francisco, CA - US

    4 days ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    Who are we? · Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that ou ...

    San Francisco

    1 week ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    +We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...

    San Francisco, CA

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    +We are looking for Members of Technical Staff to join the Model Serving team at Cohere. · +5+ years of engineering experience running production infrastructure at a large scale · Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads ...

    San Francisco Full time

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    +We are looking for Members of Technical Staff to join the Model Serving team at Cohere. The team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints. · +Work closely with many teams t ...

    San Francisco

    1 month ago

  • Work in company

    Member of Technical Staff, Inference Infrastructure

    Only for registered members

    Inferact is looking for an infrastructure engineer to build the distributed systems that power inference at global scale. · ...

    San Francisco, CA

    1 month ago

  • Work in company

    Member of Technical Staff, Inference Runtime

    Only for registered members

    inferact's mission is to make vllm the world's inference engine, and accelerate ai by building the systems to run it everywhere. · we're looking for an inference runtime engineer to push the boundaries of what's possible in llm and diffusion model serving. · bachelor's degree or ...

    San Francisco, CA

    1 month ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    This role involves optimizing model inference latency and throughput as well as building reliable and performant production serving systems to serve billions of users. · ...

    Palo Alto $180,000 - $440,000 (USD)

    1 month ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    About xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated and focused on engineering excellence. · This organization is for individuals who appreciate challenging thems ...

    Palo Alto, CA

    1 month ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    · About xAI · xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challengi ...

    Palo Alto, CA; San Francisco, CA

    1 week ago

  • Work in company

    Member of Technical Staff, Applied Inference

    Only for registered members

    About xAI is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. · We operate with a flat organizational structure. · All employees are expected to be hands-on and to contribute directly to the company's mission. · ...

    Palo Alto $180,000 - $440,000 (USD)

    1 month ago

  • Work in company

    Member of Technical Staff, Applied Inference

    Only for registered members

    · About xAI · xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challengi ...

    Palo Alto, CA; San Francisco, CA

    1 week ago