
Inference Jobs in United States
25 jobs at Inference in United States
-
A technology company in San Francisco is seeking a Senior Full-Stack Engineer to develop frontend features for its AI platform. Responsibilities include optimizing performance, mentoring team members, and creating user-centric applications. Candidates should have 5+ years of expe ...
San Francisco · 1 week ago
-
We are seeking a Machine Learning Researcher to push the boundaries of what's possible in LLM post-training. · ...
San Francisco · 2 months ago
-
Help us build the systems that train specialized AI models for the fastest-growing companies in the world. · You will be responsible for building and improving the core ML systems that power our custom model training platform, while also applying these systems directly for custom ...
San Francisco · 2 months ago
-
is hiring a Senior Full-Stack (Frontend-Focused) Engineer · Help us build beautiful, performant web experiences that give users super-powers over our globally distributed LLM inference platform. If you love shipping React apps that feel snappy at planet-scale, we'd love to meet y ...
San Francisco · 1 week ago
-
We're hiring a Senior Full-Stack (Frontend-Focused) Engineer to build beautiful, performant web experiences for our globally distributed LLM inference platform. · Own the user experience – design, build, and polish the dashboards, consoles, and customer-facing apps that let users ...
San Francisco · 2 weeks ago
-
We are looking for a Machine Learning Researcher to join our team in San Francisco. You will be responsible for conducting research into experimental models, training systems and modalities to create novel products for our customers. Your work will span from exploring new archite ...
San Francisco, CA · 1 month ago
-
Filmmaker / Storyteller · is seeking a Filmmaker / Storyteller to join our team and help define the narrative of building the world's largest distributed GPU cluster. This role combines creative vision with eye-catching content production, crafting stories that capture the magic ...
San Francisco · 1 week ago
-
Fullstack Engineer · We are seeking a skilled Full-Stack (Frontend-Focused) Engineer to join our team at Inference.net. As a key member of our team, you will play a crucial r ...
San Francisco · 3 days ago
-
We are a well-funded ten-person team of engineers who work in-person in downtown San Francisco on difficult, high-impact engineering problems. Everyone on the team has been writing code for over 10 years, and has founded and run their own software companies. · ...
San Francisco, CA · 1 month ago
-
Help us make inference blazingly fast. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'd love to meet you. · About · trains and hosts specialized language models for ...
San Francisco · 15 hours ago
-
Help us make inference blazingly fast. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'd love to meet you. · Implement and productionize optimization techniques incl ...
San Francisco · 1 month ago
-
We are hiring a Senior Software Engineer to join our team in San Francisco. The ideal candidate will have experience in ML systems, inference optimization, and GPU programming. They will be responsible for making our inference stack as fast and efficient as possible. · The role r ...
San Francisco · 1 month ago
-
Help us push the boundaries of what's possible in LLM post-training. If you love training models, exploring new architectures, running experiments, and turning research insights into products that ship, we'd love to meet you. · ...
San Francisco, CA · 1 month ago
-
Join Our Team to Push the Boundaries of What's Possible in LLM Post-Training · Help us push the boundaries of what's possible in LLM post-training. If you love training models, exploring new architectures, running experiments, and turning research insights into products that ship ...
San Francisco · 15 hours ago
-
· ...
San Francisco · 2 months ago
-
is hiring a Senior Full-Stack (Frontend-Focused) Engineer · Help us build beautiful, performant web experiences that give users super-powers over our globally distributed LLM inference platform. If you love shipping React apps that feel snappy at planet-scale, we'd love to meet y ...
San Francisco · 1 week ago
-
Help us push the boundaries of what's possible in LLM post-training. If you love training models, exploring new architectures, running experiments, and turning research insights into products that ship, we'd love to meet you. · About · trains and hosts specialized language model ...
San Francisco · 1 week ago
-
Transforming AI Systems: A Cutting-Edge Role · Lead custom model training projects from data intake to deployment, utilizing expertise in machine learning engineering and software development. · Design and implement efficient data processing pipelines for large datasets, ensuring ...
San Francisco · 3 days ago
-
Help us make inference blazingly fast. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'd love to meet you. · About · trains and hosts specialized language models for ...
San Francisco · 1 week ago
-
Join our quest to revolutionize language models. If you're passionate about training and fine-tuning AI, we'd love for you to collaborate with us. · We develop cutting-edge solutions that push the boundaries of what's possible in LLM post-training. Our platform leverages advanced ...
San Francisco · 4 days ago