- Optimize model architectures and loss objectives to handle combinations of images, video, text, speech, and audio data.
- Implement and optimize kernels for efficient training on Hopper and Blackwell GPUs
- Performance optimization (floating point formats, sparsity, systems level optimization)
- Distributed optimization and training
- Experience in writing clean and efficient code; Master or Doctoral degree in computer science or equivalent.
- Proficiency in at least one deep learning framework, such as PyTorch or JAX.
- Participated in at least 1 research project related to distributed training or inference.
- Experience in implementing your own kernels in CUDA or another compiler/toolkit (Triton, ThunderKittens, PTX, etc.)
- Experience in distributed optimization (e.g. using DeepSpeed, FSDP), ideally designing performance optimizations
- Experience in computer networking (e.g. Infiniband, using SHARP)
- Experience in handling data at billions-scale
- Work in company
Staff Software Engineer, ML Training and Inference Infrastructure
Only for registered members
As a Staff Software Engineer, ML training and inference infrastructure, you will be a member of the Perception team at Rivian, which develops advanced machine learning algorithms that directly impact safety critical self-driving features of our category defining vehicles. · We ar ...
Palo Alto, CA2 days ago
- Work in company
Multimodal Model Training and Inference Optimization Engineer
Only for registered members
We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, · Optimize large model training pipelines to improve efficiency speed scalability. · Develop distributed training strategies s ...
San Jose $136,800 - $359,720 (USD)3 weeks ago
- Work in company
Multimodal Model Training and Inference Optimization Engineer
Only for registered members
The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, · and delivering intelligent solutions to ByteDance products. · Optimize large model training pipelines to improve efficiency,speed,and scalability. · Benchmark and prof ...
San Jose $136,800 - $359,720 (USD)3 weeks ago
- Work in company
Multimodal Model Training and Inference Optimization Engineer
Only for registered members
We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, ...
San Jose, CA3 weeks ago
- Work in company
Multimodal Model Training and Inference Optimization Engineer
Only for registered members
We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, · including distributed training/inference and acceleration.The ideal candidate will work at the cutting edge of AI efficiency ...
San Jose, CA3 weeks ago
- Work in company
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members
We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration.The ideal candidate will work at the cutting edge of AI efficiency, ...
San Jose $208,800 - $438,000 (USD)3 weeks ago
- Work in company
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members
We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference. · Optimize large model training pipelines to improve efficiency, speed, and scalability. · Benchmark and profile deep learning ...
San Jose, CA3 weeks ago
- Work in company
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members
We are seeking an experienced Multimodal Model Training Optimization Engineer with expertise in optimizing AI model training and inference including distributed traininginference. · Optimize large model training pipelines to improve efficiency speedand scalability. · Benchmarkand ...
San Jose, CA3 weeks ago
- Work in company
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members
We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. · The ideal candidate will work at the cutting edge of AI efficienc ...
San Jose $208,800 - $438,000 (USD)3 weeks ago
-
· About The Role · Together AI is rapidly scaling its infrastructure, and we need versatile supply chain professionals to help evolve and strengthen our supply chain. In this role, you will develop sourcing strategies for our hardware supply chain with a focus on system integrat ...
San Francisco3 days ago
-
A1 is building a proactive AI system that understands context across conversations, plans actions, and carries work forward over time. You will be responsible for turning research direction into working, production-grade ML systems. · Build and own end-to-end ML pipelines spannin ...
Palo Alto, CA1 month ago
-
We're seeking an experienced LLM Inference Engineer to optimize our large language model (LLM) serving infrastructure.The ideal candidate has: · Extensive hands-on experience with state-of-the-art inference optimization techniques · A track record of deploying efficient, scalable ...
Palo Alto, CA1 month ago
-
We are recruiting for our Palo Alto office: Machine Learning Engineer. Design, train, and evaluate machine learning models for code understanding and generation using transformer-based architectures. · ...
Palo Alto, CA1 month ago
-
A1 is building a proactive AI system that understands context across conversations and carries work forward over time. · Build and own end-to-end ML pipelines spanning data, training, evaluation, inference, and deployment. · Fine-tune and adapt models using state-of-the-art metho ...
Palo Alto, CA1 month ago
-
About the Role · A1 is building a proactive AI system that understands context across conversations, plans actions, and carries work forward over time. · You will be responsible for turning research direction into working, production-grade ML systems. This role owns the execution ...
Palo Alto, CA1 week ago
-
· Headquartered in Silicon Valley, we are a newly established start-up, where a collective of visionary scientists, engineers, and entrepreneurs are dedicated to transforming the landscape of biology and medicine through the power of Generative AI. Our team comprises leading min ...
Palo Alto, CA $175,000 - $260,000 (USD) per year1 week ago
-
Headquartered in Silicon Valley, we are a newly established start-up, where a collective of visionary scientists, engineers, and entrepreneurs are dedicated to transforming the landscape of biology and medicine through the power of Generative AI. Our team comprises leading minds ...
Palo Alto, CA $120,000 - $200,000 (USD) per year5 days ago
-
About Luma AI · Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will com ...
Palo Alto, CA $110,000 - $190,000 (USD) per year1 week ago
-
ML Performance Engineer · Location: · Palo Alto (Onsite) · We're hiring an · ML Performance Engineer · to join a · Series B AI lab valued at $1.4B · that is building · best-in-class image models and products · (not a wrapper). · They have developed · in-house foundation models at ...
Palo Alto, CA1 week ago
-
About the role · We're looking for an experienced and forward-thinking Principal Machine Learning Engineer to lead the design and implementation of our end-to-end Machine Learning infrastructure for industry leading Voice AI products. This is a high impact role where you will sha ...
Palo Alto, CA $220,000 - $350,000 (USD) per year1 week ago
-
Luma's mission is to build multimodal AI to expand human imagination and capabilities. · ...
Palo Alto, CA1 month ago
Member of Technical Staff, Training and Inference - Palo Alto - Boson AI
Description
Boson AI is an early-stage startup building large audio models for everyone to enjoy and use. Our founders (Alex Smola, Mu Li), and a team of Deep Learning, Optimization, NLP, and Statistics scientists and engineers are working on high quality generative AI models for language and beyond.
We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. As part of your role, you will work on implementing and improving distributed optimization algorithms, performance tune architectures, improve inference and help us make Deep Networks perform efficiently on our cluster. The ideal candidate will possess a strong background in CUDA, Triton, PyTorch, distributed optimization and deep learning architectures.
We encourage you to apply even if you do not believe you meet every single qualification. As long as you are motivated to learn and join the development of foundation models, we'd love to chat.
Responsibilities
You may be a good fit if you have
Strong candidates may also have
$150,000 - $600,000 a year
Total compensations includes base pay, equity, and benefits.
#J-18808-Ljbffr
-
Staff Software Engineer, ML Training and Inference Infrastructure
Only for registered members Palo Alto, CA
-
Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose
-
Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose
-
Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose, CA
-
Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose, CA
-
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose
-
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose, CA
-
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose, CA
-
Sr. Multimodal Model Training and Inference Optimization Engineer
Only for registered members San Jose
-
Global Hardware Sourcing
Only for registered members San Francisco
-
Principal Machine Learning Engineer
Only for registered members Palo Alto, CA
-
LLM Inference Engineer
Only for registered members Palo Alto, CA
-
Sr. Machine Learning Engineer
Only for registered members Palo Alto, CA
-
Technical Lead, Machine Learning
Only for registered members Palo Alto, CA
-
Technical Lead, Machine Learning
Only for registered members Palo Alto, CA
-
Research Engineer
Only for registered members Palo Alto, CA
-
Research Engineer
Only for registered members Palo Alto, CA
-
ML Engineer
Only for registered members Palo Alto, CA
-
ML Performance Engineer
Only for registered members Palo Alto, CA
-
Principal ML Engineer
Only for registered members Palo Alto, CA
-
ML Engineer
Only for registered members Palo Alto, CA