Member of Technical Staff, Training and Inference - Palo Alto - Boson AI

Boson AI Palo Alto

1 week ago

Description

Boson AI is an early-stage startup building large audio models for everyone to enjoy and use. Our founders, Alex Smola and Mu Li, and a team of deep learning, optimization, NLP, and statistics scientists and engineers are working on high-quality generative AI models for language and beyond.

We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. In this role, you will implement and improve distributed optimization algorithms, performance-tune architectures, improve inference, and help us make deep networks run efficiently on our cluster. The ideal candidate will have a strong background in CUDA, Triton, PyTorch, distributed optimization, and deep learning architectures.

We encourage you to apply even if you do not believe you meet every single qualification. As long as you are motivated to learn and join the development of foundation models, we'd love to chat.

Responsibilities

Optimize model architectures and loss objectives to handle combinations of images, video, text, speech, and audio data.
Implement and optimize kernels for efficient training on Hopper and Blackwell GPUs
Performance optimization (floating point formats, sparsity, systems level optimization)
Distributed optimization and training

You may be a good fit if you have

Experience in writing clean and efficient code; Master's or doctoral degree in computer science or equivalent.
Proficiency in at least one deep learning framework, such as PyTorch or JAX.
Participation in at least one research project related to distributed training or inference.

Strong candidates may also have

Experience in implementing your own kernels in CUDA or another compiler/toolkit (Triton, ThunderKittens, PTX, etc.)
Experience in distributed optimization (e.g. using DeepSpeed, FSDP), ideally designing performance optimizations
Experience in computer networking (e.g. InfiniBand, using SHARP)
Experience in handling data at billion scale

$150,000 - $600,000 a year

Total compensation includes base pay, equity, and benefits.


