Member of Technical Staff, Training and Inference - Palo Alto - Boson AI

    Boson AI
    Boson AI Palo Alto

    1 week ago

    Description

    Boson AI is an early-stage startup building large audio models for everyone to enjoy and use. Our founders (Alex Smola, Mu Li) and a team of deep learning, optimization, NLP, and statistics scientists and engineers are working on high-quality generative AI models for language and beyond.

    We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. In this role, you will implement and improve distributed optimization algorithms, performance-tune architectures, improve inference, and help us make deep networks run efficiently on our cluster. The ideal candidate will have a strong background in CUDA, Triton, PyTorch, distributed optimization, and deep learning architectures.

    We encourage you to apply even if you do not believe you meet every single qualification. As long as you are motivated to learn and join the development of foundation models, we'd love to chat.

    Responsibilities

    • Optimize model architectures and loss objectives to handle combinations of images, video, text, speech, and audio data.
    • Implement and optimize kernels for efficient training on Hopper and Blackwell GPUs.
    • Performance optimization (floating point formats, sparsity, systems-level optimization).
    • Distributed optimization and training.

    You may be a good fit if you have

    • Experience in writing clean and efficient code; a Master's or doctoral degree in computer science or equivalent.
    • Proficiency in at least one deep learning framework, such as PyTorch or JAX.
    • Participation in at least one research project related to distributed training or inference.

    Strong candidates may also have

    • Experience in implementing your own kernels in CUDA or another compiler/toolkit (Triton, ThunderKittens, PTX, etc.).
    • Experience in distributed optimization (e.g. using DeepSpeed, FSDP), ideally designing performance optimizations.
    • Experience in computer networking (e.g. InfiniBand, using SHARP).
    • Experience in handling data at billion scale.

    $150,000 - $600,000 a year

    Total compensation includes base pay, equity, and benefits.


  • As a Staff Software Engineer, ML training and inference infrastructure, you will be a member of the Perception team at Rivian, which develops advanced machine learning algorithms that directly impact safety critical self-driving features of our category defining vehicles. · We ar ...

    Palo Alto, CA

    2 days ago

  • Work in company

    Multimodal Model Training and Inference Optimization Engineer

    We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, · Optimize large model training pipelines to improve efficiency, speed, and scalability. · Develop distributed training strategies s ...

    San Jose $136,800 - $359,720 (USD)

    3 weeks ago

  • Work in company

    Multimodal Model Training and Inference Optimization Engineer

    The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, · and delivering intelligent solutions to ByteDance products. · Optimize large model training pipelines to improve efficiency, speed, and scalability. · Benchmark and prof ...

    San Jose $136,800 - $359,720 (USD)

    3 weeks ago

  • Work in company

    Multimodal Model Training and Inference Optimization Engineer

    We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, ...

    San Jose, CA

    3 weeks ago

  • Work in company

    Multimodal Model Training and Inference Optimization Engineer

    We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, · including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency ...

    San Jose, CA

    3 weeks ago

  • We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, ...

    San Jose $208,800 - $438,000 (USD)

    3 weeks ago

  • We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference. · Optimize large model training pipelines to improve efficiency, speed, and scalability. · Benchmark and profile deep learning ...

    San Jose, CA

    3 weeks ago

  • We are seeking an experienced Multimodal Model Training Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference. · Optimize large model training pipelines to improve efficiency, speed, and scalability. · Benchmark and ...

    San Jose, CA

    3 weeks ago

  • We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. · The ideal candidate will work at the cutting edge of AI efficienc ...

    San Jose $208,800 - $438,000 (USD)

    3 weeks ago

  • Work in company

    Global Hardware Sourcing

    · About The Role · Together AI is rapidly scaling its infrastructure, and we need versatile supply chain professionals to help evolve and strengthen our supply chain. In this role, you will develop sourcing strategies for our hardware supply chain with a focus on system integrat ...

    San Francisco

    3 days ago

  • Work in company

    Principal Machine Learning Engineer

    A1 is building a proactive AI system that understands context across conversations, plans actions, and carries work forward over time. You will be responsible for turning research direction into working, production-grade ML systems. · Build and own end-to-end ML pipelines spannin ...

    Palo Alto, CA

    1 month ago

  • Work in company

    LLM Inference Engineer

    We're seeking an experienced LLM Inference Engineer to optimize our large language model (LLM) serving infrastructure. The ideal candidate has: · Extensive hands-on experience with state-of-the-art inference optimization techniques · A track record of deploying efficient, scalable ...

    Palo Alto, CA

    1 month ago

  • Work in company

    Sr. Machine Learning Engineer

    We are recruiting for our Palo Alto office: Machine Learning Engineer. Design, train, and evaluate machine learning models for code understanding and generation using transformer-based architectures. · ...

    Palo Alto, CA

    1 month ago

  • Work in company

    Technical Lead, Machine Learning

    A1 is building a proactive AI system that understands context across conversations and carries work forward over time. · Build and own end-to-end ML pipelines spanning data, training, evaluation, inference, and deployment. · Fine-tune and adapt models using state-of-the-art metho ...

    Palo Alto, CA

    1 month ago

  • Work in company

    Technical Lead, Machine Learning

    About the Role · A1 is building a proactive AI system that understands context across conversations, plans actions, and carries work forward over time. · You will be responsible for turning research direction into working, production-grade ML systems. This role owns the execution ...

    Palo Alto, CA

    1 week ago

  • Work in company

    Research Engineer

    · Headquartered in Silicon Valley, we are a newly established start-up, where a collective of visionary scientists, engineers, and entrepreneurs are dedicated to transforming the landscape of biology and medicine through the power of Generative AI. Our team comprises leading min ...

    Palo Alto, CA $175,000 - $260,000 (USD) per year

    1 week ago

  • Work in company

    Research Engineer

    Headquartered in Silicon Valley, we are a newly established start-up, where a collective of visionary scientists, engineers, and entrepreneurs are dedicated to transforming the landscape of biology and medicine through the power of Generative AI. Our team comprises leading minds ...

    Palo Alto, CA $120,000 - $200,000 (USD) per year

    5 days ago

  • Work in company

    ML Engineer

    About Luma AI · Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will com ...

    Palo Alto, CA $110,000 - $190,000 (USD) per year

    1 week ago

  • Work in company

    ML Performance Engineer

    ML Performance Engineer · Location: Palo Alto (Onsite) · We're hiring an ML Performance Engineer to join a Series B AI lab valued at $1.4B that is building best-in-class image models and products (not a wrapper). They have developed in-house foundation models at ...

    Palo Alto, CA

    1 week ago

  • Work in company

    Principal ML Engineer

    About the role · We're looking for an experienced and forward-thinking Principal Machine Learning Engineer to lead the design and implementation of our end-to-end Machine Learning infrastructure for industry leading Voice AI products. This is a high impact role where you will sha ...

    Palo Alto, CA $220,000 - $350,000 (USD) per year

    1 week ago

  • Work in company

    ML Engineer

    Luma's mission is to build multimodal AI to expand human imagination and capabilities. · ...

    Palo Alto, CA

    1 month ago
