Deep Learning Scientist, Speech AI - Santa Clara, United States - NVIDIA

    Default job background
    Description


    Widely considered to be one of the technology world's most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization.

    The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

    GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, autonomous cars and conversational AI that can perceive and understand the world.

    Today, we are increasingly known as "the AI computing company." We're looking to grow our company, and build our teams with the most resourceful people in the world.

    Join us at the forefront of technological advancement.


    NVIDIA is looking for Speech Data Scientists to develop high-impact, high-visibility Speech AI product "Riva" & improve the experience of millions of customers.

    If you're creative & passionate about solving real world conversational AI problems, come join our Riva Product engineering team. For more details on Riva check


    What you'll be doing:
    Train Speech Recognition (Acoustic, Language, Punctuation), Speech-to-text translation (AST), and Speech-to-speech (S2S) translation models.
    Measure and benchmark model performance as well as maintain ASR, AST, and S2S model evaluation systems.
    Analyze model accuracy and bias and recommend the next course of action & Improvements.
    Improve processes for speech data processing, augmentation, filtering & Training sets preparation.
    Gather knowhow on speech datasets for training & evaluation.
    Characterize performance and quality metrics across platforms for various speech AI components.
    Collaborate with various teams on new product features and improvements of existing products.
    Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.
    Help innovate, identify problems, recommend solutions and perform triage in a collaborative team environment.
    Lead and mentor junior team members

    What we need to see:


    Master's degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math with 5+ years of experience.

    Native or near-native fluency in a non-English language - Spanish / Mandarin / German / Japanese / Russian / French / UK English / Arabic / Korean / Italian / Portuguese
    Excellent programming skills in Python with strong fundamentals in Programming, optimizations and Software design.

    Strong knowledge of ML/DL techniques, algorithms and tools with exposure to CNN, RNN (LSTM), transformers and transformer decoders and RNN-T, CTC.

    Know-how of Deep learning applications to Speech and NLP.

    Hands-on experience on Speech Technologies like Automatic Speech Recognition, Speech Command detection, Text to Speech, Speaker Recognition and Identification, speaker diarization, Noise robustness Techniques, Voice activity detection, End of utterance detection etc.

    Experience with Training acoustic models and with KenLM, OpenLM and other tools to create Language models.
    Experience with "PyTorch" Deep Learning Frameworks and with leading efforts and/or teams developing Speech AI products
    Exposure to basic speech digital signal processing and feature extraction techniques like FFT, MFCC, Mel Spectrogram, etc.
    General background around version control and code review tools like Git, Gerrit, Gitlab.
    Ways to stand out from the crowd:

    Experience developing end-to-end and unified speech recognition and translation systems for multiple languages
    Strong C++ programming skills.
    Familiarity with GPU based technologies like CUDA, CuDNN and TensorRT
    Background with Dockers and Kubernetes
    Background with deploying machine learning models on data center, cloud, and embedded systems
    Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment.

    With highly competitive salaries and a comprehensive benefits package, we have some of the most forward-thinking and hardworking people in the world working with us and our engineering teams are growing fast in some of the hottest pioneering fields: Deep Learning, Artificial Intelligence, and Large Language Models.

    The base salary range is $176,000 - $333,500. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

    You will also be eligible for equity and

    benefits

    .

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.

    As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    #J-18808-Ljbffr