Staff Data Scientist - Santa Clara, United States - Cloudera

    Default job background
    Full time
    Description

    Business Area:

    Engineering

    Seniority Level:

    Mid-Senior level

    Job Description:

    At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world's largest enterprises.

    Cloudera is looking for a Staff Data Scientist to join the machine learning platform team and help drive development of Cloudera's next-generation AI and machine learning platform. You will be responsible for helping design, build, and deliver a platform that not only accelerates machine learning & AI from exploration to production but also enables enterprises to create & deploy Generative AI applications using foundation models with enterprise data at scale. This role requires an empathetic mindset and close collaboration with software engineers, designers, and product management.

    We look for "The Startup Spark", a desire to create new things, dive in wherever there's a need, eagerness to make an impact as an individual and the willingness to learn new things. You must be self-motivated, innovative, and proactive. The role offers significant opportunities for growth.

    As a Staff Data Scientist you will:

    • Work closely with enterprise adopters of Generative AI on Cloudera Machine Learning to help build the leading platform for machine learning & AI in the enterprise.
    • Design, code, and implement elegant, scalable, enterprise-quality AI application services powered by machine learning models.
    • Build strong relationships and collaborate with platform and UI engineers, quality engineers, UX designers, as well as, Product Management, Field Engineering, Professional Services, and other external partners.

    We're excited about you if you have:

    • 7+ years of experience with building machine learning models and adapting foundation models that power AI applications using data science and machine learning tools (Python, Tensorflow, Spark, mlflow, R, etc.).
    • Experience with foundation models, prompt engineering, fine-tuning, semantic search and Retrieval-Augmented Generation (RAG) using vector databases such as Pinecone, Milvus, etc.
    • Experience with Generative AI frameworks (LangChain, Guidance, NeMo etc.).
    • Experience building and deploying Generative AI applications.
    • Experience building scalable, robust and secure Enterprise applications.
    • Self-driven and motivated, with a strong sense of ownership and craftsmanship.
    • Strong written and verbal communication skills.

    You may also have:

    • Experience with AI/ML orchestration & serving software such as Kubeflow, KServe, Knative, vLLM, Nvidia Triton Inference Server, etc.
    • Experience with cloud technologies (AWS, GCE, Azure).
    • Experience using Big Data technologies like Spark, Hive etc.

    What you can expect from us:

    • Generous PTO Policy
    • Support work life balance with Unplugged Days
    • Flexible WFH Policy
    • Mental & Physical Wellness programs
    • Phone and Internet Reimbursement program
    • Access to Continued Career Development
    • Comprehensive Benefits and Competitive Packages
    • Paid Volunteer Time
    • Employee Resource Groups

    Cloudera is an Equal Opportunity / Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.

    #LI-RB1

    #LI-Remote