    Senior Principal Software Engineer - Santa Clara, United States - Oracle

    Job Description

    Cloud Engineering Infrastructure Development

    The Oracle Cloud Infrastructure (OCI) Cluster Networking team is building the ultra-high-performance network required to support AI/ML/HPC workloads.

    This is your opportunity to join the AI revolution and design systems that allow customers to scale from tens to thousands of GPUs without compromising on performance.


    This team will be responsible for designing, developing, and performance-tuning the networking stack required to run distributed AI/ML/HPC workloads across thousands of GPUs, leveraging technologies like RoCE or InfiniBand.

    This is your opportunity to build innovative solutions for our customers from the ground up. These are exciting times and our team is still young and growing fast, working on ambitious new initiatives. We are looking for adaptable, self-motivated engineers with the ability to learn quickly.

    You should be both a rock-solid developer and a distributed systems generalist, able to dive deep into any part of the stack and low-level systems, as well as design broad distributed system interactions.

    You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn.

    Career Level -Responsibilities


    Required Qualifications:
    10+ years of experience with software (systems/application) development

    3+ years of experience with RDMA over InfiniBand networks (including setup, troubleshooting, tuning, and scaling)

    3+ years of experience with collective communications libraries like NCCL, RCCL, MPI and GPU frameworks like CUDA and ROCm

    Proficient with data structures, algorithms, and operating systems

    Excellent organizational, verbal, and written communication skills

    Bachelor's degree in Computer Science and Engineering or related engineering fields


    Preferred Qualifications:
    Master's or PhD degree in Computer Science or related engineering fields

    Experience with distributed workload managers like Slurm or K8s

    Experience with ML training frameworks like PyTorch, TensorFlow

    Experience with Linux performance tools

    Experience in SDN, NFV, Cloud Networking

    Experience in Infrastructure-as-a-Service, viz. OpenStack, AWS, GCP, Azure


    Disclaimer:


    Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

    Range and benefit information provided in this posting is specific to the stated locations only.

    US:

    Hiring Range:
    from $96,800 to $251,600 per annum. May be eligible for bonus, equity, and compensation deferral.


    Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.

    Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

    Oracle US offers a comprehensive benefits package which includes the following:

    Medical, dental, and vision insurance, including expert medical opinion

    Short-term disability and long-term disability

    Life insurance and AD&D

    Supplemental life insurance (Employee/Spouse/Child)

    Health care and dependent care Flexible Spending Accounts

    Pre-tax commuter and parking benefits

    401(k) Savings and Investment Plan with company match

    Paid time off:
    Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits.

    For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment.

    Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.

    11 paid holidays

    Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.

    Paid parental leave

    Adoption assistance

    Employee Stock Purchase Plan

    Financial planning and group legal

    Voluntary benefits including auto, homeowner and pet insurance


    The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

    About Us

    As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's problems. True innovation starts with diverse perspectives and various abilities and backgrounds.

    When everyone's voice is heard, we're inspired to go beyond what's been done before. It's why we're committed to expanding our inclusive workforce that promotes diverse insights and perspectives.


    We've partnered with industry leaders in almost every sector, and we continue to thrive after 40+ years of change by operating with integrity.

    Oracle careers open the door to global opportunities where work-life balance flourishes. We offer a highly competitive suite of employee benefits designed on the principles of parity and consistency. We put our people first with flexible medical, life insurance and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

    We're committed to including people with disabilities at all stages of the employment process.

    If you require accessibility assistance or accommodation for a disability at any point, let us know by calling , option one.


    Disclaimer:
    Oracle is an Equal Employment Opportunity Employer*.

    All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law.

    Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.


    * Which includes being a United States Affirmative Action Employer


  • Johnson & Johnson Santa Clara, United States

    Johnson & Johnson MedTech is recruiting for a Manager Software Engineering, GUI located in Santa Clara, CA. This position is based in Santa Clara, CA and may require up to 10% travel. (NOT REMOTE) · Johnson & Johnson MedTech innovates at the intersection of biology and technology ...

  • AMD

    Software Engineer

    39 minutes ago


    AMD Santa Clara, United States Full time

    · WHAT YOU DO AT AMD CHANGES EVERYTHING · We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for th ...

  • Ordr, Inc.

    Software Engineer

    1 week ago


    Ordr, Inc. Santa Clara, United States

    ENGINEERING · Ordr, Inc. in Santa Clara, CA seeks Software Engineer: Meet w/ sales/mktg teams to gather new reqs from the customer. Review & analyze the reqs within the engineering team. Part-time tele-commuting allowed. $283,442/ yr. Email res (must reference Job Code #43010) to ...

  • Comrise

    Software Engineer

    6 days ago


    Comrise San Jose, United States

    Position: Software Engineer (Backend) · Location: San Jose, CA (Hybrid) · Duration: 6+ Month · Job Responsibilities: · Design new experiences for Client sellers and advertisers to promote their products, manage their advertising campaigns, and run their businesses · Build highly ...


  • Tech Firefly Santa Clara, United States

    We are offering excellent opportunities for Senior Software Engineers · Applicants are required to be eligible to lawfully work in the U.S. immediately; employer will not transfer or sponsor applicants for U.S. work authorization (such as an H-1B visa) for this opportunity. · Dir ...


  • NVIDIA Santa Clara, United States Full time

    We are the GPU Communications Libraries and Networking team at NVIDIA. We deliver communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. DL and HPC applications have a huge compute demand already and run on scales which go up to tens of thousands of GPUs. The ...


  • Palo Alto Networks Santa Clara, United States

    Our Mission · At Palo Alto Networks everything starts and ends with our mission: · Being the cybersecurity partner of choice, protecting our digital way of life. · Our vision is a world where each day is safer and more secure than the one before. We are a company built on the fou ...

  • Stellar Consulting Solutions, LLC

    Software Engineer

    3 weeks ago


    Stellar Consulting Solutions, LLC San Jose, United States

    Role Overview: As an Open Source Software Expert, you will play a pivotal role in managing and contributing to the development of our open-source projects. You will evaluate and implement new OSS technologies, collaborate with internal teams to ensure successful implementation, p ...


  • Palo Alto Networks Santa Clara, United States

    Our Mission · At Palo Alto Networks everything starts and ends with our mission: · Being the cybersecurity partner of choice, protecting our digital way of life. · Our vision is a world where each day is safer and more secure than the one before. We are a company built on the fou ...


  • Terran Systems Santa Clara, United States

    Terran Systems is in search of a · Senior Software Engineer - JavaScript UI · Requirements: · Ability to work effectively in a team · A strong history of clean, robust, well-documented and creative Front End code · 4+ years experience with JavaScript, HTML5 & CSS3 or other J ...

  • Cisco

    Software Engineer

    10 hours ago


    Cisco San Jose, United States

    What You'll Do · Design, develop, and implement high quality SRE applications and tools for Cisco SDWAN management layer · Collaborate with team members to determine the root cause of problems and the best course of action to resolve problems and avoid them in the future · Maint ...


  • Palo Alto Networks Santa Clara, United States

    Our Mission · At Palo Alto Networks everything starts and ends with our mission: · Being the cybersecurity partner of choice, protecting our digital way of life. · Our vision is a world where each day is safer and more secure than the one before. We are a company built on the fou ...

  • Checkpoint Technologies, LLC

    Software Engineer

    3 weeks ago


    Checkpoint Technologies, LLC San Jose, United States

    Job Description · Checkpoint Technologies, LLC, located in San Jose, CA is the world's leader in non-destructive optical probing for semiconductor failure analysis. Our tools combine advanced laser scanning (LSM) and photon emission (PEM) techniques with state of a ...


  • EVONA Santa Clara, United States

    Senior Embedded Software Engineer · Santa Clara, California · My client builds, and operates a diverse range of small satellite systems supporting space-based turnkey missions for several business applications, including earth observation, communications, in-orbit demonstration ...


  • Palo Alto Networks Santa Clara, United States Full time

    Job Description · Your Career · The Cortex Vulnerability Management Scanning team is expanding, and we're looking for a Senior Software Engineer to join our team. This team builds the software that provides our customers visibility into their behind-the-firewall attack surface, a ...


  • Petlibro Santa Clara, United States

    About Petlibro · Petlibro is a design thinking company creating products that nurture the intertwined lives of pets & their people. We launched with a philosophy that good design, in form & in function, can make a difference. Petlibro innovates with the latest technology to solve ...


  • Bruker Nano Surfaces & Metrology San Jose, United States

    Sr. Software Engineer · Overview · As one of the world's leading analytical instrumentation companies, Bruker covers a broad spectrum of advanced solutions in all fields of research and development. All our systems and instruments are designed to improve safety of products, to ac ...


  • NVIDIA Santa Clara, United States

    NVIDIA's invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of compute ...

  • Palo Alto Networks

    Sr. Software Engineer

    3 weeks ago


    Palo Alto Networks Santa Clara, United States

    Job Description · Company Description · Our Mission · At Palo Alto Networks everything starts and ends with our mission: · Being the cybersecurity partner of choice, protecting our digital way of life. · Our vision is a world where each day is safer and more secure ...


  • The Argonaut US San Jose, United States

    The Argonaut US, a dynamic and expanding creative agency and digital solutions provider in Silicon Valley, specializes in retail strategy, program development, store planning, visual merchandising, and ongoing store operations for Fortune 500 clients. The Senior Software Engineer ...