Jobs
>
Roseland

    Site Reliability Engineer, Observability - Roseland, United States - CoreWeave

    CoreWeave
    CoreWeave Roseland, United States

    3 weeks ago

    Default job background
    Description

    CoreWeave is a specialized cloud provider, delivering a massive scale of GPU compute resources on top of the industry's fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute intensive use cases - VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming - that are up to 35 times faster and 80% less expensive than the large, generalized public clouds. Learn more at

    About the role:

    The Observability Team performs a critical role in enabling CoreWeave to understand, troubleshoot, and optimize complex systems by providing comprehensive insights into their behavior and performance. This team is responsible for the development, integration, and operation of observability platforms with the ultimate objective of enabling engineers across CoreWeave to do more, better. Central to the Observability Teams mission is the operation of our observability stack which leverages CoreWeave's deep investment in the Kubernetes ecosystem.

    We are seeking a Site Reliability Engineer with specialization in the observability stack who can help us execute on the mission of providing a comprehensive logging and metrics ecosystem. Integrating logging, metrics, tracing, and monitoring tools for proactive insights into system performance. This individual will work with a team of 6-8 engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Observability Team you will have the opportunity to:

    • Design and implement the platform that improves visibility into how the services are performing and operating.
    • Improve the performance, security, reliability, and scalability of our observability, and related services and participate in the teams on-call rotation.
    • Assist engineers in maximizing the observability stack to gain insights into the service's functionality and operation.
    • Develop dashboards, alerts, and insights into the customer experience using Grafana-ecosystem tools such as Mimir and Loki.
    • Develop meaningful insights by analyzing the gathered data.
    • Enable and evangelize the best practices around alerting. Collaborate with teams to establish observability standards.
    • Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.
    Wondering if you're a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams - even if you aren't a 100% skill or experience match. Here are some qualities we've found compatible with our team. If a portion of this resonates with you, we'd love to talk.
    • You have one or more years of experience in a software or infrastructure engineering industry.
    • You enjoy helping your colleagues achieve more with less effort.
    • You have experience operating services in production and at scale and are versed in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.
    • You're familiar with various logging and metrics systems like ELK, Victoria Metrics, Thanos or Grafana.
    • You enjoy understanding the data model for observability systems.
    • You're familiar with Kubernetes and have interest or experience with using it for event-driven and/or stateful orchestration.
    • You're comfortable with the idea of using Go as your primary programming language.
    • You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.
    • You're excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.
    Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $160,000/year in our lowest geographic market up to $185,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

    Hybrid Workplace

    Successful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.

    If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences

    Why CoreWeave?

    At CoreWeave, we work hard, have fun, and move fast We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:
    • Be Curious at your Core
    • Act like an Owner
    • Empower Employees
    • Deliver Best In-Class Client Experience
    • Achieve More Together
    We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us

    Benefits

    We offer a competitive salary and benefits, including:
    • Medical, dental and vision insurance - 100% paid for the employee
    • Company paid Life Insurance
    • Voluntary supplemental life insurance
    • Short and long-term disability insurance
    • Flexible Spending Account
    • Tuition Reimbursement
    • Mental Wellness Benefits through Spring Health
    • Family-Forming support provided by Carrot
    • Paid Parental Leave
    • Flexible, full-service childcare support with Kinside
    • 401(k) with a generous employer match
    • Flexible PTO
    • Catered lunch each day in our offices
    • Weekly massages in NJ office
    • A casual work environment
    • Work culture focused on innovative disruption
    California Consumer Privacy Act - California applicants only

    CoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.


  • CoreWeave Roseland, United States

    Job Description · Job Description · CoreWeave is a specialized cloud provider, delivering a massive scale of GPU compute resources on top of the industry's fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute intensive use cases — VFX and rende ...


  • Motion Recruitment New York, United States

    Required Skills & Experience · Bachelor's degree/University degree. · 10+ years of engineering of 3rd party software and services in the event management space. · Working knowledge with IBM Tivoli Netcool (NOI) and BigPanda. · Working knowledge of Event Management Rules developme ...


  • Warner Media New York, United States Full time

    Overview: · Warner Bros. Discovery, a premier global media and entertainment company, offers audiences the world's most differentiated and complete portfolio of content, brands, and franchises across television, film, streaming, and gaming. The new company combines WarnerMedia's ...


  • Wells Fargo Summit, United States

    About this role: · Wells Fargo is seeking a Lead Software Engineer to join our Technology organization and be part of our Chief Technology Office (CTO) team. Learn more about the career areas and business divisions at . · You will be a hands-on Cloud Observability expert and be ...


  • Johnson & Johnson Somerville, United States

    Job Description · As the Network Observability Engineer, you will play a crucial role in crafting, implementing, and maintaining network monitoring, observability and management tools to ensure the stability and performance of our network infrastructure. You will work closely wit ...


  • Elastic New York, United States

    Elastic is a free and open search company that powers enterprise search, observability, and security solutions built on one technology stack that can be deployed anywhere. From finding documents to monitoring infrastructure to hunting for threats, Elastic makes data usable in rea ...


  • Wells Fargo New York, United States

    About this role: · Wells Fargo is seeking a Lead Software Engineer to join our Technology organization and be part of our Chief Technology Office (CTO) team. Learn more about the career areas and business divisions at . · You will be a hands-on Cloud Observability expert and be ...


  • Roku New York, United States

    Teamwork makes the stream work. · Roku is changing how the world watches TV · Roku is the #1 TV streaming platform in the US, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform tha ...


  • Experis New York, United States

    Senior Site Reliability/Observability Engineer · Overview · Under the supervision of the Director of Infrastructure, the Senior DevOps Engineer will collaborate with infrastructure developers, architects, and vendors to maintain Site Reliability Engineering Practice. He/she will ...


  • State Street Jersey City, United States Regular

    Who we are looking for · The ideal candidate for this role (Observability - Logs & Metrics) is responsible for working closely with the Application and infrastructure teams to ensure that all aspects of the telemetry from applications, business events, appliances and infrastructu ...


  • Hanover Marriott Whippany, United States

    **Job Type**: · Full-time · **Description**: · Department: Maintenance · **Reports To**: Engineer Supervisor, Chief Engineer · **Supervises**: N/A · **Job Purpose**: To repair and maintain physical structure of hotel using hand tools and power tools. · **Responsibilities**: · - M ...


  • UniFirst Whippany, United States

    Our Team is Kind of a Big Deal · UniFirst is seeking a reliable and hardworking Maintenance Tech I to join our UniFirst community. As a Team Partner in the Maintenance Department, you will handle repairs, maintenance, installation and troubleshooting of industrial equipment, syst ...

  • the Tides Inn

    Guest Service Agent

    2 weeks ago


    the Tides Inn Irvington, United States

    To represent the resort to the guest throughout all stages of the guest's stay. To welcome the guest at check-in and expedite the processing of their account at departure. Upgrade room type when possible to maximize room revenue received by the resort. Provide any assistance to r ...


  • Actalent Parsippany, United States Full time

    Great Opportunity to support new drug development for a growing biopharmaceutical company · Animal Husbandry Technician is responsible for the care and well-being of animals. This position is responsible for various tasks related to animal husbandry, including health assessments, ...

  • corbion

    Facilities Technician

    3 weeks ago


    corbion Totowa, United States

    At Corbion, we exist to champion preservation in all its forms, preserving food and food production, health, and our planet. · **Currently recruiting for 3rd shift 10:00 PM - 6:30 AM.** Facility Technicians are responsible for the operational inspection and upkeep of all faciliti ...

  • Barclays

    Internal Auditor

    1 day ago


    Barclays Whippany, United States

    **Internal Auditor - Technology AVP** · **Whippany, New Jersey** · Barclays Services Corp. · **What will you be doing?** · Barclays Services Corp. seeks Internal Auditor-Technology AVP in Whippany, New Jersey (multiple positions available): · - Support the audit team in the scopi ...

  • Tiffany & Co.

    Analyst - Waving

    1 week ago


    Tiffany & Co. Whippany, United States

    **Position Overview**: · The Waving Analyst analyzes current demand to develop a staff deployment strategy that yields optimal use of budgeted hours and human resources to meet the needs of the business using Engineered Labor Standards. This position helps creates and maintains t ...


  • Quest Diagnostics Clifton, United States

    Overview: · The data analyst with collaborate with corporate medical and corporate compliance in assigning standard codes (order and result/LOINC codes) to all BU databases, in order to establish and maintain corporate wide code mapping system. · **Responsibilities**: · The data ...

  • Brown & Root

    Railcar Switchman

    1 week ago


    Brown & Root Morris, United States

    JOB REQUIREMENTS · - Work within precise limits or standards of accuracy. · - Plan work and select proper tools. · - Compare and see differences in the size, shape and form of lines, figures, and objects. · - Visualize objects in three dimensions from plans and drawings. · - Make ...


  • DYNAMIC MANUFACTURING INC Hillside, United States

    Dynamic Manufacturing has** **2** **Lead CNC Set-Up Technician/Operator openings** (2nd and 3rd shift) for someone hands-on with an experienced level manufacturing technical skill to work in day to day production at one of Dynamic's remanufacturing facilities. These positions are ...