Jobs
>
Baltimore

    Staff Site Reliability Engineer - Baltimore, United States - Platform Science

    Default job background
    Description
    San Diego, CA or Remote + 5% travel to San Diego headquarters for business reasons
    At Platform Science, we're working to connect everything that moves.

    Founded in 2015, we are an open IoT platform that partners with innovative fleets, application developers, vehicle manufacturers, and equipment providers in the transportation industry to deliver revolutionary solutions to supply chain professionals across the globe.

    Our employees are an engaging, diverse group of people who believe in the power of great ideas. We hire people with different experiences and perspectives to build a company culture that fuels growth through innovation.
    We value thoughtful actions and empathy for others.

    We approach challenges with resiliency and creativity, while encouraging transparency because, no matter our backgrounds or responsibilities, we are one team.

    About the Role
    We are looking for a qualified Staff SRE to join our team in San Diego, CA (or remote).

    You will be hired to solve operational problems and provide support to development teams for critical business applications in production.

    Our focus is to ensure reliability in all production services and enable dev teams to be able to measure their reliability to effectively make decisions.

    The SRE team has the unique opportunity to work with all aspects of our platform. We run entirely in the cloud—AWS, Azure and GCP. Our applications and services are containerized and serverless. If you're excited about learning and supporting new technologies and many different types of products (including mobile apps, hardware, websites, messaging queues, serverless pipelines, and more), and working with an incredibly talented team, then this is the position for you
    As a Staff SRE, you have a software development background or systems background with strong coding skills.

    Ideal candidates want to deeply understand how our systems work from the infrastructure level, their dependencies to other systems, to the customer experience, and how to mitigate risk.

    You are comfortable with giving and taking technical direction. You are a great communicator and self-starter who strives to make the company and our technologies better.
    Lead the development and enhancement of Continuous Integration/Continuous Deployment (CI/CD) pipelines, along with refining release management processes and associated toolsets
    Architect and maintain Helm charts to streamline application deployment and management
    Establish standardized observability solutions to empower development teams in efficiently managing their applications
    Lead the effort in promoting and prioritizing reliability, driving achievement of uptime goals and mentoring colleagues in SRE best practices
    Conduct comprehensive Production Readiness Reviews, working with teams to identify and establish Service Level Objectives (SLOs), and ensure high-quality and dependable services
    Design and develop software solutions to address operational challenges effectively to improve system stability and reliability
    Fulfill on-call duties, providing expert support to development teams for mission-critical applications in production environments
    Improve the resiliency of applications and systems using chaos engineering
    Experience
    Possess 9+ years of hands-on experience in SRE or Platform Engineering roles
    Demonstrated expertise (4+ years) with automation technologies like Jenkins, ArgoCD, or similar
    Extensive (3+ years) experience with Kubernetes, Helm, and Docker within production environments
    Proficiency with current software development lifecycle (SDLC) concepts and best practices, CI/CD pipelines, and test-driven development
    Experience with AWS, encompassing proficiency in EKS, IAM, autoscaling, networking, and load balancing/request routing in a production environment
    Proficient in Python, Bash, Nodejs, and/or Go
    Proficient with distributed tracing methodologies and observability tools such as Prometheus, ELK, or Datadog
    Strong emphasis on documentation and fostering knowledge-sharing practices within the team and organization
    Track record of successfully training and mentoring engineers
    Proven expertise in optimizing performance and managing costs within cloud environments
    Sound understanding of SLI/SLO concepts and adherence to SRE best practices
    Platform Science Benefits Highlights
    The company offers various benefits to regular, full-time employees including:
    Medical, dental, and vision insurance
    Short-term and long-term disability insurances
    AD&D and life insurance
    401k plan
    Paid vacation, sick leave and holidays
    Six weeks of paid parental leave
    For more information please see the Benefits Highlights

    brochure for regular, full-time employees.
    This is an exempt role. Our job titles for each posting may span across more than one job level. The estimated base salary for this role is between $145,292 and $227,680.

    The range displayed on each job posting reflects the minimum and maximum target range for new hire base salaries across all US locations.

    Compensation packages are based on many factors unique to each candidate, including but not limited to skill set, work experience, relevant trainings and certifications, business needs, market demands and specific geographical location.

    The base pay range is subject to change and may be modified in the future. This role may also be eligible for bonus, equity, and benefits.


    Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits.

    Platform Science collects your personal information to support its business operations, including for human resources, employment, benefits administration, health and safety, and other business-related purposes as well as to be in legal compliance.

    You can review further details of such collection and use in our Privacy Policy.

    At this time we only consider candidates in these states: AL, AR, AZ, CA, CO, FL, GA, ID, IL, KY, MA, MD, MI, MN, MO, NC, NH, NV, NY, OH, OK, OR, PA, SC, TN, TX, UT, VA, WA, and WI.

    In the future we plan to add more states.
    Join Our Monthly Newsletter

    Stay up-to-date on the latest industry trends, partner and product news, events, and more.

    Receive more information from Platform Science and agree to our Privacy Notice .
    Accreditations

    #J-18808-Ljbffr

  • W. R. Grace

    Reliability Engineer

    21 hours ago


    W. R. Grace Baltimore, United States

    Reliability Engineer · Apply now » · Date:Jun 3, 2024 · Location: Baltimore, MD, US, 21226 · Company: W. R. Grace & Co. · Requisition ID: 22915 · Built on talent, technology, and trust, Grace is a leading global supplier of catalysts and engineered materials. The company's ...


  • The Sotland Group Baltimore, United States

    The role of the Corporate Senior Reliability Engineer (Sr. RE) is to identify and mitigate risks that could adversely affect plant or business operations. The Sr. RE provides leadership and technical expertise in support of asset management strategies and associated practices, to ...


  • The Sotland Group Inc Baltimore, United States

    The role of the Corporate Senior Reliability Engineer (Sr. RE) is to identify and mitigate risks that could adversely affect plant or business operations. The Sr. RE provides leadership and technical expertise in support of asset management strategies and associated practices, to ...


  • Archesys Inc Baltimore, United States

    Job Description · Job Description · Archesys is a technology firm specializing in innovative cloud solutions and services for clients across various industries. We pride ourselves on our cutting-edge technologies, exceptional customer service, and collaborative work environment ...


  • SITEC Consulting LLC Baltimore, United States

    About SITEC · SITEC is an employee and customer focused Information Technology and Professional Services Firm specializing in design, development, and delivery of state-of-the-art technology solutions, as well as cybersecurity, software and systems engineering services. · Summa ...

  • W. R. Grace

    Reliability Engineer

    11 hours ago


    W. R. Grace Curtis Bay, United States

    Requisition ID: 22915 · Built on talent, technology, and trust, Grace is a leading global supplier of catalysts and engineered materials. The company's two industry-leading business segments-Catalysts Technologies and Materials Technologies-provide innovative products, technolog ...


  • Booz Allen Hamilton Baltimore, United States

    Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or · sof tware development—if you have a passion for making systems better, we need yo ...


  • Archesys Inc Baltimore, United States

    Job Description · Job DescriptionArchesys is a technology firm specializing in innovative cloud solutions and services for clients across various industries. We pride ourselves on our cutting-edge technologies, exceptional customer service, and collaborative work environment. We ...


  • Akina Baltimore, United States

    TS/SCI w/Polygraph required · Approved for 60% telework · 06-10-SRE · Description: · DevOps refers to a software development concept that unites and brings together developers and IT staff. The DevOps approach involves consistent, small edits to software coding. This means fr ...


  • Enterprize Software LLC Baltimore, United States

    We are seeking a dedicated Site Reliability Engineer for our Operations & Sustainment Team. Our ideal candidate would be someone with a comprehensive knowledge of IT environments, software applications, and technical frameworks. They should have a proven record of providing Tier ...


  • American Sugar Refining Baltimore, United States

    ASR Group is the world's largest refiner and marketer of cane sugar, with an annual production capacity of more than 6 million tons of sugar. The company produces a full line of grocery, industrial, food service and specialty sweetener products. Across North America, ASR Group ow ...

  • W. R. Grace

    Reliability Engineer

    3 weeks ago


    W. R. Grace Baltimore, United States

    Requisition ID: 22915 · Built on talent, technology, and trust, Grace is a leading global supplier of catalysts and engineered materials. The company's two industry-leading business segments-Catalysts Technologies and Materials Technologies-provide innovative products, technolog ...


  • Clarity Innovations Baltimore, United States

    Clarity Innovations connects human creativity with emerging technology to design, develop, and deploy software that enhances mission success. Our focus is redefining the Government's relationship with technology by encouraging the use of DevSecOps and Agile methodologies, small-t ...


  • Florida Crystals / ASR Group Baltimore, United States

    ASR Group is the world's largest refiner and marketer of cane sugar, with an annual production capacity of more than 6 million tons of sugar. The company produces a full line of grocery, industrial, food service and specialty sweetener products. Across North America, ASR Group ow ...


  • Salesforce Baltimore, United States

    Inc's Candidate Privacy Notice contains more details about the handling and use of the personal data of job applicants. · For more information about our website privacy practices, please see our Privacy Statement. · DevOps/Site Reliability Engineer (SRE) with TS/SCI (on site Nort ...


  • Florida Crystals / ASR Group Baltimore, United States

    ASR Group is the world's largest refiner and marketer of cane sugar, with an annual production capacity of more than 6 million tons of sugar. The company produces a full line of grocery, industrial, food service and specialty sweetener products. Across North America, ASR Group ow ...


  • 00100 LEIDOS, INC. Baltimore, United States Full time

    Leidos is seeking a Senior Kafka Site Reliability Engineer to be part of the mission solution and help lead SSA's Digital Modernization Strategy. Join one of our high performing teams responsible for building the next-generation enterprise APIs and modern applications using dat ...


  • The Sotland Group Inc Baltimore, United States

    The role of the Corporate Senior Reliability Engineer (Sr. RE) is to identify and mitigate risks that could adversely affect plant or business operations. The Sr. RE provides leadership and technical expertise in support of asset management strategies and associated practices, to ...


  • Stanley Black & Decker Towson, United States

    Reliability Engineer Towson MD Hybrid · Hybrid: On-site 3 days a week in Towson MD · This is a long-term temporary position - up to 24 months. · Come build something that matters. · It takes great people to achieve greatness . People with a sense of purpose and integrity. P ...


  • Archesys Inc Baltimore, United States

    Archesys is a technology firm specializing in innovative cloud solutions and services for clients across various industries. We pride ourselves on our cutting-edge technologies, exceptional customer service, and collaborative work environment. We seek a skilled Site Reliability E ...