Jobs
>
Austin

    Service Reliability Engineer - Austin, United States - EOS IT Solutions

    Default job background
    Description

    WHO WE ARE:


    EOS IT Solutions is a Global Technology and Logistics company, providing Collaboration and Business IT Support services to some of the world's largest industry leaders, delivering forward-thinking solutions based on multi-domain architecture.

    Customer satisfaction and commitment to superior quality of service are our top business priorities, along with investing in and supporting our partners and employees.


    We are a true International IT provider and are proud to deliver our services through global simplicity with trusted transparency.


    POSITION OVERVIEW:


    EOS IT Solutions is seeking a technically proficient Service Reliability Engineer to join our managed services infrastructure engineering team, supporting advanced collaboration technologies in a fast paced and industry leading environment.

    The ideal candidate is a highly motivated technical enthusiast with a strong foundation in IT, networking, and collaboration technologies, and a passion for continuous learning.

    WHAT YOU'


    LL DO:
    Troubleshoot and resolve technical issues related to collaboration technologies, networking, and infrastructure, utilizing advanced diagnostic tools and techniques

    Perform routine maintenance, upgrades, and patching on collaboration systems, network infrastructure, and associated hardware and software components

    Contribute to the development and implementation of automation solutions using scripting languages like Bash, Python, and industry-standard frameworks to streamline infrastructure management tasks

    Monitor system performance, perform capacity planning, and optimize infrastructure for maximum efficiency and reliability

    Ensure the security and compliance of collaboration systems by implementing and maintaining industry best practices and standards

    Work closely with the Service Delivery Manager and other team members to provide timely technical support to clients, ensuring high-quality service and adherence to SLAs

    Participate in cross-functional projects and collaborate with other teams, such as network, security, and cloud teams, to ensure seamless integration of collaboration solutions

    Maintain up-to-date documentation of technical processes, systems, configurations, and network topologies

    Efficiently handle live production incidents, debug/troubleshoot application and infrastructure issues, follow and implement SRE best practices

    Provide On-Call support as an escalation point for VC infrastructure and network troubleshooting

    Build end-to-end monitoring infrastructure (Logging, Metrics, Tracing) and work closely with the other Production Engineers to provide the right tooling to measure the reliability of our systems

    Collaborate with development and operations teams to ensure availability and reliability of the application and infrastructure

    Work closely with software engineers and QAs to ensure the system is responding properly to non-functional requirements such as performance, security, and availability

    Perform testing and quality assurance around software and hardware used in our environment


    WHAT YOU NEED TO SUCCEED:
    At least 3 years of prior demonstrated experience in a Site Reliability Engineering, DevOps, or an Infrastructure-focused role.

    Linux expertise

    Support of internet-facing production services and distributed systems via deployments, onCall and Incident Management.

    Proficiency in implementing and coordinating telemetry using monitoring and observability tools like Splunk, Grafana, and Prometheus, or similar.

    Experience in solving and resolving issues in VMware from both an operating system and application perspective.

    Understanding of ITIL processes, service management principles, and IT service delivery best practices

    Building and operating container orchestrating systems like VMware.

    Designing, building and maintaining infrastructure with a cloud provider such as AWS.

    Automation advocate - prior history of removing operational toil via software.

    Self motivated, inquisitive and always looking to learn more.

    Familiarity with scripting languages like Bash, Python, or similar, and experience with REST APIs

    Experience with systems automation tools like Chef, Ansible, Terraform, or similar.


    ADDITIONAL REQUIREMENTS:
    Strong foundational knowledge in networking protocols, infrastructure, and troubleshooting techniques, including TCP/IP, DNS, DHCP, VLANs, and routing protocols

    Disaster recovery and capacity planning.

    Strong communication and interpersonal skills, with the ability to work effectively in a team-oriented environment

    Self-motivated and eager to learn new technologies, tools, and methodologies


    EOS is committed to creating a diverse and inclusive work environment and is proud to be an equal opportunity employer.

    We invite you to consider opportunities at EOS regardless of your gender; gender identity; gender reassignment; age; religious or similar philosophical belief; race; national origin; political opinion; sexual orientation; disability; marital or civil partnership status or other non-merit factor.

    #J-18808-Ljbffr


  • Advanced Micro Devices , Inc. Austin, United States

    Overview: · WHAT YOU DO AT AMD CHANGES EVERYTHING · We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building bloc ...

  • Apple

    Reliability Engineer

    2 weeks ago


    Apple Austin, United States

    Reliability Engineer (Mac SoC Package Integration) · Austin,Texas,United States · Hardware · Do you ever wonder what goes into making Apple products an amazing user experience? Apples innovative reliability team is responsible for insuring that our products exceed our customer ...


  • Apple Austin, United States

    Reliability Engineer (Mac SoC Package Integration) · Austin,Texas,United States · Hardware · Do you ever wonder what goes into making Apple products an amazing user experience? Apples innovative reliability team is responsible for insuring that our products exceed our customer ...


  • Pinnacle Group, Inc. Austin, United States

    Responsibilities · We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apple's internal services as well as services that users directly use. As an ...


  • APEX Austin, United States

    Job Description* JD: · Skilled at writing clean, high-performant and unit-testable code in Java. · Proficiency with the architecture, deployment, performance tuning, and troubleshooting large scale distributed systems on AWS · Understanding of SRE principles including monitoring ...


  • Flextronics Austin, United States

    Job Posting Start Date Job Posting End Date Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world.We believe in the power of diversity and inclusion and cultivate a workplace c ...


  • FLEX Inc Austin, United States

    To support our extraordinary teams who build great products and contribute to our growth, were looking to add a Equipment Reliability Engineer located in Austin, TX. Reporting to the Engineering Manager, as part of the site engineering team, the Eq Reliability Engineer, Liability ...


  • Zenoss Austin, United States

    Job Description · Job DescriptionZenoss is seeking an experienced Site Reliability Engineer (SRE) to join a team of Engineers and Architects creating breakthrough ITOps and AIOps Platform. We are seeking individuals with experience and knowledge in building software and tools to ...


  • Pinnacle Group Austin, United States

    Responsibilities · We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apples internal services as well as services that users directly use. As an ...


  • FLEX Inc Austin, United States

    Job Posting Start Date Job Posting End Date · Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world. · We believe in the power of diversity and inclusion and cultivate a work ...


  • Apple Inc. Austin, United States

    Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish Join the Apple Service Engineering team as a Sit ...


  • Ibm Careers Austin, United States

    POSITION AVAILABLESite Reliability Engineer, IBM Corporation, Austin, TX: Monitor the health of production and test systems. Respond to production and test system issues and alerts. Execute changes in the production and test environments manually or through automation. Assist the ...


  • Apple Austin, United States

    Site Reliability Engineer - Ad Platforms · Austin,Texas,United States · Software and Services · At Apple, we work every day to build products that enrich people's lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative and ...


  • Virtu Financial Austin, United States

    Virtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around th ...


  • Virtu Financial Austin, United States

    Virtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around th ...


  • Strive Works Austin, United States

    The Role · As a Site Reliability Engineer at Striveworks, you'll have the opportunity to receive mentorship from experienced engineers, as well as bring your own experiences and training. Your ideas are welcome and invited on day one; there is no smartest person at Striveworks. Y ...

  • Apple

    Reliability Engineer

    2 weeks ago


    Apple Austin, United States

    Summary · Posted: Apr 9, 2024 · Weekly Hours: 40 · Role Number: · Do you ever wonder what goes into making Apple products an amazing user experience? Apple's innovative reliability team is responsible for insuring that our products exceed our customer's expectations for robu ...


  • Procore Technologies Austin, United States Full time

    Job Description · What if you could use your technology skills to develop a product that impacts the way communities' hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world yet it's also one of the ...


  • Teacher Retirement System of Texas Austin, United States

    JOB AD: · Location: · 1000 Red River · Austin, Texas, 78701 · United States · Platform Reliability Engineer OR Platform Reliability Eng Senior -Hybrid · Requisition ID: · req1037 · Employment Type: · Unclassified Regular Part-Time (URP) · Division: · CORE Systems & Ide ...


  • Procore Technologies Austin, United States

    Lead projects within a small team of Reliability Engineers to continually improve the reliability of Procores services through engineering and process improvement. Collaborate with your peers to envision, design, and develop solutions in your respec Reliability Engineer, Liabilit ...