Infrastructure Lead - Southlake, United States - KTek Resourcing

    Accounting / Finance
    Job Description:

    Who are we looking for?

    An Infrastructure Lead with 8+ years of experience supporting the infrastructure of large-scale distributed systems hosted on on-premises servers and in the cloud.

    Primary Responsibilities:

    Handle infrastructure outages and performance issues effectively, with quick analysis and resolution.

    Troubleshoot infrastructure failures (OS, network, storage, file systems, LDAP, Active Directory, SAML, etc.) that can impact application availability and performance.

    Manage incidents and effectively communicate with users, application owners and senior stakeholders across all areas.

    Implement common infrastructure activities (e.g., patch management, migrations, upgrades, assessments, certificate/password renewals).

    Coordinate application resiliency exercises within the required recovery time objective (RTO), and perform functional and non-functional validations during and after disaster recovery (DR) exercises.

    Actively participate in the change management process with a view to managing risk in the production environment.

    Minimize manual involvement by driving solutions and automation and by implementing continuous improvements to the operating environment, including development and configuration for dynamic monitoring, alerting and recovery.

    Identify and analyze patterns in incidents and problems, conduct flawless post-mortems, develop permanent remediation plans, and implement automation to prevent incidents from recurring.

    Build and improve standard operating procedures (SOPs) for all maintenance activities.

    Challenge the existing infrastructure setup and processes, and suggest different ways to solve problems or improve stability.

    Technical Skills:

    Expertise in Linux server administration (primary) and Windows server administration (secondary).

    Extensive hands-on experience in troubleshooting infrastructure failures (OS, network, storage, file systems, LDAP, Active Directory, SAML, etc.) that can impact application availability and performance.

    Deep expertise in implementing common infrastructure activities (e.g., patch management, migrations, upgrades, assessments, certificate/password renewals).

    Programming: Shell/Python.

    Troubleshooting of web application/service and database failures from an infrastructure perspective.

    Observability stack – AppDynamics, Splunk, ThousandEyes, ITRS or similar tools.

    Experience with common networking protocols and services including TCP, UDP, DNS, DHCP, HTTP, SSH, FTP, SNMP, and LDAP

    Working knowledge of Remedy, Jira, etc.

    Good To Have:

    Fair understanding of CI/CD and DevOps tools.

    Expertise in Google Cloud Platform.

    Configuration Management: SaltStack, Ansible, Puppet, Terraform etc.

    Process Skills:

    Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive

    Interest in designing, analysing and troubleshooting large-scale distributed systems.

    Behavioral Skills:

    Participates as a team member and fosters teamwork by inter-group coordination within the modules of the project.

    Proven ability to develop relationships with stakeholders and to understand detailed business requirements across multiple project initiatives.

    Ability to steer technology direction and provide appropriate training to the team.

    Good attitude and willingness to learn.

    Qualification:

    8+ years of work experience in infrastructure and environment support.

    Educational qualification: B.Tech, BE, BCA, MCA, M.Tech or equivalent technical degree from a reputed college.