Site Reliability Engineer, Enterprise Justice - Plano, United States - Tyler Technologies

    Default job background
    Description

    This Site Reliability Engineer position is a technical role within the Technical and Cloud Services group that helps ensure the reliability, scalability, and performance of our infrastructure while driving automation and efficiency in our development process.

    This engineer provides technical guidance to team members and other development teams related to the Cloud Services initiatives and helps to ensure our Cloud initiatives are consistently improving.

    This role requires a proactive problem-solving mindset, excellent communication skills, and the ability to collaborate effectively with our cross-functional teams that help courts and public safety organizations of all sizes better protect and serve the public.

    By helping provide solutions that improve efficiency and response time, you can help serve our citizens and make communities safer.


    SHIFT EXPECTATION:


    Second Shift: 11 AM - 8 PM CST (with one-hour break)ResponsibilitiesDesign, build, and maintain highly available, scalable, and secure infrastructure components to support our applications and services.

    Implement and maintain monitoring, alerting, and logging systems to proactively identify andresolve issues before they impact users.

    Collaborate with development teams to automate deployment pipelines, infrastructure provisioning, and configuration management using tools like Jenkins, Terraform, and Kubernetes.

    Lead incident response and post-mortem activities to identify root causes and implement preventive measures to minimize future incidents.
    Conduct regular performance tuning and optimization of system resources to ensure optimal efficiency and cost-effectiveness.
    Drive continuous improvement initiatives to streamline processes, enhance reliability, and increase productivity across the organization.
    Stay current with industry trends, best practices, and emerging technologies in SRE, DevOps, cloud computing, and automation.

    QualificationsBS/BA degree in Computer Science, Computer Engineering, Information Systems, or similar field5-7 years of experience in a Site Reliability and/or DevOps role, with a strong understanding ofboth disciplines.

    Proficiency in cloud computing platforms such as AWS, Azure, or GCP, including infrastructure as code (IaC) tools like Terraform.
    Hands-on experience with containerization and orchestration tools such as Docker and Kubernetes.
    Expertise in scripting and programming languages such as PowerShell, Python, Bash, or Go for automation and tooling.
    Strong understanding of database concepts and administration, including T-SQL scripting, indexing, and performance tuning.
    Solid understanding of networking concepts, security best practices, and system administration in Windows and Linux environments.
    Strong analytical and problem-solving skill, with the ability to troubleshoot complex issues anddrive resolution in a timely manner.
    Excellent communication and interpersonal skills, with the ability to collaborate effectively withcross-functional teams and stakeholders.
    Creative problem solving with demonstrated pattern of applying creative solutions to bridge thegap between product development and environmental considerations.


    Required:
    AWSMS SQL Server and/or PostgreSQLWindows Server OSLinux OSPowerShellIISPreferred: Knowledge of .NET Framework and Languages (C#, VB.NET)Experience with monitoring tools, such as DataDog, AWS CloudWatch, and SolarWindsDatabase Performance AnalyzerExperience with PagerDutyKnowledge of Agile Development, with experience with tools such as JIRAKnowledge of Web Development Practices and TechnologiesExperience with ticketing systems, such as Microsoft CRMOctopus DeployHangfireElasticsearchRabbitMQApacheAdvanced Knowledge of Microsoft Hosting Technologies#LI-SB1#LI-HYBRID#J-18808-Ljbffr