Jobs
>
Rockville

    Senior Site Reliability Engineer - Rockville, United States - Donnelley Financial, LLC

    Default job background
    Description
    :

    Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise and data insights at every stage of your business and investment lifecycles. As markets fluctuate, regulations evolve and technology advances, we're there. And through it all, we deliver confidence with the right solutions in moments that matter.

    Summary:


    We are looking for technical team members at all levels who want to push themselves to deliver best in market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.

    The Senior Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SRE's at DFIN take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.

    You either have an infrastructure background with a programmatic, automated mindset or are someone that comes with a software engineering background with infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can operate independently to deliver solutions.

    Responsibilities:

    • Champion and implement a culture of SRE to maintain a high-quality platform infrastructure
    • Champion and implement application and infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
    • Optimize application performance at scale
    • Automate everything including system operational runbooks
    • Define and support continuous integration and deployment pipelines (CI/CD) aligned to branching and quality assurance strategies
    • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
    • Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
    • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
    • Learn continuously and apply lessons learned
    • Evangelize best practices, eliminate bottlenecks, and improve process

    Qualifications:

    • 5+ years experience writing software in any modern software language such as C# .NET, Java
    • 5+ years experience creating automated deployments with tools such as Azure DevOps Pipelines, Ansible, Jenkins or other scripting languages to manage infrastructure, software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
    • 5+ years experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows and Linux environments.
    • 5+ years experience implementing production performance, availability, and scalability monitoring and alerting best practices using a tool such as New Relic, Dynatrace, DataDog or AppDynamics
    • 5+ years experience as a global admin of Azure including cloud cost management
    • 5+ years of experience supporting public client facing revenue generating systems
    • Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology
    • Experiencing monitoring and preventing issues with databases and database queries (SQL, Cosmos) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
    • Experience planning, coordinating, developing and executing all stages of test scripts
    • Experience securing Windows or Linux systems in 24x7 production environment
    • Experience with containerization and managing Kubernetes clusters
    • Experience with common networking, firewall and load balancing protocols
    • BS in Computer Science or equivalent work experience.

    It is the policy of Donnelley Financial Solutions to select, place and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran's status, actual or perceived sexual orientation, genetic information or any other protected status.

    If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access as a result of your disability. You can request a reasonable accommodation by sending an email to



  • JLL Spring, United States Full time

    JLL supports the Whole You, personally and professionally. · Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology to our clients. We are committed to hiring the best, most talented people in our ind ...


  • Donnelley Financial, LLC Rockville, United States

    : · Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise ...


  • Donnelley Financial, LLC Rockville, United States

    : · Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise a ...


  • Donnelley Financial, LLC Rockville, United States

    : · Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise ...


  • The Squires Group Reston, United States

    Our client has an immediate need for a CLEARED Site Reliability Engineer in their Reston, VA location. In this role, you will be responsible for envisioning, designing, coding, validating, and deploying NEW features from the ground up · This direct hire role offers competitive p ...


  • MetroStar Washington, United States

    As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end continuous delivery pipelines and experience in AI/ML. You will also u ...


  • Booz Allen Hamilton Chantilly, United States Full time

    Site Reliability EngineerThe Opportunity: · Do you love finding ways to make systems more efficient? Do you find it impossible to simply maintain when you could improve? Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. ...


  • Booz Allen Hamilton Annapolis Junction, United States Full time

    Site Reliability EngineerThe Opportunity: · Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or software development—if you have a pass ...


  • GEICO Chevy Chase, United States Full time

    Our Senior Site Reliability Engineer Manager is an engineering leader who works with the engineering staff to innovate and build new engineering solutions, improve and enhance existing solutions as well as leverage engineering solutions to solve critical operational problems. A S ...


  • Humana Bethesda, United States

    Humana · Senior Site Reliability Engineer · Austin , · Texas · Apply Now · Become a part of our caring community and help us put health first · The Senior Site Reliability Engineer maintains, integrates and analyzes software applications within the organization. The Senior S ...


  • Geico - Government Employees Insurance Company Chevy Chase, United States

    ?Drive the overall strategy for the organization, aligning it with the organization's business goals and objectives ?Provide thought leadership in the organization, staying ahead of industry trends and emerging technologies to enhance the reliability Engineer, Liability, Reliabil ...


  • Teaching Strategies Bethesda, United States

    Be a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early childhood education market, we build ...


  • Marriott International Bethesda, United States

    Job Number · Job Category Information Technology · Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States VIEW ON MAP · Schedule Full-Time · Located Remotely? Y · Relocation? N · Position Type Management · JOB SUMMARY · Lead role in the Moni ...


  • Marriott Bethesda, United States

    Job DescriptionJOB SUMMARYLead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops solutions ...


  • Teaching Strategies, LLC Bethesda, United States

    Job Description · Job DescriptionBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early child ...


  • Marriott Bethesda, United States

    Job Description · JOB SUMMARY · Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops so ...


  • Teaching Strategies, LLC Bethesda, United States

    Job Description · Job DescriptionBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early child ...


  • Alta It Services Reston, United States

    Site Reliability Engineering (SRE) Lead · 100% Remote · W2 ONLY · US Citizenship required per government contract · Must be able to obtain a DHS Public Trust clearance · As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end u ...


  • System One Holdings, LLC Reston, United States

    Site Reliability Engineering (SRE) Lead · 100% Remote · W2 ONLY · US Citizenship required per government contract · Must be able to obtain a DHS Public Trust clearance · As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end user ...


  • Red Gate Group Reston, United States

    Job Description · Do you have the following skills, experience and drive to succeed in this role Find out below. · The Red Gate Group is seeking a TS/SCI cleared Site Reliability Engineer to support the Defense Threat Reduction Agency (DTRA) in Reston, VA. · As a Site Reliabilit ...