- Passion for reliability and performance, you will own uptime and support all customer-facing services and products
- Own and drive improvements to observability of service performance metrics, monitors, and alerting
- Provision, manage and automate our SaaS platform across multiple production and test environments
- Support and enhance build and release pipelines using process and tooling to provide self-service automations
- Collaborate with development teams on software and platform, helping to identify and remove potential performance bottlenecks
- Help our engineering partners establish SLIs and SLOs for their services
- Participate in the on-call rotation with the team
- Resolve incidents, perform root cause analysis, and grow our library of runbooks
- Implement and automate security controls, governance processes, and compliance validation
- Actively participate in and drive infrastructure architecture decisions
- Mentor junior members of the team
- Occasional domestic travel required for in-person team, department, and company meetings
- Minimum of 8 years experience in a 24x7x365 SaaS production environment
- Build automation and release management experience
- Hands-on experience with Linux and system administration and engineering
- Comfortable in a containerized world of Kubernetes (EKS), helm, and ArgoCD
- Proficiency with configuration management tools such as Ansible, Chef, Salt
- Production experience in operations for an always-up, always-available mission-critical service
- Strong knowledge of ephemeral infrastructure, horizontal scaling, self-healing architectures, service discovery, logging, monitoring and alerting
- Expert level experience with AWS and hybrid cloud systems/designs
- Proficiency with IaC tools such as Terraform and AWS CloudFormation
- Expert understanding and ability to troubleshoot systems at the protocol layer - TCP/IP, UDP, HTTP, SSL/TLS, and DNS
- Proficient with multiple scripting languages such as Bash, Python, or Go
- Experience developing CI/CD pipelines using Jenkins or BitBucket Pipelines
- Knowledge of best-practice security, performance, and networking techniques for high-traffic customer-facing systems
- Experience with monitoring and logging tools such as New Relic or AWS CloudWatch
- Experience with relational and NoSQL databases, including Microsoft SQL, Postgres, and MongoDB
- Strong bias for security posture; experience with PCI compliance is a plus
- Excellent troubleshooting and testing skills
- A passion for learning new technologies
- Experience with Agile methodology and passion for software development best practices
- Strong sense of collaboration, teamwork, and accountability
- Bonus: Experience working for a B2C SaaS company
- Competitive compensation package, including Employee Equity Appreciation Program
- Health insurance benefits
- 401k with employer match
- 100% remote work environment
- Unlimited paid time off (which includes paid holidays and Winter Break)
- Paid parental leave
- Tuition assistance and Professional development and growth opportunities
- 100% paid life, short and long term disability insurance
- Pre-tax medical and dependent care flexible spending accounts (FSA)
- Voluntary life and critical illness insurance
-
Database Reliability Engineer
1 week ago
Teaching Strategies, LLC Bethesda, United StatesJob Description · Job DescriptionBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early child ...
-
Senior Site Reliability Engineer
6 days ago
Humana Bethesda, United StatesHumana · Senior Site Reliability Engineer · Austin , · Texas · Apply Now · Become a part of our caring community and help us put health first · The Senior Site Reliability Engineer maintains, integrates and analyzes software applications within the organization. The Senior S ...
-
Sr. Site Reliability Engineer
3 weeks ago
Marriott International Bethesda, United StatesJob Number · Job Category Information Technology · Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States VIEW ON MAP · Schedule Full-Time · Located Remotely? Y · Relocation? N · Position Type Management · JOB SUMMARY · Lead role in the Moni ...
-
Sr. Site Reliability Engineer
1 week ago
Marriott International Bethesda, United StatesJob Number · Job Category Information Technology · Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States VIEW ON MAP · Schedule Full-Time · Located Remotely? Y · Relocation? N · Position Type Management · JOB SUMMARY · Lead role in the Moni ...
-
Sr. Site Reliability Engineer
3 weeks ago
Marriott Bethesda, United StatesJob Description · JOB SUMMARY · Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops so ...
-
Sr. Site Reliability Engineer
3 days ago
Marriott International Bethesda, United StatesJob Description · JOB SUMMARY · Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops sol ...
-
Principal Site Reliability Engineer
3 weeks ago
Teaching Strategies Bethesda, United StatesBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early childhood education market, we build ...
-
Principal Site Reliability Engineer
3 days ago
Teaching Strategies Bethesda, United StatesBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early childhood education market, we build ...
-
Sr. Site Reliability Engineer
3 weeks ago
Marriott International Bethesda, MD, United StatesJob Number Job Category Information TechnologyLocation Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States VIEW ON MAPSchedule Full-TimeLocated Remotely? YRelocation? NPosition Type ManagementJOB SUMMARYLead role in the Monitoring and Performance M ...
-
Sr. Site Reliability Engineer
2 weeks ago
Marriott Bethesda, United StatesJob Description · JOB SUMMARY · Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops so ...
-
Sr. Site Reliability Engineer
4 weeks ago
Marriott Bethesda, United StatesJob Description · JOB SUMMARY · Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops so ...
-
Sr. Site Reliability Engineer
1 week ago
Marriott Bethesda, United StatesJob Description · JOB SUMMARY · Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops so ...
-
Sr. Site Reliability Engineer
1 week ago
Marriott Bethesda, United StatesJob DescriptionJOB SUMMARYLead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops solutions ...
-
Senior Site Reliability Engineer
4 days ago
Teaching Strategies Bethesda, United StatesBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early childhood education market, we build ...
-
Sr. Site Reliability Engineer
19 hours ago
Marriott Bethesda, United StatesJob Number · Job Category Information Technology · Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States VIEW ON MAP · Schedule Full-Time · Located Remotely? Y · Relocation? N · Position Type Management · JOB SUMMARY · Lead role in t ...
-
Sr. Site Reliability Engineer
3 weeks ago
Marriott Bethesda, United StatesJob Number Job Category Information TechnologyLocation Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States VIEW ON MAPSchedule Full-TimeLocated Remotely? YRelocation? NPosition Type ManagementJOB SUMMARYLead role in the Monitoring and Performance M ...
-
Sr. Site Reliability Engineer
3 days ago
Marriott Bethesda, United StatesJob DescriptionJOB SUMMARYLead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops solutions ...
-
Principal Site Reliability Engineer
1 week ago
Teaching Strategies, LLC Bethesda, United StatesJob Description · Job DescriptionBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early child ...
-
Senior Site Reliability Engineer
1 week ago
Teaching Strategies, LLC Bethesda, United StatesJob Description · Job DescriptionBe a Part of our Team · Join a working family that is dedicated to the mission of the work we do · Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early child ...
-
Reliability Engineer
3 weeks ago
Saint-Gobain Washington DC, United StatesConsistent with CertainTeed Gypsum Vision, Mission, Values and Objectives, the Reliability Engineer identifies and quantifies Line 1 and Line 2 root cause failure(s), and drives permanent solutions to address systemic or chronic mechanical deficiencies to world class levels of sa ...
Senior Site Reliability Engineer - Bethesda, United States - Teaching Strategies
Description
Be a Part of our TeamJoin a working family that is dedicated to the mission of the work we do
Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early childhood education market, we build dynamic, top-quality digital products that integrate all of the essential elements of a high-quality solution: curriculum, assessment, professional development, and family engagement. We are building a team of results-oriented individuals who will thrive in a collaborative, work-hard/play-hard culture. We pride ourselves on the impact we have on the early childhood field through supporting teachers who are doing the most important work there is, teaching children to become creative, confident thinkers.
Position Overview
Teaching Strategies is looking for a highly talented, innovative, creative, and experienced SRE to join our Infrastructure Engineering group. The Senior SRE Engineer will be an integral member of our Technology department and will work closely with the development, QA and Core Platform Engineering (CPE) teams. The role of Senior SRE Engineer is a hands-on technical role and requires a thorough understanding of all components of a modern web application stack, including front-end, database, networking, and systems engineering. The ideal candidate will bring forward innovative and creative ideas around performance, security, systems design, Infrastructure-as-Code and automation, and CI/CD in order to help us maintain scalable and performant technology solutions for our products. And help advance us towards our vision of a platform provisioned and managed by Infrastructure-as-Code with automations and tooling that empowers and accelerates our engineering teams.
Specific Roles & Responsibilities:
At Teaching Strategies, our solutions and services are only as strong as the teams that create them. By bringing passion, dedication, and creativity to your job every day, there's no telling what you can do and where you can go We provide a competitive compensation and benefits package, flexible work schedules, opportunities to engage with co-workers, access to career advancement and professional development opportunities, and the chance to make a difference in the communities we serve.
Let's open the door to your career at Teaching Strategies
Some additional benefits & perks while working with Teaching Strategies
Teaching Strategies offers our employees a robust suite of benefits and other perks which include: