- System/Software Maintenance: Manage software and system updates/upgrades across our AWS infrastructure to maintain security, performance, and uptime.
- DR and HA: Design and implement disaster recovery plans and high availability architectures
- Performance Monitoring and Optimizations: Build and optimize monitoring systems, dashboards, and alerts for proactive issue detection
- DevOps and Outcome minded: Collaborate with development teams to improve observability and minimize MTTR
- Automation and CI/CD: Automate routine tasks and processes to increase efficiency and reliability
- Issue Detection & Resolution: Conduct post-incident reviews, identify root causes, and implement preventative measures.
- Infrastructure Management: Contribute to infrastructure-as-code practices using tools like Terraform.
- 3+ years of experience as a Site Reliability Engineer or similar DevOps role
- Strong proficiency with AWS services (EC2, ELB, RDS, VPC, EKS, IAM etc.)
- Strong containerization and Kubernetes skills, AWS EKS preferred
- Experience with IAC & configuration management tools (e.g. Ansible, Terraform)
- Expertise in monitoring and observability solutions (e.g. DataDog, Prometheus, Grafana, AWS Cloudwatch)
- Skilled in scripting languages (e.g. Python, Bash) for automation
- Understanding of CI/CD practices and Trunk-based development workflows, ideally with Github Actions.
- Excellent troubleshooting and root cause analysis abilities
- Strong communication and collaboration skills
- Detail oriented with a focus on operational excellence
- Impactful Work: Be part of a team that ensures the reliability and performance of critical services.
- Growth Opportunities: Continuous learning and development opportunities in cutting-edge technologies.
- Collaborative Environment: Work with a talented and dedicated team passionate about technology and innovation.
- Flexible Work Environment: Remote work options available to support work-life balance.
- Flexible, portable individual medical plan options with generous employer contributions
- Group dental and vision plan with 99% SureCo paid coverage
- Generous annual reimbursement allowance for employee health-related expenses
- SureCo paid Ancillary benefits such as basic life, short-term and Long-term disability insurances
- Complimentary telemedicine virtual visits and employee assistance program (EAP)
- Concierge Services such as Beneficiary Assist, Travel/ID Protection, College Tuition Rewards, and much more
-
Reliability Engineer
1 week ago
Advanced Micro Devices , Inc. Austin, United StatesOverview: · WHAT YOU DO AT AMD CHANGES EVERYTHING · We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building bloc ...
-
Reliability Engineer
2 weeks ago
Apple Austin, United StatesReliability Engineer (Mac SoC Package Integration) · Austin,Texas,United States · Hardware · Do you ever wonder what goes into making Apple products an amazing user experience? Apples innovative reliability team is responsible for insuring that our products exceed our customer ...
-
Reliability Engineer
3 days ago
Apple Austin, United StatesReliability Engineer (Mac SoC Package Integration) · Austin,Texas,United States · Hardware · Do you ever wonder what goes into making Apple products an amazing user experience? Apples innovative reliability team is responsible for insuring that our products exceed our customer ...
-
Site Reliability Engineer
3 weeks ago
Pinnacle Group, Inc. Austin, United StatesResponsibilities · We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apple's internal services as well as services that users directly use. As an ...
-
Site Reliability Engineer
3 weeks ago
APEX Austin, United StatesJob Description* JD: · Skilled at writing clean, high-performant and unit-testable code in Java. · Proficiency with the architecture, deployment, performance tuning, and troubleshooting large scale distributed systems on AWS · Understanding of SRE principles including monitoring ...
-
Equipment Reliability Engineer
4 days ago
Flextronics Austin, United StatesJob Posting Start Date Job Posting End Date Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world.We believe in the power of diversity and inclusion and cultivate a workplace c ...
-
Equipment Reliability Engineer
1 week ago
FLEX Inc Austin, United StatesTo support our extraordinary teams who build great products and contribute to our growth, were looking to add a Equipment Reliability Engineer located in Austin, TX. Reporting to the Engineering Manager, as part of the site engineering team, the Eq Reliability Engineer, Liability ...
-
Site Reliability Engineer
5 days ago
Apple Austin, United StatesSite Reliability Engineer - Ad Platforms · Austin,Texas,United States · Software and Services · At Apple, we work every day to build products that enrich people's lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative and ...
-
Site Reliability Engineer
3 weeks ago
Zenoss Austin, United StatesJob Description · Job DescriptionZenoss is seeking an experienced Site Reliability Engineer (SRE) to join a team of Engineers and Architects creating breakthrough ITOps and AIOps Platform. We are seeking individuals with experience and knowledge in building software and tools to ...
-
Site Reliability Engineer
6 days ago
Virtu Financial Austin, United StatesVirtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around th ...
-
Site Reliability Engineer
2 weeks ago
Virtu Financial Austin, United StatesVirtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around th ...
-
Site Reliability Engineer
3 weeks ago
Pinnacle Group Austin, United StatesResponsibilities · We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apples internal services as well as services that users directly use. As an ...
-
Service Reliability Engineer
3 weeks ago
EOS IT Solutions Austin, United StatesWHO WE ARE: · EOS IT Solutions is a Global Technology and Logistics company, providing Collaboration and Business IT Support services to some of the world's largest industry leaders, delivering forward-thinking solutions based on multi-domain architecture. Customer satisfaction ...
-
Site Reliability Engineer
3 weeks ago
Apple Inc. Austin, United StatesImagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish Join the Apple Service Engineering team as a Sit ...
-
Site Reliability Engineer
1 day ago
Ibm Careers Austin, United StatesPOSITION AVAILABLESite Reliability Engineer, IBM Corporation, Austin, TX: Monitor the health of production and test systems. Respond to production and test system issues and alerts. Execute changes in the production and test environments manually or through automation. Assist the ...
-
Equipment Reliability Engineer
5 days ago
FLEX Inc Austin, United StatesJob Posting Start Date Job Posting End Date · Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world. · We believe in the power of diversity and inclusion and cultivate a work ...
-
Site Reliability Engineer
3 days ago
Strive Works Austin, United StatesThe Role · As a Site Reliability Engineer at Striveworks, you'll have the opportunity to receive mentorship from experienced engineers, as well as bring your own experiences and training. Your ideas are welcome and invited on day one; there is no smartest person at Striveworks. Y ...
-
Reliability Engineer
2 weeks ago
Apple Austin, United StatesSummary · Posted: Apr 9, 2024 · Weekly Hours: 40 · Role Number: · Do you ever wonder what goes into making Apple products an amazing user experience? Apple's innovative reliability team is responsible for insuring that our products exceed our customer's expectations for robu ...
-
Staff Site Reliability Engineer
5 days ago
Procore Technologies Austin, United States Full timeJob Description · What if you could use your technology skills to develop a product that impacts the way communities' hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world yet it's also one of the ...
-
Staff Site Reliability Engineer
2 days ago
Procore Technologies Austin, United StatesLead projects within a small team of Reliability Engineers to continually improve the reliability of Procores services through engineering and process improvement. Collaborate with your peers to envision, design, and develop solutions in your respec Reliability Engineer, Liabilit ...
Site Reliability Engineer - Austin, United States - SureCo Inc
![Default job background](https://contents.bebee.com/public/img/bg-user-ex-1.jpg)
Description
Job Type
Full-time
Description
Job Title: Site Reliability Engineer (SRE)
Location: Remote (comfortable working in the Pacific Time Zone)
SureCo is changing how people in the US take care of their health - in 2020, new regulations went into effect, allowing employers to offer more choice at lower cost for employee health benefits, and SureCo is at the vanguard of that change. We are a startup providing a SaaS platform letting brokers, employers, and employees to shop for the services they want to support the health of their families.
As we grow, we're hiring an SRE to join our infra team - come join the team ensuring our cloud infrastructure scales are secure and performing. You'll collaborate with other infra engineers as well as application developers & internal stakeholders to drive our observability / HA / DR / capacity management, and otherwise help ensure our users can utilize our platform - driving down MTTR and driving up early detection of issues before they affect users.
We are a diverse and collaborative team, eager to support each other's growth and success. If you're enthusiastic about our mission and have unique strengths to contribute, even if you don't meet every qualification, we encourage you to apply.
Responsibilities
This position is open to fully remote candidates, with a preference for individuals who can align with the US/Pacific time zone working hours .
SureCo is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, gender expression, national origin, protected veteran status, or any other basis protected by applicable law and will not be discriminated against on the basis of disability