- Adding a `datadog: true` flag to our `` configuration file. When enabled, this ensures that Datadog agents are installed and configured properly for this cluster.
- Modifying an image build process from x86-only to also support ARM
- Building a tool to provision AWS Managed Prometheus
- Managing infrastructure in production with Terraform, Pulumi, or a similar tool
- Developing Kubernetes components in Golang, like Operators or Admission Controllers
- Independently troubleshooting and resolving issues related to Kubernetes and Cloud infrastructure (AWS, GCP, etc).
-
Reliability Engineer
2 weeks ago
Apple Austin, United StatesSummary · Posted: Apr 9, 2024 · Weekly Hours: 40 · Role Number: · Do you ever wonder what goes into making Apple products an amazing user experience? Apple's innovative reliability team is responsible for insuring that our products exceed our customer's expectations for robu ...
-
Reliability Engineer
2 weeks ago
Apple Austin, United StatesReliability Engineer (Mac SoC Package Integration) · Austin,Texas,United States · Hardware · Do you ever wonder what goes into making Apple products an amazing user experience? Apples innovative reliability team is responsible for insuring that our products exceed our customer ...
-
Reliability Engineer
1 day ago
Apple Austin, United StatesReliability Engineer (Mac SoC Package Integration) · Austin,Texas,United States · Hardware · Do you ever wonder what goes into making Apple products an amazing user experience? Apples innovative reliability team is responsible for insuring that our products exceed our customer ...
-
Site Reliability Engineer
5 days ago
Apple Inc. Austin, United StatesImagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish Join the Apple Service Engineering team as a Sit ...
-
Diverse Lynx Austin, United StatesImmediate need for a talented · Facility & Reliability Engineer . This is a · 12+ Months Contract · opportunity with long-term potential and is located in · Summit, NJ (Onsite) . Please review the job description below and contact me ASAP if you are interested. · Job ID: · ...
-
Site Reliability Engineer
4 days ago
Pinnacle Group, Inc. Austin, United StatesResponsibilities · We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apple's internal services as well as services that users directly use. As an ...
-
Reliability Engineer Lead
2 weeks ago
Apple Austin, United StatesSummary · Posted: Sep 6, 2023 · Role Number: · Do you love crafting sophisticated solutions to highly complex challenges? Do you intrinsically see the importance in every detail? As part of our Silicon Technologies group, you'll help design and manufacture our next-generation ...
-
Service Reliability Engineer
2 weeks ago
EOS USA Austin, United StatesWHO WE ARE: · EOS IT Solutions is a Global Technology and Logistics company, providing Collaboration and Business IT Support services to some of the world's largest industry leaders, delivering forward-thinking solutions based on multi-domain architecture. Customer satisfaction ...
-
site reliability engineer
2 weeks ago
Thales USA, Inc. Austin, United StatesLocation: Austin, United States of America. Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they hav Reliability Engineer, Liabili ...
-
Site Reliability Engineer
3 weeks ago
Tech Mahindra (formerly Mahindra Satyam) Austin, United StatesJOB RESPONSIBILITIES · • Make software services reliable, secure, monitor and auto-manage infrastructure for specialized workloads on AWS. · • Responsible for a scenario that would require you to collaborate closely across organizations and teams, to collaborate across geographi ...
-
Site Reliability Engineer
2 weeks ago
Apple Austin, United StatesSummary · Posted: May 1, 2024 · Weekly Hours: 40 · Role Number: · At Apple, we work every day to build products that enrich people's lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative and imaginative content on t ...
-
Site Reliability Engineer
1 week ago
Apple Austin, United StatesSite Reliability Engineer - Ad Platforms · Austin,Texas,United States · Software and Services · At Apple, we work every day to build products that enrich peoples lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative a ...
-
Site Reliability Engineer
2 weeks ago
Apple Austin, United StatesSite Reliability Engineer · Austin,Texas,United States · Software and Services · The Apple Information Apps Engineering teams power some of the most widely used Apple applications, such as Apple News, Stocks, Weather, and Books. We do this at a massive, global scale. We meet o ...
-
Reliability Engineer Lead
1 day ago
Apple Austin, United StatesReliability Engineer Lead – Custom Silicon Management · Austin,Texas,United States · Hardware · Do you love crafting sophisticated solutions to highly complex challenges? Do you intrinsically see the importance in every detail? As part of our Silicon Technologies group, you'll he ...
-
Site Reliability Engineer
5 days ago
Apixio Austin, United StatesWho We Are At the intersection of health plans and providers, Apixio and ClaimLogiq are creating a leading Connected Care platform to minimize reimbursement inaccuracies and high-quality patient care so they can thrive as the industry moves toward value-based reimbursement models ...
-
Site Reliability Engineer
3 days ago
SureCo Inc Austin, United StatesJob Type · Full-time · Description · Job Title: Site Reliability Engineer (SRE) · Location: Remote (comfortable working in the Pacific Time Zone) · SureCo is changing how people in the US take care of their health - in 2020, new regulations went into effect, allowing employers ...
-
Maintenance and Reliability Engineer
1 week ago
Jacobs Austin, United StatesYour Impact: · Challenging Today. Reinventing Tomorrow. · We're invested in you and your success. Everything we do is more than just a project. It's our challenge as human beings, too. That's why we bring a thoughtful and collaborative approach to every one of our partnerships. · ...
-
Site Reliability Engineer
1 day ago
Apple Austin, United StatesSite Reliability Engineer - Ad Platforms · Austin,Texas,United States · Software and Services · At Apple, we work every day to build products that enrich peoples lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative a ...
-
Service Reliability Engineer
2 days ago
EOS IT Solutions Austin, United StatesWHO WE ARE: · EOS IT Solutions is a Global Technology and Logistics company, providing Collaboration and Business IT Support services to some of the world's largest industry leaders, delivering forward-thinking solutions based on multi-domain architecture. Customer satisfaction ...
-
Site Reliability Engineer
3 weeks ago
Apple Austin, United States Regular, Full timeSummary · Posted: Apr 18, 2024 · Weekly Hours: · 40 · Role Number: · Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what ...
Site Reliability Engineer - Austin, United States - Pinnacle Group
Description
Job Details:4Excellent Python, bash, and scripting fundamentals
Expertise with AWS/GCP cloud platform
Expertise with infrastructure-as-code tools such as Terraform, Cloud Deployment Manager, Ansible, or Chef.
Experience in building systems where observability is a first-class concern using protocols and tools that cover the space of log aggregation, analytics, monitoring, distributed systems tracing and alerting
Experience with containerization and cluster management technologies like Docker, Kubernetes and EKS.
Familiarity with microservices architecture and container orchestration with Kubernetes
Experience in designing and managing a predictive alerting platform using monitoring tools such as Prometheus, Grafana, Cloud monitoring, Splunk.
Proficient at Linux system administration
Experience with modern web services architectures
Experience with Git
Expertise in Kubernetes – probably certified.
Able to build tools from scratch when needed.
Ability to quickly learn new and existing technologies.
Strong problem solving skills
Responsibilities:
Implement features that enable customer engineers to easily enable and configure Kubernetes and Infrastructure capabilities.
For example:
Pay Range:
$75-95
The specific compensation for this position will be determined by a number of factors, including the scope, complexity and location of the role as well as the cost of labor in the market; the skills, education, training, credentials and experience of the candidate; and other conditions of employment.
#J-18808-Ljbffr