- Expert in observability and SRE principles, including the definition and management of SLIs, SLOs, and SLAs
- Experienced with the Grafana stack and other application performance management (APM) tools and frameworks such as the Elastic Stack and AppDynamics.
- Expert in implementing SRE observability, including instrumentation for metrics, logs, and traces.
- Expertise in Docker & Kubernetes is required.
- Excellent understanding of microservices architecture, design patterns, and standard methodologies, with an eye towards scale, automation, resiliency, and high availability
- Prior experience with high-volume distributed technical architectures that have a high cost of failure, with a focus on reliability and availability
- Experienced with telemetry tooling and observability systems such as Jaeger, Prometheus, OpenTracing, OpenTelemetry, AppDynamics, Splunk, Datadog, New Relic, Lightstep, and Grafana.
- Some experience with big data technologies such as Elasticsearch, NoSQL stores, Kafka, columnar databases, dataflow or pipeline systems, and graph data stores.
- Expert in performance analysis, resilience, chaos engineering, FMEA, scalability, and high availability, using tools such as JProfiler and thread dump analyzers.
- Experience leveraging common infrastructure services such as an enterprise message bus, configuration services, toggles, logging systems, and telemetry for observability (e.g. OpenTelemetry).
- Strong experience with ServiceNow ITOM and ITSM modules that focus on Discovery, Event Management, and Incident, Problem, and Change Management
- Strong sense of architecture and design for fault tolerance, scale-out approaches, and stability; has practiced the Azure Well-Architected Framework or the equivalent on another cloud such as Google Cloud Platform
- Experience in emerging technologies like Machine Learning and AI Ops is a plus.
Site Reliability Engineer - Dallas, United States - Tecktiva
Description
Job Title: Site Reliability Engineer
Location: Phoenix, AZ / Dallas, TX
Duration: 6 Months+
Responsibilities:
Warm Regards,
Dev Raj | Delivery Head - Talent Acquisition
Tecktiva, LLC