- Observability and Monitoring:
- Develop and implement robust observability strategies, including logging, metrics, and tracing, to gain deep insights into the performance and health of our systems.
- Site Reliability Engineering:
- Lead initiatives to improve the reliability, availability, and scalability of our applications and infrastructure.
- Incident Management:
- Develop and refine incident response processes, ensuring timely detection, analysis, and resolution of incidents.
- Automation and Tooling:
- Build and maintain automation tools for deployment, monitoring, and incident response to streamline operational processes.
- Collaboration and Leadership:
- Provide technical leadership and mentorship to the engineering team.
- Bachelor's or higher degree in Computer Science, Software Engineering, or a related field.
- Certifications in relevant areas such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or equivalent.
-
SRE Engineer
4 weeks ago
Diverse Lynx Wilmington, United StatesJob Title: SRE Engineer · Location : Wilmington, DE - Only Local · Job Description Formal training or certification on site reliability engineering concepts and 3+ years applied experience · Proficient in site reliability culture and principles and familiarity with how to implem ...
-
SRE Engineer
2 weeks ago
Diverse Lynx Wilmington, United StatesJob Title: SRE Engineer · Location : Wilmington, DE (onsite, o nly l ocal s) · Job Description Formal training or certification on site reliability engineering concepts and 3+ years applied experience · Proficient in site reliability culture and principles and familiarity with h ...
-
SRE Engineer with AWS
6 days ago
Diverse Lynx Wilmington, United StatesJob Title: SRE Engineer with AWS · Location: Wilmington, DE(Day1 Onsite) · Duration: Contract/Full-Time · DESCRIPTION: 9+ years DevOps/SRE experience required in a tech forward environment, having demonstrated progressively innovative contributions in system architecture. · E ...
-
SRE Engineer with AWS
2 weeks ago
Abode Techzone LLC Wilmington, United StatesGreetings from Abode Techzone, LLC · We are looking for a best match for one of our client's urgent requirements mentioned below. · Let me know if you would be interested to move ahead, if yes please share your · UPDATED RESUME · with your · Hourly Rates · expectations along ...
-
SRE Engineer
3 weeks ago
Diverse Lynx Wilmington, United StatesJob Title: SRE Engineer · Location : Wilmington, DE (onsite, locals only) · Job Description: · Formal training or certification on site reliability engineering concepts and 3+ years applied experience · Proficient in site reliability culture and principles and familiarity with ho ...
-
SRE Engineer with AWS
3 weeks ago
Abode Techzone LLC Wilmington, United StatesGreetings from Abode Techzone, LLC · We are looking for a best match for one of our client's urgent requirements mentioned below. · Let me know if you would be interested to move ahead, if yes please share your UPDATED RESUME with your Hourly Rates expectations along with belo ...
-
SRE, DevOps Engineer
3 weeks ago
Diverse Lynx Malvern, United StatesSkill: SRE, DevOps, AWS, Site Reliable Engineer · Technology requirements · Min 8+ yrs hands on experience in Ansible Devops technologies. · Strong technical skills in CI CD pipelines. · Experience in Agile methodologies. · Be a key technical contributor on this team. · Partner ...
-
Sr. AWS SRE Engineer
4 weeks ago
Saxon Global Wilmington, United StatesJob Description: · 5+ years DevOps/SRE experience required in a tech forward environment, having demonstrated progressively innovative contributions in system architecture. · Experience building deployment automation within AWS-hosted services and infrastructure. · Experience w ...
-
Performance Engineer
1 week ago
PENN Interactive Philadelphia, United StatesPenn Interactive (PI) is an interactive gaming company headquartered in Philadelphia. PI is the digital arm of PENN Entertainment (NASDAQ: PENN), the largest regional casino operator in the U.S.). Our mission is to challenge the norms of the gaming industry by building an immersi ...
-
Lead Software Engineer
3 weeks ago
Comcast Corporation Philadelphia, PA, United StatesFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and s ...
-
Technical Lead
1 week ago
Tanisha Systems Philadelphia, United StatesMandatory Skills: · Elastic Observability Mandatory · Observability & Monitoring: · Site Reliability Engineering: · Incident Management: · Automation & Tooling: · Collaboration & Leadership: · OTEL · Responsibilities: · Observability and Monitoring: · Develop and implement ...
-
DevOps (Linux, AWS, Rel Mgmt) II
3 weeks ago
Apidel Technologies Philadelphia, United StatesJob Description · Job DescriptionDescription: · Seeking top technical talent for their Site Reliability Engineering team. Our client has revenues of $5 billion with 30 percent being generated from the online organization. The online organization has consistently leveraged innovat ...
-
Performance Engineer
2 weeks ago
Penn Interactive Philadelphia, United StatesPenn Interactive (PI) is an interactive gaming company headquartered in Philadelphia. PI is the digital arm of PENN Entertainment (NASDAQ: PENN), the largest regional casino operator in the U.S.). Our mission is to challenge the norms of the gaming industry by building an immersi ...
-
Lead Mainframe SRE
2 weeks ago
Metlife Services And Solutions Llc Philadelphia, United Statesand RequirementsRole Value Proposition:We are looking for a Mainframe SRE who can help design and implement modern applications on our Mainframe.You will be responsible for ensuring the reliability, performance, security, and scalability of our Mainframe zCX, OpenShift, z/OS Conn ...
-
Comcast Philadelphia, United StatesFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, were making it easier for buyers and sellers to transact across all screens, data types, and sa ...
-
Senior Site Reliability Engineer
2 weeks ago
Comcast Philadelphia, United StatesMake your mark at Comcast -- a Fortune 30 global media and technology company. From the connectivity and platforms we provide, to the content and experiences we create, we reach hundreds of millions of customers, viewers, and guests worldwide. Become part of our award-winning tec ...
-
Sr. Site Reliability Engineer
1 week ago
Comcast Philadelphia, United StatesFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and s ...
-
Sr. Site Reliability Engineer
2 weeks ago
Comcast Philadelphia, United StatesFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, were making it easier for buyers and sellers to transact across all screens, data types, and sa ...
-
Sr. Site Reliability Engineer
2 weeks ago
Comcast Philadelphia, United StatesFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and s ...
-
Lead Site Reliability Engineer
1 week ago
Comcast Philadelphia, United StatesFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and s ...
SRE Engineer - Philadelphia, United States - Diverse Lynx
Description
Job Title: SRE EngineerLocation: Philadelphia, PA (Day 1 Onsite)
Position: Contract
Mandatory Skills : Datadog
Additional Skills: ELK Performance Monitoring
Responsibilities:
Drive the adoption of SRE principles and practices across the organization.
Qualifications:
Strong expertise in designing and implementing distributed systems for high availability and reliability.
Proficiency in APM (Application performance monitoring), RUM (Real user monitoring), Synthetics, correlation, alert & incident management will be required. (e.g., OTEL, Jaeger, Kloudfuse, service-now)
Proficiency in one or more programming languages (e.g., Java, Python, Go).
Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).
In-depth knowledge of observability tools and frameworks (e.g., Prometheus, Grafana, ELK stack, Datadog, Aternity) and incident management processes.
In-depth knowledge of Client & AI frameworks (e.g., Anomaly, Outlier, AIOps, LLM )
Excellent communication and collaboration skills.
Demonstrated ability to lead technical initiatives and mentor team members.
Preferred Qualifications:
Familiarity with Infrastructure as Code (IaC) tools such as Terraform, Packer & C Crossplane
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.