-
Senior Engineer, SRE/DevOps
2 weeks ago
MachineFi Lab Remote, United States Full timeOur Vision: Machines Will Be Our Future Workforce · At MachineFi Lab, we're not just envisioning the future; we're actively building it—today. We power the new reward economy by fostering a fairer, safer, and more rewarding Internet of Things (IoT). Central to our mission is the ...
-
Kubernetes SRE
2 days ago
Diverse Lynx Myrtle Point, United StatesTitle Role : Kubernetes SRE · Location : USA Remote · Full Time · Job Description · SRE teams are expected to provide 24 x 5 (weekdays) support covering 4 time zones: UK, HK, IN, US (East Coast) which means that SREs must participate in weekend on-call rotations and be open t ...
-
Contract - SRE / Platform Engineer
3 weeks ago
Aquarium Remote, United States Full timeAquarium is hiring for a contractor Site Reliability / Platform Engineer role, focused on our Tidepool product. Tidepool is a data enrichment application that sits on top of a data warehouse, and allows users to create derived AI tables based on existing text columns. To address ...
-
Cloud Engineer
1 week ago
Five9 Remote, United StatesResponsibilities: Design, implementation, and automation of large-scale distributed systems · Build tools and automation that help Five9 achieve higher availability, scalability, latency, and efficiency · Work with Engineering teams to deliver high quality software in a fast-p ...
-
Multicloud Engineer
5 days ago
Unisys Remote, United States Full timeWhat success looks like in this role: · Key Responsibilities · Develop cloud solutions based on well-architected framework concepts · Collaborate with enterprise architects, cloud solutions architects, cloud engineers, program/project managers, and client stakeholders to manag ...
-
Software Engineer, L6
1 week ago
Netflix Remote, United States Full timeNetflix has been changing how people watch shows and movies, enabling on-demand access to thousands of movies and TV shows. Recently, Netflix has expanded its entertainment offering to include Live content, like Chris Rock Comedy Special, the SAG Awards ceremony or The Netflix Sl ...
-
Site Reliability Engineer
2 weeks ago
Fireblocks Remote, United States Full time· The world of digital assets is accelerating in speed, magnitude, and complexity, opening the door to new ways for leveraging the blockchain. Fireblocks' platform and network provide the simplest and most secure way for companies to work with digital assets and it trusted by so ...
-
Site Reliability Performance Engineer
3 weeks ago
Brooksource Remote, United StatesContract to Hire * · *Remote (EST Time Zone)* · Our Fortune 15 health care client is seeking a Site Reliability Engineer (SRE) to assist them as they fully transition to the cloud. You will play a critical role in ensuring the reliability, scalability, and performance of their sy ...
-
Senior Site Reliability Engineer
3 weeks ago
Lumin Digital Remote, United States Full timeOur Site Reliability Engineers (SRE) are good developers with an operations mindset. They enjoy reducing or completely eliminating manual tasks, are excellent problem solvers, and know automation is the key to operating a large-scale system. · SREs make sure that our application ...
-
Site Reliability Engineer
3 weeks ago
Coinbase Remote, United StatesWe're a group of hard-working overachievers who are deeply focused on building the future of finance and Web3 for our users across the globe, whether they're trading, storing, staking or using crypto. Know those people who always lead the group project? We're a remote-first compa ...
-
Senior Site Reliability Engineer
1 month ago
DFIN Remote, United States Full timeDonnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise and d ...
-
Sr. Site Reliability Engineer
3 days ago
AlphaSense Remote, United States Full timeLocation: Remote, United States · Reports to: Engineering Manager, SRE · About AlphaSense: · AlphaSense is a market intelligence platform used by the world's leading companies and financial institutions. Since 2011, our AI-based technology has helped professionals make smarter ...
-
Site Reliability Engineer
4 weeks ago
Podium Remote, United States Full time· At Podium, our mission is to help local businesses win. Our lead conversion platform, powered by AI and integrations, helps local businesses convert leads faster, communicate easier, and make more sales. Every day, thousands of local businesses utilize our review management, c ...
-
Site Reliability Engineer, Americas
1 month ago
Edge & Node Remote, United States Full timeEdge & Node stands as the revolutionary vanguard of web3, a vision of a world powered by individual autonomy, shared self-sovereignty and limitless collaboration. Established by trailblazers behind The Graph, we're on a mission to make The Graph the internet's unbreakable foundat ...
-
Sr. Site Reliability Engineer
1 month ago
Sunrun Remote, United States Full timeEverything we do at Sunrun is driven by a determination to transform the way we power our lives. We know that starts at the individual employee level. We strive to foster an environment you can thrive in through our commitment to diversity, inclusion and belonging. · Objective: · ...
-
Senior Solutions Engineer
1 month ago
RapDev Remote, United States Full timeAbout RapDev · We specialize in modern ITOM, ITAM, ITSM, DevOps & SecOps ServiceNow implementations and extensions, as well as integrations and services for Datadog. Our experienced team of ServiceNow Developers, Engineers, and Architects, as well as our Cloud, DevOps, SRE, and O ...
-
Site Reliability Engineer
1 month ago
Aurora Labs Remote, United States Full timeAbout Us · Aurora Labs is the development company behind Aurora—the EVM blockchain that runs on the NEAR Protocol. We are also the developers of, and integration partner behind, Aurora Cloud—a suite of products that allow Web2 companies to capture the value of Web3. · We invite ...
-
Big Data DevOps Lead
3 weeks ago
Zeta Global Remote, United States Full timeSummary: · As a Senior or Lead Big Data DevOps Engineer, you will be working with a team responsible for setting up, scaling, and maintaining Big Data infrastructure and tools in private and public cloud environments. · Main Responsibilities: · • Driving improvement of the effici ...
-
Site Reliability Engineer: Postgres
2 weeks ago
Supabase Remote, United States Full timeSupabase is an Open Source and fully remote company building developer tools for databases. · We are seeking an experienced SRE to manage the infrastructure of our Postgres databases. We currently manage over 1M Postgres instances and are growing fast. · You will: · Help build th ...
-
Staff Site Reliability Engineer
3 days ago
SentinelOne Remote, United States Full time· About Us: · SentinelOne is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real-time. Singularity XDR ingests data and leverages our patented AI models to deliver autonomous protection. With Sentine ...
sre - Remote, OR, United States - Diverse Lynx
Description
Role: SRE
Location : United States (Remote)
Job Description
SRE teams are expected to provide 24 x 5 (weekdays) support covering 4 time zones: UK, HK, IN, US (East Coast) which means that SREs must participate in weekend on-call rotations and be open to flexible working hours.
Standard working is remote: 8 hours each day, 5 days per week, with 1h for lunch, exact hours to be agreed with the team lead, with flexibility to work occasionally outside of their agreed hours.
The SRE will be assigned to work in one region. Flexibility is required for this role, for example they may be asked to work 08:00-17:00 or 10:00-19:00. Start times may vary across the team and will be discussed and agreed by the team lead with each SRE.
On-call:
SREs are expected to participate in mandatory on-call weekend rotations (1 weekend in 4). Exact dates and times to be agreed with team lead to ensure coverage across all regions,
The SRE on-call is expected to acknowledge incident within 10 mins and respond appropriately. If the SRE on-call is not available for any reason, they must inform the team lead and arrange a swap with another team member,
On-call hours and any extra hours worked should be taken as time off in lieu.
Additional hours:
There are occasions when the team s client requires 24/7 online support e.g., for major regulatory releases, where a separate support rotation and schedule may be required to accommodate the requirement,
SREs are not required to take time off in lieu following online support, it will be treated as paid overtime.
Requirements (Must Have)
Experience as a Senior DevOps Engineer/SRE in an Agile environment.
Experience with Linux/Unix systems and Shell Scripting (BASH).
Experience with Kubernetes, preferably GKE on Prem.
Ability to program (structured and OO) with one or more high level languages, such as Python or Go.
Experience with building and managing automated CI/CD pipelines and related tools (GitLab CI/CD, Jenkins).
Experience with VMware and other virtualization platform technology.
Incident support and resolutions.
Implement and enhance monitoring, alerting, and incident response processes.
Automate manual tasks to boost efficiency and minimise human error.
Excellent analytical skills, capable of fast decision-making using sound judgement, and not afraid to explore new ideas.
Excellent interpersonal skills in dealing with customers with differing technical specializations, able to work efficiently in a high-pressure environment.
Good organizational and English communication skills are required, including prioritization of multiple projects and objectives.
Desirable
Istio knowledge and understanding of Anthos Service Mesh.
Familiarity with monitoring and logging tools (Splunk, Prometheus, Datadog, Kiali).
Kubernetes certification is desirable and considered a plus.
Experience on Load Balancer and reverse Proxies (Nginx Controller / Seesaw).
Experience with containerisation technologies (Docker) and infrastructure-as-code tools (Terraform).
Knowledge of Portworx for Kubernetes Storage.
Knowledge about confluent resources like connectors, control center, replicators.
Knowledge of Kafka Stream Generator, KSQLDB, cluster federation, Spark Streams.
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.