Senior Site Reliability Engineer - Los Angeles, United States - SHEIN Technology LLC

SHEIN Technology LLC Los Angeles, United States

2 weeks ago

Description

Job Title:
Senior Site Reliability Engineer I

Reports to:
Senior Manager of Site Reliability Engineering

Job Location:
San Diego, CA, USA

Job Status:
Exempt, FT

About SHEIN
SHEIN is a global fashion and lifestyle e-retailer committed to making the beauty of fashion accessible to all.

We use on-demand manufacturing technology to connect suppliers to our agile supply chain, reducing inventory waste and enabling us to deliver a variety of affordable products to customers around the world.

From our global offices, we reach customers in more than 150 countries.

Founded in 2012, SHEIN has nearly 10,000 employees operating from offices around the world, with U.S. Headquarters located in Los Angeles and Global Headquarters located in Singapore. In SHEIN, we work with outstanding, creative, and capable peers. We share an energetic and open culture for capable people to discern, work and ignite as a team.

Position Summary
We are looking for a Senior Site Reliability Engineer - Big Data (

Official Title:
Senior Site Reliability Engineer I) for our San Diego, CA-based office hub.

Site Reliability Engineers work with the Technical Operations team at SHEIN and are hybrid software/systems engineers, whose overarching goal is to ensure that Production Services are "Always On." They strive to build the most reliable and performant systems on the planet.

SREs work closely cross-functional teams to ensure we have the right set of tools to generate, collect, analyze, visualize and alert on operational data, so we know exactly what happens across the ecosystem and can see problems before they occur and address them as quickly as possible.

They are also responsible for improving Operational Efficiency, Utilization and System Resiliency of the Platform.

They own Critical Open-Source Software that our platform relies on and are core participants in every significant engineering effort underway in the platform.

They are also tasked with driving forward the operability of the platform to drive down the number of incidents while reducing MTTR.

To accomplish this, the team combines software development, networking and systems engineering expertise, and a strong desire to be challenged by problems of scale and complexity to make our service better for our customers.

Job Responsibilities
Participate in an on-call rotation to ensure 24/7/365 availability of SHEIN's production system
Supervise capacity & utilization and work closely with cross-functional teams to orchestrate scale-up/down of the services
Own & operate critical open-source services like Elasticsearch, Kafka, RabbitMQ, Redis
Build tools and design processes that help improve observability and system resiliency of the platform
Triage Site Availability Incidents and proactively work towards reducing MTTR for customer impacting incidents
Partner with Service owners to implement Service Level Metrics & Service Level Objectives that act as service level health indicators
Establish design patterns for monitoring, benchmarking and deploying new features for the backend services
Develop and maintain technical documentation, network diagrams, runbooks, and procedures
Driving initiatives to evolve our current platform to increase efficiency and keep it in line with current standards and best practices
Responding to production incidents and using your experience in software development, systems engineering, and networking to proactively prevent repeatable issues
Provide relief and sustainable resolution to issues within our infrastructure
Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design.
Join a culture of intolerance to manual activity which results in a highly automated environment delivering scalable solutions.
Drive efficiencies through software improvement and root cause analysis resulting in service delivery, maturity, and scalability.

Job Requirements
Bachelor's degree in Computer Science, Information Systems, or equivalent technical discipline is preferred
Experience with Big Data related component operation and maintenance, including Hadoop, Yarn, HBase, Hive, Spark, etc., is highly preferred
Experience with OSS technologies, like Elasticsearch, Kafka, and Redis, is highly preferred
Solid understanding of Linux system is preferred
Minimum 3 years working experience in an enterprise 24/7 production environment supporting mission-critical, real-time, high-traffic applications, especially in cloud environments is preferred
Systematic problem-solving approach, combined with a sense of ownership and drive
Full-stack debugging and performance optimization ability, including knowledge of Cloud systems (load balancing, caching, content distribution, etc.), continuous integration/build systems, Java, SQL and NoSQL databases
Track record monitoring and analyzing system performance, isolating issues or bottlenecks that could impact reliability, performance and scalability
Strong experience with observability tools such as Grafana, Prometheus, Zabbix etc
Good experience in any of the scripting/programming languages: Python, GoLang etc
Familiar with container technology, such as: Docker, Kubernetes, Mesos, etc.
Understanding and experience with SRE concepts and practices, including being an advocate for the elimination of toil and drive simple solutions
Good verbal and written communication skills, and be able to work effectively with geographically remote teams

Pay
$107,600.00 min - $180,200.00 max annually, Bonus & RSU offered.

Benefits and Culture
Healthcare (medical, dental, vision, prescription drugs)
Health Savings Account with Employer Funding
Flexible Spending Accounts (Healthcare and Dependent care)
Company-Paid Basic Life/AD&D insurance
Company-Paid Short-Term and Long-Term Disability
Voluntary Benefit Offerings (Voluntary Life/AD&D, Hospital Indemnity, Critical Illness, and Accident)
Employee Assistance Program
Business Travel Accident Insurance
401(k) savings plan with discretionary company match and access to a financial advisor
Vacation, Paid holidays and sick days
Employee Discounts

Perks (HQ Location)
Free weekly catered lunch at HQ
Dog-Friendly office
Free Gym Access at HQ
Free Swag Giveaways
Annual Holiday Party
Invitations to pop-ups and other company events
Complimentary daily office snacks and beverages
Free Shuttle Service from HQ to LA Union Station

SHEIN Distribution is an equal opportunity employer committed to a diverse workplace environment.

#J-18808-Ljbffr

Reliability Engineer

2 weeks ago

Kindeva Drug Delivery Los Angeles, United States

Monday, May 6, 2024 · The Reliability Engineer will lead the sites Asset Reliability agenda, effectively promoting analytical problem-solving techniques and structured reliability improvement processes. · We have an immediate opening for a Reliability Engineers at Kindeva's Nort ...
Reliability Engineer

2 weeks ago

Kindeva Drug Delivery Company Los Angeles, United States

The Reliability Engineer will lead the sites Asset Reliability agenda, effectively promoting analytical problem-solving techniques and structured reliability improvement processes. · We have an immediate opening for a Reliability Engineers at Kindeva's Northridge, CA manufacturi ...
Reliability Engineer

2 weeks ago

GRN Hudson (Global Recruiters Network) Los Angeles, United States

Reliability Engineer · Supporting manufacturing operations to achieve superior equipment efficiency with minimal down time. The ideal candidate will play a crucial role in ensuring the reliability and performance of our industrial equipment through continuous improvement initiati ...
Reliability Engineer

1 week ago

The Mosaic Company Los Angeles, United States

Are You Our Next Reliability Engineer-Multiple Level Applicants Welcome? · Join our dynamic team at the forefront of the global digital acceleration as a Reliability Engineer I, II, III or Sr to work at our Mosaic Uncle Sam Plant. The successful candidate will find opportunities ...
Reliability Engineer

5 days ago

Hargrove Engineers and Constructors Los Angeles, United States

Who We Are · Hargrove supplies unparalleled services in engineering, procurement, construction management, and technical services in the industrial, commercial, and government sectors. With over 2,000 Teammates across the US, we build long-term support relationships in the energ ...
Site Reliability Engineer

2 weeks ago

BayOne Solutions Los Angeles, United States

Position: Site Reliability Engineer · Location: Los Angeles, CA · Duration: 6+ Months · Pay Range: $85/hr - 90/hr on W2 · Site Reliability Engineer · It is an exciting time to be part of SIE's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the inte ...
Site Reliability Engineer

1 week ago

City National Bank Los Angeles, United States

Overview: · SITE RELIABILITY ENGINEER WHAT IS THE OPPORTUNITY? As an SRE, you will utilize your software, systems engineering, and operations background to build and run large-scale, fault-tolerant systems. Your role is to ensure the reliability, scalability and maximum uptime o ...
Site Reliability Engineer

2 weeks ago

BayOne Solutions Los Angeles, United States

Position: Site Reliability Engineer · Location: Los Angeles, CA · Duration: 6+ Months · Pay Range: $85/hr - 90/hr on W2 · If your skills, experience, and qualifications match those in this job overview, do not delay your application. · Site Reliability Engineer · It is an e ...
Maintenance Reliability Engineer

1 week ago

Kelly Science, Engineering, Technology & Telecom Los Angeles, United States

Kelly Engineering is seeking a · Maintenance Reliability Engineer in Westlake, LA · to join one of our leading clients focused on the design and construction of an · ethylene · production facility to operate 2 billion pounds per year. · If you're looking to launch your caree ...
Site Reliability Engineer

3 weeks ago

Metric Corporation Los Angeles, United States

Role Description · This is a full-time remote role for a Site Reliability Engineer at Metric LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. Your day-to-day tasks will include mo ...
Site Reliability Engineer

1 week ago

SHEIN Technology LLC Los Angeles, United States

Job Title : Site Reliability Engineer · Reports to: · SRE Manager · Job Location : Los Angeles, CA · Job Status: · Exempt, FT · About SHEIN · SHEIN is a global fashion and lifestyle e-retailer committed to making the beauty of fashion accessible to all. We use on-demand manufa ...
Site Reliability Engineer

2 weeks ago

MetroSys Inc Los Angeles, United States

Responsibilities: · Operational Oversight: · Oversee the operation of software and services within the Traffic Front End and Coordination teams. · Ensure seamless functionality, uptime, and performance. · Networking and Traffic Expertise: · Leverage your deep understanding of n ...
Site Reliability Engineer

2 weeks ago

SHEIN Technology LLC Los Angeles, United States

Job Title: Site Reliability Engineer · Reports to: SRE Manager · Job Location: Los Angeles, CA · Job Status: Exempt, FT · About SHEIN · SHEIN is a global fashion and lifestyle e-retailer committed to making the beauty of fashion accessible to all. We use on-demand manufacturing t ...
Site Reliability Engineer

3 weeks ago

iTCO Solutions Los Angeles, United States

CI/CD Site Reliability Team · Onsite hybrid in any California office. · 6 months (possible hire) · Site Reliability Engineer · It is an exciting time to be part of SIEs CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Eng ...
Senior Reliability Engineer

2 weeks ago

ATR International Los Angeles, United States

We are seeking a Reliability Development Engineer for a very important client. · Job Overview - Principal Duties and Responsibilities · Successful candidate will be tasked for Product, Package reliability test tracking; reliability database, data analysis and summarization on a ...
Site Reliability Engineer

2 weeks ago

BayOne Solutions Los Angeles, United States

Position: Site Reliability Engineer · Location: Los Angeles, CA · Duration: 6+ Months · Pay Range: $85/hr - 90/hr on W2 · Site Reliability Engineer · It is an exciting time to be part of SIE's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the inte ...
Site Reliability Engineer

2 weeks ago

Epsilon3 Los Angeles, United States

[Full Time] Site Reliability Engineer (Remote) at Epsilon3 (United States) | BEAMSTART Jobs · Site Reliability Engineer (Remote) · Epsilon3 United States · Date Posted · 27 Jun, 2023 · Work Location · Los Angeles, CA, United States · Salary Offered · $120000 — $180000 yearly · ...
Site Reliability Engineer

1 week ago

BayOne Solutions Los Angeles, United States

Position: Site Reliability EngineerLocation: Los Angeles, CADuration: 6+ MonthsPay Range: $85/hr - 90/hr on W2 · Site Reliability Engineer · It is an exciting time to be part of SIE's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection ...
Site Reliability Engineer

2 weeks ago

BayOne Solutions Los Angeles, United States

Position: Site Reliability Engineer · Location: Los Angeles, CA · Duration: 6+ Months · Pay Range: $85/hr - 90/hr on W2 · Site Reliability Engineer · It is an exciting time to be part of SIEs CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the i ...
Site Reliability Engineer

2 weeks ago

Adastra replica Los Angeles, United States

Job Description · Job Description · Our client is looking for an experienced Site Reliability Engineer to design, operate, maintain, and scale mission-critical infrastructure and products. Products include (but are not limited to) automated Hardware-In-The-Loop (HITL) data anal ...

Senior Site Reliability Engineer - Los Angeles, United States - SHEIN Technology LLC

Description

Reliability Engineer

Reliability Engineer

Reliability Engineer

Reliability Engineer

Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Maintenance Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Senior Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Arther Shelby

Oliver Keane

Brad Glassman

Kateryna Pavlova

Shaun Robles

Mohamed Khedr

for Recruiters

Information

Senior Site Reliability Engineer - Los Angeles, United States - SHEIN Technology LLC

Description

Senior Site Reliability Engineer professionals in Los Angeles