Site Reliability Engineer, Cloud Native Platform - San Jose, CA, United States - TikTok

    Default job background
    Description
    TikTok is the leading destination for short-form mobile video. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.
    At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for more than 1 billion users on our platform.

    We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes.

    Join us and make impact happen with a career at TikTok.

    Our infrastructure team is seeking experienced site reliability engineers to build globally distributed edge platform for provisioning and deploying edge services.

    Our team operates a large network of edge POPs around the world to accelerate site traffic and cache CDN content.

    We use Kubernetes to manage on-prem/cloud nodes and build an eco-system around it, including tools for monitoring, alerting, logging, CI/CD, etc.

    and various services with automated deployment and scaling in order to maximize daily operation efficiencies.

    On top of the Kubernetes infra, we build the edge computing platform (PaaS) to help deploy and manage global edge services.

    Deploy and administrate Kubernetes clusters both on-prem and in cloud (AWS, GCP, etc.).


    • Collaborate with software engineers to build enterprise-level edge computing platform (PaaS) with cutting-edge Cloud Native Computing Foundation (CNCF) technologies.
    • Design, develop, automate, and continuously improve platform services and pipelines, such as monitoring, alerting, logging, tracing, CI/CD, etc.
    • Improve Kubernetes system efficiency and debug issues related to networking, storage, scheduling, etc.
    • Collaborate with open-source communities to advance Kubernetes and edge computing technologies.
    Master's degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Computer Science, or related fields.


    • 3+ years of experience in Unix/Linux systems from kernel to shell and beyond.
    • Experience with Kubernetes CNI deployment and troubleshooting, including (but not limited to) the following CNIs: Cilium, Kube-Router, Calico, Flannel.
    • Experience in designing, analyzing, and building automation tools for large scale and complex systems.
    Experience in using and contributing to open-source projects in Kubernetes ecosystem, e.g. Experience in networking technologies such TCP/IP, BGP, DNS, load balancers, etc.


    • Experience in CI/CD pipeline design and development.
    • TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. TikTok is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to us at .Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.

    We cover 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents and offer a Health Savings Account(HSA) with a company match.

    As well as Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life and AD&D insurance plans. In addition to Flexible Spending Account(FSA) Options like Health Care, Limited Purpose and Dependent Care.

    Our time off and leave plans are: 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) (prorated upon hire and increased by tenure) and 10 paid sick days per year as well as 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.

    We also provide generous benefits like mental and emotional health benefits through our EAP and Lyra. A 401K company match, gym and cellphone service reimbursements.