LLM Research Engineer - Palo Alto, United States - CHAI: AI Platform
Description
AI Research Engineer (LLM Optimization)
$250-350K | PALO ALTO, CA
Chai is one of the fastest-growing generative AI startups in Silicon Valley. Think YouTube, but for LLMs: we have over 1 million active users.
Who we are looking for:
We need a relentless engineer with 3+ years of experience to oversee and take responsibility for optimizing our LLMs, ensuring they are performant, scalable, and cost-efficient. You will work alongside equally talented and driven teammates implementing cutting-edge AI inference engines. We need someone who is reliable and has high standards.
Here's why we might not be the right fit for you:
We work hard and have a high-velocity environment with lots of growth opportunities.
We value exceptional performance and continuous improvement. We believe that if you aren't constantly learning, you aren't growing.
You will be responsible and accountable for making high-impact decisions that determine Chai's future.
Here are the top 2 reasons why you should join us:
Exponential growth. 1 million MAU. Join the team that gets us to 100 million MAU.
Craftsmanship. Build something beautiful.
Requirements:
Familiarity with vLLM, quantization, and current LLM optimization techniques
3+ years of experience in software engineering
Bachelor's or Master's degree from a leading academic institution
Here is our tech stack:
Front end: Python, Flutter, Dart
Back end: Python, GCP, Redis, Kubernetes
Process:
Exceptionally fast: application to offer within 7 days
1. Apply here
2. First round video interview, system design interview, then onsite
3. Reference checks, negotiation, and offer