Research Engineer - San Francisco, California, US
1 day ago

Job description
Cloudglue - Video Understanding Infrastructure
Cloudglue is a Y Combinator-backed startup building developer APIs that turn video and audio into structured, searchable data. We handle the hard infrastructure - transcription, visual analysis, search, extraction - so developers can build on top of video without managing ML pipelines themselves.
We process millions of minutes of video for customers building search, analytics, and automation products. The research problems are real: how do you retrieve the right 10 seconds from 10,000 hours of video? How do you extract structured facts from noisy, multimodal content? How do you reason across visual and spoken information at scale?
Our team has shipped large-scale systems at Snapchat and Amazon, with work presented at NeurIPS, ICCV, CVPR, KubeCon, and DEF CON. We're a small, technical team where researchers ship code and engineers read papers.
The Role
We're looking for a research engineer to work on the core multimodal retrieval and video reasoning systems that power Cloudglue. This is a 50/50 research and engineering role - you'll design novel approaches to hard retrieval and understanding problems, and you'll ship them into production where real customers depend on them.
You'll work across:
- Multimodal retrieval - finding relevant moments across visual, audio, and text signals in large video collections
- Structured extraction - pulling entities, facts, and relationships from video content
- Video reasoning - understanding temporal, causal, and semantic relationships across long-form content
- Evaluation and benchmarking - designing metrics and datasets to measure real-world system quality
This is not a pure research role. You'll be expected to take ideas from paper to prototype to production. But it's also not a pure engineering role - we need someone with genuine research depth who can identify the right problems to work on and design novel solutions.
What You'll Do
- Multimodal retrieval: Design and improve retrieval systems that search across video, audio, and text - including embedding models, re-ranking, and hierarchical search strategies.
- Video understanding: Build systems that extract structured information from video - temporal segmentation, entity extraction, scene understanding, and content summarization.
- Model fine-tuning & integration: Fine-tune and adapt vision and language models (LoRA/PEFT, full fine-tuning) for production use cases. Evaluate open-source and proprietary models and orchestrate them in serving pipelines.
- Experiment and ship: Run experiments, analyze results rigorously, and turn successful research into production systems that handle real-world video at scale.
- Collaborate: Work directly with founders and infrastructure engineers. Short feedback loops, no layers of process.
What We're Looking For
Required
- MS or PhD in computer science, machine learning, or a related field
- Research experience in one or more of: multimodal learning, information retrieval, computer vision, NLP, or video understanding
- Strong implementation skills in Python and PyTorch (or equivalent)
- Ability to independently drive research from idea to experiment to working system
Nice to Have
- First-author publication at a top venue (NeurIPS, CVPR, ICCV, ECCV, ACL, EMNLP, SIGIR, ISMIR, ICASSP, or similar)
- Experience with video or multimodal foundation models (CLIP, LLaVA, Qwen3-VL, etc.)
- Experience with retrieval systems, embedding models, or ranking/re-ranking pipelines
- Experience deploying ML systems in production
- Familiarity with vector databases (Milvus, Weaviate) or search infrastructure
- Experience with model fine-tuning techniques (LoRA, PEFT, QLoRA) and training infrastructure (Ray, Kubeflow, or similar)
- Experience with ML inference serving (vLLM, TensorRT, Triton, or similar)
Why Cloudglue?
Video is the largest and most underutilized data source on the internet. Most software still can't meaningfully search or reason over it. The research problems here - multimodal retrieval, temporal reasoning, structured extraction from noisy real-world content - are genuinely unsolved and directly tied to the product.
If you want to work on:
- Research problems with immediate, measurable product impact
- A domain where the state of the art is still being defined
- A small team where your research directly shapes the product
- Multimodal systems at real scale, not toy benchmarks
…this is that role.
Similar jobs
We're seeking an exceptional Research Engineer / Research Scientist to join our Life Science team at Anthropic. Our team is organized around the north star goal of accelerating progress in the life sciences... · We believe that the highest-impact AI research will be big science. ...
1 month ago
We are seeking a Research Engineer / Scientist to join our Post-Training team responsible for training and improving pre-trained models. · ...
1 week ago
A deep-tech startup on a mission to build a foundational AI model from scratch. Our team combines world-class research, cutting-edge engineering, and a bold vision: to create AI systems that are causal, interpretable, and scalable. · ...
3 weeks ago
We are building the infrastructure for decentralized AI development at scale. · We aggregate global compute and enable researchers to collaboratively train state-of-the-art models through distributed training across clusters. · As a Founding Research Engineer, you'll play a cruci ...
4 weeks ago
· The Center for AI Safety (CAIS) is a leading research and advocacy organization focused on mitigating societal-scale risks from AI. We address AI's toughest challenges through technical research, field-building initiatives, and policy engagement, along with our sister organiza ...
2 days ago
We are recruiting for an SF based deep-tech startup building foundation ML models for multi-physics simulation. · ...
1 month ago
We are developing foundation models for physics simulations—generalizable, high-fidelity solvers that accelerate inverse design and enable scalable design space exploration. · Proven contributions to computational physics and scientific machine learning (through publications or i ...
1 month ago
+This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. · Join a mission-driven AI company as a Research Engineer to bridge theoretical AI safety research with practical implementation. · +Build and maintain robust research infrastructure, ...
1 month ago
Design and implement scalable technical infrastructure that enables researchers to efficiently run experiments and evaluate AI systems. · ...
1 week ago
+Job summary · You=re needed to push the boundaries of what our models can understand.We=re building evaluation frameworks and research systems that measure, improve, and validate enterprise intelligence at a scale nobody has attempted. · Designing evaluation pipelines that measu ...
2 weeks ago
We are looking for visionary Research Engineers to join our Applied Voice Team where you'll conduct groundbreaking research on speech models and transform it into real-world applications. · You will design and build state-of-the-art speech models and help turn research breakthrou ...
5 days ago
We seek an AI Research Engineer to help design train evaluate & optimize Chai's core models & infrastructure about · Develop all infrastructure tooling upon which our AI research depends. · Collaborate with AI researchers to understand implement & optimize new research direction ...
1 week ago
We're an AI research lab building internet-scale video understanding systems used by frontier AI labs Fortune 100s and fast-growing gen-AI startups. · In the last 3 months we've gone from $0 → $XXM in revenue growing 25% month-over-month with a team of just 13 people. · ...
3 weeks ago
This is a full-time, on-site role for a Research Engineer based in the San Francisco Bay Area. · We are building a next-generation cinematic video generation model and an advanced agentic workflow platform. ...
1 month ago
+As a Research Engineer at Magic you will work on training large AI models new inference-time compute techniques build internet-scale datasets and help prototype new research product ideas. · Strong general software engineering skills · Thorough knowledge of the deep learning lit ...
2 weeks ago
We are creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. · Mercor is at the intersection of labor markets and AI research. We partner with leading AI labs and enterprises to provide t ...
1 month ago
We're building Agentic AI that empowers software engineers by automating production engineering and SRE workflows. · Develop AI-powered workflows end-to-end, balancing research and engineering to create production-ready AI models, · Build and optimize data pipelines to process hi ...
3 weeks ago
We're an independent research team working at the cutting edge of AI safety and post‑training technique development.We're looking for experienced research engineers to help us push that frontier. · ...
3 weeks ago
We build large-scale LLM systems for roleplay, interactive storytelling, and emotionally intelligent content generation for millions of users.The goal of our post-training team is to turn foundation models into high-quality character agents with strong personality consistency, lo ...
1 week ago
This role sits at the intersection of AI research and engineering. · You will anticipate upcoming data needs at leading AI labs, · translate research signals into executable data projects, · and drive early pilots with teams working at the frontier.Anticipate future human data re ...
1 week ago