- Improve the platform's auth, billing, and payment systems
- Add new features to the interactive Playground where customers can try our models
- Implement new platform features for managing deployments
- Write and ship minimal code that runs in low-resource environments, and has highly stringent deployment mechanisms
- As security and privacy are paramount, you will sometimes need to reinvent the wheel, and won't be able to use the most popular libraries or tooling
- You have 5+ years of experience writing clean backend code. Our stack includes: Golang and React.
- You've built payment systems and have experience with subscription or usage-based SaaS, and/or products with a freemium model.
- You have strong coding abilities and are comfortable working across the stack. You're able to read and understand, and even fix issues outside of the main code base.
- You've worked in both large enterprises and startups.
- You excel in fast-paced environments and can execute while priorities and objectives are a moving target.
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits, including a separate budget to take care of your mental health
- 100% Parental Leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days)
-
We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, · and Conviction. · ...
New York1 month ago
-
We enable companies operating at the frontier of AI to bring cutting-edge models into production. · ...
New York, NY5 days ago
-
Anthropic's Inference organization is responsible for serving Claude to millions of users and enterprise customers with the speed, reliability, and efficiency that frontier AI demands. · We offer competitive compensation and benefits, · optional equity donation matching, · genero ...
New York, NY1 week ago
-
Join us on our mission and shape the future as an Audio Inference Engineer for Model Efficiency. · <li>Work on advancing core audio model serving metrics, including latency, throughput, and quality by diving deep into our systems,<br/></li><li>Collaborate closely with both the tr ...
New York, NY2 weeks ago
-
We are seeking an experienced Senior ML Inference Engineer to join our team focusing on optimizing and deploying our production virtual staining models at scale.We will work on critical challenges such as reducing inference latency for whole slide imaging (WSI) from tens of minut ...
New York, NY1 week ago
-
We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...
New York6 days ago
-
The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters. · Optimize for Production: Go beyond stand ...
New York2 days ago
-
We want AI to be safe and beneficial for our users and for society as a whole.About The RoleOur Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide.You May Be a Good Fit If You · Pick up slack, even if i ...
New York $300,000 - $485,000 (USD)1 week ago
-
We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Our team is responsible for developing, deploying, and operating the AI platform deliv ...
New York1 month ago
-
Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · 5+ years of engineering experi ...
New York, NY1 month ago
-
+Job summaryAbout Anthropic · Anthropics mission is to create reliable interpretable and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. · +Designing intelligent routing algorithms that optimize request distribution across thou ...
New York, NY1 week ago
-
We want AI to be safe and beneficial for our users and for society as a whole. · About The RoleThe team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, · while enabling breakthrough research by giving our scientists the high-performance i ...
New York $300,000 - $485,000 (USD)1 month ago
-
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...
New York, NY1 month ago
-
We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Improve the platform's auth, billing, and payment systems · Add new features to the i ...
New York, NY1 month ago
-
The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D, and vLLM) and ...
New York $193,390 - $318,980 (USD)3 weeks ago
-
About us FriendliAI is building an AI inference platform · The RoleWe seek Inference Engine Engineer to optimize our core inference engine. · ...
New York3 weeks ago
-
++About the Job FriendliAI is seeking a Forward Deployed Engineer (FDE) to assist enterprises in deploying scaling and operating generative and agentic AI workloads on FriendliAI infrastructure. · ...
New York3 weeks ago
-
We power mission-critical inference for the world's most dynamic AI companies. · Baseten recently raised $150M Series D backed by investors including BOND and IVP. · Join us to build the platform engineers turn to ship AI products. As Infrastructure Product Manager at Baseten you ...
New York1 month ago
-
We're growing quickly and recently raised our $150M Series D, · Design and architect scalable infrastructure systems for our ML inference platform · Lead optimization of Kubernetes deployments for efficient, cost-effective model serving · ...
New York1 month ago
-
Blackdog is hiring a Senior AI Site Reliability Engineer to lead the reliability, scalability, and performance of our production AI/ML platform. This role is deeply technical and hands‑on, owning end‑to‑end stability for mission‑critical model serving, data pipelines, and GPU‑int ...
New York2 weeks ago
-
We are looking for a well-rounded AI/ML Systems Engineer to join our team and build LM Studio with us. · Build and maintain world-class on-device inference engines for LLMs and other models. · ...
New York Full time1 month ago
Full-Stack Software Engineer, Inference - New York - Cohere
Description
Who are we?
Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers.
Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.
Join us on our mission and shape the future
Why this role?
Cohere customers self-serve our API without any intervention. This team unlocks the complex technology we build for customers to understand, trust, and pay for.
As a Senior Software Engineer, You Will
You May Be a Good Fit If
If some of the above doesn't line up perfectly with your experience, we still encourage you to apply
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
Full-Time Employees At Cohere Enjoy These Perks
#J-18808-Ljbffr
-
Applied AI Inference Engineer
Only for registered members New York
-
Applied AI Inference Engineer
Only for registered members New York, NY
-
Engineering Manager, Inference Developer Productivity
Only for registered members New York, NY
-
Audio Inference Engineer, Model Efficiency
Only for registered members New York, NY
-
Sr. ML Inference Engineer
Only for registered members New York, NY
-
Audio Inference Engineer, Model Efficiency
Only for registered members New York
-
Forward Deployed Engineer, AI Inference
Only for registered members New York
-
Staff + Senior Software Engineer, Inference
Only for registered members New York
-
Site Reliability Engineer, Inference Infrastructure
Only for registered members New York
-
Staff Software Engineer, Inference Infrastructure
Only for registered members New York, NY
-
Staff + Senior Software Engineer, Inference
Only for registered members New York, NY
-
Senior/Staff Software Engineer, Inference
Only for registered members New York
-
Site Reliability Engineer, Inference Infrastructure
Only for registered members New York, NY
-
Full-Stack Software Engineer, Inference
Only for registered members New York, NY
-
Forward Deployed Engineer, AI Inference
Only for registered members New York
-
Software Engineer
Only for registered members New York
-
Forward Deployed Engineer
Only for registered members New York
-
Product Manager
Only for registered members New York
-
Senior Software Engineer
Only for registered members New York
-
Senior AI Site Reliability Engineer
Only for registered members New York
-
Software Engineer, AI/ML Systems
Full time Only for registered members New York