Full-Stack Software Engineer, Inference - New York - Cohere

Cohere New York

5 days ago

Description

Who are we?

Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future

Why this role?

Cohere customers self-serve our API without any intervention. This team unlocks the complex technology we build for customers to understand, trust, and pay for.

As a Senior Software Engineer, You Will

Improve the platform's auth, billing, and payment systems
Add new features to the interactive Playground where customers can try our models
Implement new platform features for managing deployments
Write and ship minimal code that runs in low-resource environments, and has highly stringent deployment mechanisms
As security and privacy are paramount, you will sometimes need to reinvent the wheel, and won't be able to use the most popular libraries or tooling

You May Be a Good Fit If

You have 5+ years of experience writing clean backend code. Our stack includes: Golang and React.
You've built payment systems and have experience with subscription or usage-based SaaS, and/or products with a freemium model.
You have strong coding abilities and are comfortable working across the stack. You're able to read and understand, and even fix issues outside of the main code base.
You've worked in both large enterprises and startups.
You excel in fast-paced environments and can execute while priorities and objectives are a moving target.

If some of the above doesn't line up perfectly with your experience, we still encourage you to apply

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees At Cohere Enjoy These Perks

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days)

#J-18808-Ljbffr

Work in company
Applied AI Inference Engineer
Only for registered members

We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, · and Conviction. · ...

New York
1 month ago
Work in company
Applied AI Inference Engineer
Only for registered members

We enable companies operating at the frontier of AI to bring cutting-edge models into production. · ...

New York, NY
5 days ago
Work in company
Engineering Manager, Inference Developer Productivity
Only for registered members

Anthropic's Inference organization is responsible for serving Claude to millions of users and enterprise customers with the speed, reliability, and efficiency that frontier AI demands. · We offer competitive compensation and benefits, · optional equity donation matching, · genero ...

New York, NY
1 week ago
Work in company Remote job
Audio Inference Engineer, Model Efficiency
Only for registered members

Join us on our mission and shape the future as an Audio Inference Engineer for Model Efficiency. · <li>Work on advancing core audio model serving metrics, including latency, throughput, and quality by diving deep into our systems,<br/></li><li>Collaborate closely with both the tr ...

New York, NY
2 weeks ago
Work in company Remote job
Sr. ML Inference Engineer
Only for registered members

We are seeking an experienced Senior ML Inference Engineer to join our team focusing on optimizing and deploying our production virtual staining models at scale.We will work on critical challenges such as reducing inference latency for whole slide imaging (WSI) from tens of minut ...

New York, NY
1 week ago
Work in company
Audio Inference Engineer, Model Efficiency
Only for registered members

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

New York
6 days ago
Work in company Remote job
Forward Deployed Engineer, AI Inference
Only for registered members

The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters. · Optimize for Production: Go beyond stand ...

New York
2 days ago
Work in company
Staff + Senior Software Engineer, Inference
Only for registered members

We want AI to be safe and beneficial for our users and for society as a whole.About The RoleOur Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide.You May Be a Good Fit If You · Pick up slack, even if i ...

New York $300,000 - $485,000 (USD)
1 week ago
Work in company
Site Reliability Engineer, Inference Infrastructure
Only for registered members

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Our team is responsible for developing, deploying, and operating the AI platform deliv ...

New York
1 month ago
Work in company
Staff Software Engineer, Inference Infrastructure
Only for registered members

Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · 5+ years of engineering experi ...

New York, NY
1 month ago
Work in company
Staff + Senior Software Engineer, Inference
Only for registered members

+Job summaryAbout Anthropic · Anthropics mission is to create reliable interpretable and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. · +Designing intelligent routing algorithms that optimize request distribution across thou ...

New York, NY
1 week ago
Work in company
Senior/Staff Software Engineer, Inference
Only for registered members

We want AI to be safe and beneficial for our users and for society as a whole. · About The RoleThe team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, · while enabling breakthrough research by giving our scientists the high-performance i ...

New York $300,000 - $485,000 (USD)
1 month ago
Work in company
Site Reliability Engineer, Inference Infrastructure
Only for registered members

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...

New York, NY
1 month ago
Work in company
Full-Stack Software Engineer, Inference
Only for registered members

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Improve the platform's auth, billing, and payment systems · Add new features to the i ...

New York, NY
1 month ago
Work in company
Forward Deployed Engineer, AI Inference
Only for registered members

The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D, and vLLM) and ...

New York $193,390 - $318,980 (USD)
3 weeks ago
Work in company
Software Engineer
Only for registered members

About us FriendliAI is building an AI inference platform · The RoleWe seek Inference Engine Engineer to optimize our core inference engine. · ...

New York
3 weeks ago
Work in company
Forward Deployed Engineer
Only for registered members

++About the Job FriendliAI is seeking a Forward Deployed Engineer (FDE) to assist enterprises in deploying scaling and operating generative and agentic AI workloads on FriendliAI infrastructure. · ...

New York
3 weeks ago
Work in company
Product Manager
Only for registered members

We power mission-critical inference for the world's most dynamic AI companies. · Baseten recently raised $150M Series D backed by investors including BOND and IVP. · Join us to build the platform engineers turn to ship AI products. As Infrastructure Product Manager at Baseten you ...

New York
1 month ago
Work in company
Senior Software Engineer
Only for registered members

We're growing quickly and recently raised our $150M Series D, · Design and architect scalable infrastructure systems for our ML inference platform · Lead optimization of Kubernetes deployments for efficient, cost-effective model serving · ...

New York
1 month ago
Work in company
Senior AI Site Reliability Engineer
Only for registered members

Blackdog is hiring a Senior AI Site Reliability Engineer to lead the reliability, scalability, and performance of our production AI/ML platform. This role is deeply technical and hands‑on, owning end‑to‑end stability for mission‑critical model serving, data pipelines, and GPU‑int ...

New York
2 weeks ago
Work in company
Software Engineer, AI/ML Systems
Only for registered members

We are looking for a well-rounded AI/ML Systems Engineer to join our team and build LM Studio with us. · Build and maintain world-class on-device inference engines for LLMs and other models. · ...

New York Full time
1 month ago

Applied AI Inference Engineer
Only for registered members New York
Applied AI Inference Engineer
Only for registered members New York, NY
Engineering Manager, Inference Developer Productivity
Only for registered members New York, NY
Audio Inference Engineer, Model Efficiency
Only for registered members New York, NY
Sr. ML Inference Engineer
Only for registered members New York, NY
Audio Inference Engineer, Model Efficiency
Only for registered members New York
Forward Deployed Engineer, AI Inference
Only for registered members New York
Staff + Senior Software Engineer, Inference
Only for registered members New York
Site Reliability Engineer, Inference Infrastructure
Only for registered members New York
Staff Software Engineer, Inference Infrastructure
Only for registered members New York, NY
Staff + Senior Software Engineer, Inference
Only for registered members New York, NY
Senior/Staff Software Engineer, Inference
Only for registered members New York
Site Reliability Engineer, Inference Infrastructure
Only for registered members New York, NY
Full-Stack Software Engineer, Inference
Only for registered members New York, NY
Forward Deployed Engineer, AI Inference
Only for registered members New York
Software Engineer
Only for registered members New York
Forward Deployed Engineer
Only for registered members New York
Product Manager
Only for registered members New York
Senior Software Engineer
Only for registered members New York
Senior AI Site Reliability Engineer
Only for registered members New York
Software Engineer, AI/ML Systems
Full time Only for registered members New York