Software Engineer, Model Routing & Inference

Cursorai· via Ashby

Total 26B2B 0AI 21Web3 0Poland/EU 0

Details

Location: New York
Remote: onsite
Employment: fulltime
Seniority: unknown
Category: ai
Salary: —
Published: 2026-04-07
First seen by tracker: 2026-06-29
Last seen: 2026-06-30

Apply ↗

Why this score

Evidence

Title matches "software engineer"+10
Stack: distributed systems+4

Warnings

Salary not provided

Notes

Description

Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code. ABOUT THE ROLE As a Software Engineer on the Model Routing & Inference team at Cursor, you'll build the inference platform that powers every AI interaction in the product. This team owns the full inference path: making Cursor's AI faster, more reliable, and more cost-effective at a scale few teams in the world get to operate at. Every agent session, every tab completion, and every chat message flows through your stack. EXAMPLE PROJECTS INCLUDE... - Building and evolving our inference gateway, a single abstraction over every provider's API semantics, so model onboarding becomes a config change. - Designing intelligent cross-provider failover so no single provider outage causes user-visible degradation. - Designing routing backpressure and admission control so traffic spikes don't cascade into providers. YOU MAY BE A FIT IF - You have deep experience building high-throughput, low-latency distributed systems, especially in inference serving, traffic routing, or real-time data pipelines. - You're comfortable reasoning about cost/performance tradeoffs at scale (GPU utilization, provider economics, capacity planning). - You have strong software engineering fundamentals and enjoy shipping production systems that handle millions of requests. - You make good calls in the gray area: weighing reliability, cost, latency, and user experience when there isn't a single "right" answer. APPLYING If there appears to be a fit, we'll reach to schedule 2-3 short technicals. After, we'll schedule an onsite in our office, where you'll work on a small project, discuss ideas, and meet the team. #LI-DNI