Skip to main content
Back to all jobs
Cosine.sh

Cosine.sh

Featured role

London, UK£80k-£110k + Equity
Featured roleLondon, UK£80k-£110k + Equity

ML Engineer at Cosine.sh

Join Cosine.sh in London to lead the ML training of the world's most advanced autonomous coding agents. As an ML Engineer, you will own the SFT and RL pipelines for the Lumen Enterprise models, moving beyond simple autocomplete to build agents that reason through entire codebases. Working on multi-node GPU clusters with direct access to the CEO, your work will define the state-of-the-art in AI-driven software engineering. If you are a PyTorch expert who wants to see your models ship to enterprise customers instead of just publishing papers, this is the role for you.

Overview

Why this role stands out

Join a high-intensity ML team owning the post-training of Lumen Enterprise, the LLMs powering the world’s best autonomous coding agent. You will drive SFT, RL, and continued pretraining to push state-of-the-art performance on complex software engineering tasks. This role offers direct impact on a product used by global enterprises.

Company

Cosine.sh

Cosine.sh

cosine.sh

Visit website
Software

Cosine.sh - YC-backed AI startup

Responsibilities

What you will do

  • Transform open-source base models into high-performance SWE agents through supervised fine-tuning and advanced reinforcement learning (PPO, GRPO, or DPO).
  • Design and execute large-scale training experiments on multi-node clusters, optimizing for long-context stability and tool-use reasoning.
  • Build and iterate on automated RL loops where models are rewarded for successfully running tests, linters, and static analysis on real-world codebases.

Candidate profile

Who this is a fit for

  • 3-5+ years of experience training deep learning models in production with deep proficiency in PyTorch distributed primitives like FSDP and DDP.
  • Proven track record of training large-scale models (≥70B parameters) and implementing complex RLVR systems for LLM alignment.
  • Strong software engineering background with the ability to write production-grade Python and a focus on data quality and sampling strategies.

What makes it remarkable

Why this role is remarkable

  • Direct ownership of post-training for Genie, a SOTA coding agent that achieved a 72% score on OpenAI’s SWE-Lancer benchmark.
  • Work at the technical frontier with multi-node GPU clusters, large-scale MoE architectures, and long-context training on proprietary software-engineering reasoning data.
  • Join a small, elite 4-person ML team reporting directly to the CEO, where your training runs ship immediately to real-world enterprise users.

Jack & Jill

How Jack & Jill work together

Jack
I get to know what you’re great at, then find roles you’d never find yourself.
Jill
I recruit from Jack’s network and make the intro when I spot a great match.

About Jack & Jill

Meet Jack

Jack gets to know what you are great at, what you want next, and makes sure Jill considers you for the right opportunities.

How does this work?

Jack's an AI agent for job searching and career coaching. He works for you.

Jill is the AI recruiter working for the company. She recruits from Jack's network.

If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.

Learn more aboutJack

Ready to find your next role?

Talk to Jack for 10 minutes and see your first matches.