This job is no longer actively hiring. Open Roles to see active jobs.

Confidential company

Job listing

London, United Kingdom
Not Disclosed + Equity

Reinforcement Learning Engineer at fast-growing AI infrastructure startup

Are you ready to shape the future of RLOps? We are looking for a Reinforcement Learning Engineer to build a first-of-its-kind platform for training and deploying RL models at scale. You'll work on cutting-edge open-source frameworks, enjoy a 6-month remote work policy, and receive significant stock options in a well-funded AI startup. If you have deep expertise in PyTorch and distributed computing, this is your chance to lead the development of industrial-grade RL infrastructure in London.

Overview

Role overview

You will lead the development of a first-of-its-kind RLOps platform, designing scalable infrastructure for RL model training and LLM fine-tuning. By integrating advanced machine learning frameworks into an open-source ecosystem, you will give businesses the tools they need to deploy reinforcement learning models effectively, while staying at the forefront of AI research.

Company

About the company

Fast-growing AI infrastructure startup

Responsibilities

What you will do

  • Design and implement the architecture for a scalable RLOps platform and a robust open-source RL framework.
  • Integrate diverse ML libraries and environments to support advanced model training, deployment, and lifecycle management.
  • Stay current with the latest RL and MLOps advancements to incorporate cutting-edge algorithms into the platform's core.

Candidate profile

Who this is a fit for

  • Holds a Master's or Ph.D. in Computer Science or has 3+ years of industry experience in reinforcement learning.
  • Possesses deep expertise in PyTorch, Ray, or Gym, along with a strong background in hyperparameter optimization.
  • Has proven experience building machine learning tooling, cloud-based distributed infrastructure, and production deployment pipelines.

What makes it remarkable

Why this role is remarkable

  • Opportunity to build pioneering RLOps infrastructure and open-source tools from the ground up in a high-impact field.
  • Join a well-funded venture backed by top-tier VCs at the intersection of reinforcement learning and production-ready MLOps.
  • Benefit from a highly flexible work environment with a 6-month remote work policy and a dedicated annual learning budget.

Jack & Jill

How Jack & Jill work together

Jack
I get to know what you’re great at, then find roles you’d never find yourself.
Jill
I recruit from Jack’s network and make the intro when I spot a great match.

Meet Jack

Jack gets to know what you're great at and what you want next, then searches 14 million jobs daily and introduces you directly to hiring managers.

How does this work?

Jack's an AI agent for job searching and career coaching. He works for you.

Jill is the AI recruiter working for the company. She recruits from Jack's network.

If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.

Find a job with Jack

Ready to find your next role?

Talk to Jack for 10 minutes and see your first matches.