Reinforcement Learning Engineer at fast-growing AI infrastructure startup

Are you ready to shape the future of RLOps? We are looking for a Reinforcement Learning Engineer to build a first-of-its-kind platform for training and deploying RL models at scale. You'll work on cutting-edge open-source frameworks, enjoy a 6-month remote work policy, and receive significant stock options in a well-funded AI startup. If you have deep expertise in PyTorch and distributed computing, this is your chance to lead the development of industrial-grade RL infrastructure in London.

Overview

Role overview

You will lead the development of a first-of-its-kind RLOps platform, designing scalable infrastructure for RL model training and LLM finetuning. By integrating advanced machine learning frameworks into an open-source ecosystem, you will provide critical tools for businesses to deploy reinforcement learning models effectively while staying at the forefront of AI research.

Company

About the company

Fast-growing AI infrastructure startup

Responsibilities

What you will do

Design and implement the architecture for a scalable RLOps platform and a robust open-source RL framework.
Integrate diverse ML libraries and environments to support advanced model training, deployment, and lifecycle management.
Stay current with the latest RL and MLOps advancements to incorporate cutting-edge algorithms into the platform's core.

Candidate profile

Who this is a fit for

Holds a Master's or Ph.D. in Computer Science or has 3+ years of industry experience in reinforcement learning.
Possesses deep expertise in PyTorch, Ray, or Gym, along with a strong background in hyperparameter optimization.
Has proven experience building machine learning tooling, cloud-based distributed infrastructure, and production deployment pipelines.

What makes it remarkable

Why this role is remarkable

Opportunity to build pioneering RLOps infrastructure and open-source tools from the ground up in a high-impact field.
Join a well-funded venture backed by top-tier VCs at the intersection of reinforcement learning and production-ready MLOps.
Benefit from a highly flexible work environment with 6-month remote policies and a dedicated annual learning budget.

Jack & Jill

How Jack & Jill work together

I get to know what you’re great at, then find roles you’d never find yourself.Ok, I'll go first. I'm Jack, an AI that gets to know you on a quick call, learning what you're great at and what you want from your career. Then I help you land your dream job by finding unmissable opportunities as they come up, supporting you with applications, interview prep, and moral support.

I recruit from Jack’s network and make the intro when I spot a great match.And I'm Jill, an AI Recruiter who talks to companies to understand who they're looking to hire. Then I recruit from Jack's network, making an introduction when I spot an excellent candidate.

Meet Jack

Jack gets to know what you're great at and what you want next, then searches 14 million jobs daily and introduces you directly to hiring managers.

How does this work?

Jack's an AI agent for job searching and career coaching. He works for you.

Jill is the AI recruiter working for the company. She recruits from Jack's network.

If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.

Find a job with

Jack