You will lead the development of a first-of-its-kind RLOps platform, designing scalable infrastructure for RL model training and LLM finetuning. By integrating advanced machine learning frameworks into an open-source ecosystem, you will provide critical tools for businesses to deploy reinforcement learning models effectively while staying at the forefront of AI research.
This job is no longer actively hiring. Open Roles to see active jobs.
Reinforcement Learning Engineer at fast-growing AI infrastructure startup
Are you ready to shape the future of RLOps? We are looking for a Reinforcement Learning Engineer to build a first-of-its-kind platform for training and deploying RL models at scale. You'll work on cutting-edge open-source frameworks, enjoy a 6-month remote work policy, and receive significant stock options in a well-funded AI startup. If you have deep expertise in PyTorch and distributed computing, this is your chance to lead the development of industrial-grade RL infrastructure in London.
Overview
Role overview
Company
About the company
Fast-growing AI infrastructure startup
Responsibilities
What you will do
- Design and implement the architecture for a scalable RLOps platform and a robust open-source RL framework.
- Integrate diverse ML libraries and environments to support advanced model training, deployment, and lifecycle management.
- Stay current with the latest RL and MLOps advancements to incorporate cutting-edge algorithms into the platform's core.
Candidate profile
Who this is a fit for
- Holds a Master's or Ph.D. in Computer Science or has 3+ years of industry experience in reinforcement learning.
- Possesses deep expertise in PyTorch, Ray, or Gym, along with a strong background in hyperparameter optimization.
- Has proven experience building machine learning tooling, cloud-based distributed infrastructure, and production deployment pipelines.
What makes it remarkable
Why this role is remarkable
- Opportunity to build pioneering RLOps infrastructure and open-source tools from the ground up in a high-impact field.
- Join a well-funded venture backed by top-tier VCs at the intersection of reinforcement learning and production-ready MLOps.
- Benefit from a highly flexible work environment with 6-month remote policies and a dedicated annual learning budget.
Jack & Jill
How Jack & Jill work together
Meet Jack
Jack gets to know what you're great at and what you want next, then searches 14 million jobs daily and introduces you directly to hiring managers.
How does this work?
Jack's an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack's network.
If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.