Skip to main content
Back to all jobs

This job is no longer actively hiring. Open Roles to see active jobs.

Research Engineer, Data at Fast-growing generative AI startup

Are you passionate about the data-centric side of AI? Join a world-class team of researchers from top labs at this fast-growing generative AI startup in San Francisco. You will own the data strategy for next-generation foundation models, building the multilingual datasets and evaluation systems that define how intelligence is scaled globally. This is a rare opportunity to work at the intersection of cutting-edge SSM research and production-level systems while receiving competitive compensation and equity in a well-funded venture.

Want to apply for this role?

C

This role is no longer actively hiring, but Jack can still help you discover similar open roles that fit.

Location

San Francisco, United States

Compensation

$180k-$250k + Equity

Company

Confidential company

See Open Roles

Role overview

You will lead the quality and coverage of data powering next-generation foundation models. As the in-house expert on global datasets, you'll ensure exceptional performance across dozens of languages. You will bridge the gap between research and production by building scalable systems to curate, evaluate, and steer massive multilingual data collections.

About the company

Fast-growing generative AI startup

What you will do

  • Design and build large-scale multilingual datasets and run controlled experiments to measure their impact on model behavior.
  • Develop automated quality control systems and speech model evaluations using both manual annotation and automated metrics.
  • Implement advanced steering techniques to improve model intelligence through data and mitigate bias in generative outputs.

Who this is a fit for

  • Proven experience building or working with large-scale multilingual datasets for generative models like speech or text.
  • Strong applied machine learning background with a specific focus on data-centric approaches and scalable system building.
  • Demonstrated ability to guide human annotation processes and evaluation metrics across multiple languages and cultures.

Why this role is remarkable

  • Work at the frontier of model architecture innovation alongside founding experts from world-class AI labs.
  • Join a well-funded team backed by top-tier VCs and industry-leading AI advisors during a high-growth phase.
  • Directly influence the intelligence and inclusivity of global-scale models used for audio, video, and text processing.

How Jack & Jill work together

Jack
I get to know what you’re great at, then find roles you’d never find yourself.
Jill
I recruit from Jack’s network and make the intro when I spot a great match.
Thumbnail for Meet Jack

Jack gets to know what you're great at and what you want next, then searches 15 million jobs daily and helps you discover roles at companies like this.

Meet Jack

What happens next?

Jack’s an AI agent for job searching and career coaching. He works for you.

Jill is the AI recruiter working for the company. She recruits from Jack’s network.

If your profile’s a match and Confidential company wants to meet, Jill will make the intro. In the meantime, Jack will send you excellent alternatives.

Learn about Jack

Ready to find your next role?

Talk to Jack for 10 minutes and see your first matches.