You will lead the quality and coverage of data powering next-generation foundation models. As the in-house expert on global datasets, you'll ensure exceptional performance across dozens of languages. You will bridge the gap between research and production by building scalable systems to curate, evaluate, and steer massive multilingual data collections.
This job is no longer actively hiring. Talk to Jack to find live roles.
Research Engineer, Data at Fast-growing generative AI startup
Are you passionate about the data-centric side of AI? Join a world-class team of researchers from top labs at this fast-growing generative AI startup in San Francisco. You will own the data strategy for next-generation foundation models, building the multilingual datasets and evaluation systems that define how intelligence is scaled globally. This is a rare opportunity to work at the intersection of cutting-edge SSM research and production-level systems while receiving competitive compensation and equity in a well-funded venture.
Want to apply for this role?
This role is no longer actively hiring, but Jack can still help you discover similar open roles that fit.
Location
San Francisco, United States
Compensation
$180k-$250k + Equity
Company
Confidential company
Role overview
About the company
Fast-growing generative AI startup
What you will do
- Design and build large-scale multilingual datasets and run controlled experiments to measure their impact on model behavior.
- Develop automated quality control systems and speech model evaluations using both manual annotation and automated metrics.
- Implement advanced steering techniques to improve model intelligence through data and mitigate bias in generative outputs.
Who this is a fit for
- Proven experience building or working with large-scale multilingual datasets for generative models like speech or text.
- Strong applied machine learning background with a specific focus on data-centric approaches and scalable system building.
- Demonstrated ability to guide human annotation processes and evaluation metrics across multiple languages and cultures.
Why this role is remarkable
- Work at the frontier of model architecture innovation alongside founding experts from world-class AI labs.
- Join a well-funded team backed by top-tier VCs and industry-leading AI advisors during a high-growth phase.
- Directly influence the intelligence and inclusivity of global-scale models used for audio, video, and text processing.
How Jack & Jill work together
Jack gets to know what you're great at and what you want next, then searches 15 million jobs daily and helps you discover roles at companies like this.
Meet Jack
What happens next?
Jack’s an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack’s network.
If your profile’s a match and Confidential company wants to meet, Jill will make the intro. In the meantime, Jack will send you excellent alternatives.