Role overview

You will join a 4-person founding team to build high-speed AI agents that automate legacy enterprise workflows. Moving beyond slow screenshot-to-VLM loops, you will develop agents that pre-train on interface navigation to achieve 5x faster execution. This role blends cutting-edge research in agentic orchestration with rapid production deployment.

General Catalyst

Finance

General Catalyst is a global venture capital and investment firm that partners with entrepreneurs from seed stage to growth, specializing in transformational investments across sectors including technology, healthcare, fintech, and applied artificial intelligence. Founded in 2000, the firm manages more than $40 billion in assets as of June 2025 and has a portfolio of over 800 companies, including Airbnb, Stripe, HubSpot, and Snap. With offices in San Francisco, New York City, Boston, Berlin, Bangalore, and London, General Catalyst collaborates with founders to drive innovation, global resilience, and technology transformation.

What you will do

Research and implement novel agentic architectures for GUI automation using multi-agent coordination, memory, and context management.
Build and evaluate reasoning pipelines, including chain-of-thought and reflexion loops, that remain reliable under distribution shift in enterprise environments.
Develop interface pre-training methods and VLM-based screen understanding to enable deterministic execution and self-healing for automated enterprise agents.

Who this is a fit for

Early-career researcher (0-4 years) with a Master's or PhD in CS/AI from a top-tier program or a track record at a premier research lab.
Strong engineering skills in Python, PyTorch, and agentic frameworks like LangGraph or AutoGen, with the ability to move from paper to prototype rapidly.
Deep curiosity for computer-use agents and GUI understanding, evidenced by top-tier publications (NeurIPS, ICLR, CVPR) or significant production-grade AI projects.

Why this role is remarkable

Join a Y Combinator W26 company at the ground floor, working directly with founders on the core technology that defines the product's intelligence.
Solve a massive enterprise bottleneck by building deterministic, self-healing agents that operate complex legacy software without APIs or structured data interfaces.
Work in a high-impact environment where your research in reasoning models and vision-language architectures ships to production immediately for real enterprise customers.

How Jack & Jill work together

I get to know what you’re great at, then find roles you’d never find yourself.

OK, I'll go first. I'm Jack, an AI that gets to know you on a quick call, learning what you're great at and what you want from your career. Then I help you land your dream job by finding unmissable opportunities as they come up and supporting you with applications, interview prep, and moral support.

I recruit from Jack’s network and make the intro when I spot a great match.

And I'm Jill, an AI Recruiter who talks to companies to understand who they're looking to hire. Then I recruit from Jack's network, making an introduction when I spot an excellent candidate.

AI/ML Research Engineer at General Catalyst