Skip to main content
Back to all jobs

Confidential company

Featured role

San Francisco, USA$160K – $250K + Equity
Featured roleSan Francisco, USA$160K – $250K + Equity

Senior Software Engineer at Series B Multimodal AI Lab

Are you ready to build the future of human-computer interaction? Join a premier Series B multimodal AI lab as a Senior Software Engineer and lead the development of real-time conversational video interfaces. In this high-impact role, you'll solve complex challenges in low-latency communication, multiprocessing, and multilingual AI simulation. Backed by the world's top-tier VCs, this team is creating AI humans that can see, hear, and empathize at scale. If you are a Python expert with a passion for cutting-edge AI and high-performance systems, this is your chance to ship software that defines an entirely new category of technology.

Overview

Why this role stands out

You will lead the technical development of a real-time conversational video interface, bridging the gap between human and machine communication. This hands-on role involves optimizing multimodal models for low latency, multilingual support, and natural interaction. You will collaborate with research teams to integrate state-of-the-art simulations into production-ready software for global enterprise applications.

Company

About the company

Series B backed multimodal AI lab

Responsibilities

What you will do

  • Own the delivery of core features including voice localization, sentence endpointing, and naturalness optimization for real-time video.
  • Partner with research teams to integrate sophisticated multimodal models into a reliable, high-uptime production codebase.
  • Optimize system performance by centralizing inter-process communication and shaving latency off utterance turn-taking for smoother conversations.

Candidate profile

Who this is a fit for

  • Expert in Python with extensive experience in asynchronous frameworks, multiprocessing, and low-level system concepts.
  • Proven track record of shipping polished, reliable software in ambiguous, fast-paced environments where the state-of-the-art evolves rapidly.
  • Strong communicator who can simplify complex technical concepts and has experience with LLM frameworks or WebRTC video streaming.

What makes it remarkable

Why this role is remarkable

  • Work at the forefront of human-computer interaction by building AI humans that see, hear, and respond in real-time.
  • Join a well-funded Series B startup backed by top-tier VCs that is defining the conversational video interface category.
  • Experience a high-impact environment where you can shape architecture and ship features that reach millions of users across multiple languages.

Jack & Jill

How Jack & Jill work together

Jack
I get to know what you’re great at, then find roles you’d never find yourself.
Jill
I recruit from Jack’s network and make the intro when I spot a great match.

About Jack & Jill

Meet Jack

Jack gets to know what you are great at, what you want next, and makes sure Jill considers you for the right opportunities.

How does this work?

Jack's an AI agent for job searching and career coaching. He works for you.

Jill is the AI recruiter working for the company. She recruits from Jack's network.

If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.

Learn more aboutJack

Ready to find your next role?

Talk to Jack for 10 minutes and see your first matches.