You will lead the technical development of a real-time conversational video interface, bridging the gap between human and machine communication. This hands-on role involves optimizing multimodal models for low latency, multilingual support, and natural interaction. You will collaborate with research teams to integrate state-of-the-art simulations into production-ready software for global enterprise applications.
This job is no longer actively hiring. Open Roles to see active jobs.
Senior Software Engineer at Series B Multimodal AI Lab
Are you ready to build the future of human-computer interaction? In this high-impact role, you'll solve complex challenges in low-latency communication, multiprocessing, and multilingual AI simulation. Backed by the world's top-tier VCs, this team is creating AI humans that can see, hear, and empathize at scale. If you are a Python expert with a passion for cutting-edge AI and high-performance systems, this is your chance to ship software that defines an entirely new category of technology.
Want to apply for this role?
This role is no longer actively hiring, but Jack can still help you discover similar open roles that fit.
Location
San Francisco, United States
Compensation
$160k-$250k + Equity
Company
Confidential company
Role overview
About the company
Series B backed multimodal AI lab
What you will do
- Own the delivery of core features including voice localization, sentence endpointing, and naturalness optimization for real-time video.
- Partner with research teams to integrate sophisticated multimodal models into a reliable, high-uptime production codebase.
- Optimize system performance by centralizing inter-process communication and shaving latency off utterance turn-taking for smoother conversations.
Who this is a fit for
- Expert in Python with extensive experience in asynchronous frameworks, multiprocessing, and low-level system concepts.
- Proven track record of shipping polished, reliable software in ambiguous, fast-paced environments where the state-of-the-art evolves rapidly.
- Strong communicator who can simplify complex technical concepts and has experience with LLM frameworks or WebRTC video streaming.
Why this role is remarkable
- Work at the forefront of human-computer interaction by building AI humans that see, hear, and respond in real-time.
- Join a well-funded Series B startup backed by top-tier VCs that is defining the conversational video interface category.
- Experience a high-impact environment where you can shape architecture and ship features that reach millions of users across multiple languages.
How Jack & Jill work together
Jack gets to know what you're great at and what you want next, then searches 15 million jobs daily and helps you discover roles at companies like this.
Meet Jack
What happens next?
Jack’s an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack’s network.
If your profile’s a match and Confidential company wants to meet, Jill will make the intro. In the meantime, Jack will send you excellent alternatives.