You will build and maintain production-grade Kubernetes infrastructure in bare-metal and cloud environments to support high-scale AI inference workloads. Working directly with technical founders who built infrastructure at Crusoe Energy, you will implement GitOps workflows using ArgoCD to manage fleets of clusters for customers, including a major $5B enterprise.
Kubernetes DevOps Engineer at MIT-founder-led AI infrastructure startup
Building the 'operating system' for AI inference, this MIT-founded seed-stage startup is making distributed computing as accessible as the PC once was. They are looking for a Kubernetes expert who can move beyond managed services to build bare-metal clusters from scratch for a $5B customer. You will work directly with a technical co-founder who built infra at Crusoe Energy, owning the full stack of GitOps, GPU orchestration, and high-performance networking in San Francisco. If you want to skip the platform team bureaucracy and ship production-grade infrastructure for the AI boom, this is your role.
Role overview
Company
Aranya
MIT-founder-led, seed-stage startup building a GitOps-native distributed operating system for AI inference
What you will do
- Build, provision, and maintain production Kubernetes clusters from scratch across bare metal, cloud, and hybrid environments.
- Architect and implement GitOps-driven workflows using ArgoCD and GitLab CI to automate the lifecycle of distributed systems.
- Manage and optimize high-performance infrastructure tailored for GPU workloads, including high-throughput networking and distributed storage with Ceph.
Who this is a fit for
- Proven experience building Kubernetes clusters from the ground up on bare metal, not just operating managed Kubernetes services such as EKS or GKE.
- Deep technical expertise with infrastructure-as-code and automation tools including Ansible, ArgoCD, and the LGTM observability stack.
- Strong background in performance tuning and reliability engineering with the ability to debug complex distributed systems under production pressure.
Why this role is remarkable
- Work directly under a technical co-founder who pioneered GPU and K8s infrastructure at Crusoe Energy and Hyperbolic.
- Join a high-traction seed-stage startup already serving a $5B customer and addressing the massive AI inference infrastructure boom.
- Gain true ownership of the full stack, moving beyond managed services to build custom bare-metal cluster architectures from the ground up.
How Jack & Jill work together
Meet Jack
Jack gets to know what you're great at and what you want next, then searches 14 million jobs daily and introduces you directly to hiring managers.
How does this work?
Jack's an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack's network.
If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.