Software Engineer, Virtualization
As a Software Engineer specializing in Virtualization at fal, you will be instrumental in building and delivering custom compute environments tailored for customer workloads. This role involves working across the full stack, from Linux image building to overlay network design and cluster bootstrapping, ensuring optimal GPU performance and seamless integration within fal's generative media ecosystem.
In this role, you will build and deliver custom environments with excellent GPU performance for customer workloads, and provision and configure dedicated Kubernetes clusters tailored to customer requirements. You will design and implement overlay networking and routing configurations for tenant isolation and performance, build and maintain Linux images, and set up network monitoring and diagnostics for customer environments. You will also use AI to automate provisioning, alerting, and recovery, and automate the end-to-end lifecycle of customer compute environments: creation, configuration, validation, and teardown.
The ideal candidate has more than 5 years of experience with Linux virtualization technologies such as KVM/QEMU, libvirt, VFIO device passthrough, hugepages, NUMA, and CPU pinning. Strong networking fundamentals are essential, including VXLAN, VLAN, ECMP, BGP, and ARP, along with the ability to debug packet-level issues using tools like tcpdump and Wireshark. Also required are production experience building and operating Kubernetes clusters on bare metal, proficiency with Linux image building and OS provisioning, and expertise in Python, Bash, Ansible, and Terraform. Deep experience with NVIDIA GPUs is necessary as well, including drivers, MIG, container runtimes, InfiniBand, RDMA/RoCEv2, and GPUDirect for high-performance AI networking. Excellent communication skills and a proactive, self-starting attitude are highly valued.
Compensation for this position ranges from $180,000 to $250,000 annually, plus equity and benefits. Benefits include relocation assistance to San Francisco; health, dental, and vision insurance; and regular team events and offsites. The company offers a dynamic and challenging work environment with ample opportunities for learning and growth in AI and virtualization technologies.