Senior Software Engineer II, AI Workload Orchestration
CoreWeave is seeking a Senior Software Engineer II to join our AI Workload Orchestration team. In this role, you will design and operate our Kubernetes-native platform for admitting, scheduling, and managing AI workloads at scale. This platform integrates multiple orchestration and scheduling frameworks, including Kueue, Volcano, and Ray, to support modern AI training and inference workflows. As a key member of the team, you will own significant components of the platform, drive reliability and performance improvements, and help scale the system to meet growing customer demand and workload complexity.
Your primary responsibilities will include designing, building, and operating Kubernetes-native services for AI workload orchestration and scheduling. You will own one or more platform components end-to-end, encompassing design, implementation, testing, and on-call support. Additionally, you will improve scheduling latency, cluster utilization, and workload reliability through metrics-driven engineering. Collaborating closely with adjacent teams, such as CKS, infrastructure, and managed inference, you will ensure clean interfaces and integrations. Mentoring junior engineers and raising the quality bar for code, design, and operations will also be a key part of your role.
The ideal candidate will have 5–8 years of professional software engineering experience in distributed systems, cloud infrastructure, or platform engineering. Strong experience building production systems in Go is required; additional experience in Python or C++ is a plus. A solid understanding of Kubernetes fundamentals, APIs, and controllers, along with experience operating services in production, is essential. Experience working with scheduling, resource management, or quota-based systems is also important. You must have a proven ability to improve system reliability and performance using data and operational metrics, and be comfortable owning services in production and participating in on-call rotations.
Preferred qualifications include experience with Kubernetes-native orchestration frameworks such as Kueue, Volcano, Ray, Kubeflow, or Argo Workflows. Familiarity with GPU-based workloads, machine learning training, or inference pipelines is advantageous. Knowledge of scheduling concepts such as quota enforcement, preemption, and backfilling is also desirable, as is experience with reliability practices including SLOs, alerting, and incident response. Exposure to AI infrastructure, high-performance computing, or large-scale distributed compute environments will be beneficial.
The base salary range for this role is $165,000 to $242,000 annually. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program. Benefits include medical, dental, and vision insurance fully paid by CoreWeave; company-paid life insurance; voluntary supplemental life insurance; short- and long-term disability insurance; a flexible spending account; a health savings account; tuition reimbursement; the ability to participate in the Employee Stock Purchase Program (ESPP); mental wellness benefits through Spring Health; family-forming support provided by Carrot; paid parental leave; flexible, full-service childcare support with Kinside; a 401(k) with a generous employer match; flexible PTO; catered lunch each day in our office and data center locations; a casual work environment; and a work culture focused on innovative disruption.
At CoreWeave, we work hard, have fun, and move fast. We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at Your Core, Act Like an Owner, Empower Employees, Deliver Best-in-Class Client Experiences, and Achieve More Together. We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!