MLOps & Agentic Platform Engineer (AI Infrastructure)
Hyphen Connect is seeking a skilled MLOps & Agentic Platform Engineer to join our AI Infrastructure team in the San Francisco Bay Area. This role focuses on managing model registries, developing continuous training loops, and implementing A/B testing infrastructure to enhance our AI capabilities.
The engineer will be responsible for deploying agents as scalable microservices on Kubernetes, ensuring efficient and reliable AI operations. Additionally, the role involves building observability dashboards to monitor token usage, latency, and agent reasoning paths, providing critical insights into system performance.
Candidates should possess a strong background in DevOps and MLOps, with proficiency in Kubernetes, Docker, and Terraform. Experience with tools such as MLflow, Weights & Biases, or LangSmith is essential. A solid understanding of building scalable microservice architectures is also required.
Hyphen Connect offers a competitive compensation package, including benefits and perks designed to support our employees' well-being and professional growth.
Our company fosters a collaborative and innovative culture, providing ample opportunities for career advancement and skill development. We encourage applications from individuals passionate about AI infrastructure and eager to contribute to cutting-edge projects.