MLOps & Agentic Platform Engineer (AI Infrastructure)
Hyphen Connect is seeking a skilled MLOps & Agentic Platform Engineer to join our AI Infrastructure team. In this role, you will be responsible for managing model registries, developing continuous training loops, and implementing A/B testing infrastructure to support our AI initiatives. This position offers an opportunity to contribute to cutting-edge AI projects within a dynamic and innovative company.
As an MLOps & Agentic Platform Engineer, your primary responsibilities will include deploying agents as scalable microservices on Kubernetes, building observability dashboards to monitor token usage, latency, and agent reasoning paths, and ensuring the reliability and scalability of our AI systems. You will collaborate closely with cross-functional teams to enhance our AI infrastructure and support the deployment of machine learning models into production environments.
The ideal candidate will possess a strong DevOps/MLOps background, with proficiency in Kubernetes, Docker, and Terraform. Experience with tools such as MLflow, Weights & Biases, or LangSmith is essential. Additionally, a solid understanding of building scalable microservice architectures is required to excel in this role.
Hyphen Connect offers a competitive compensation package, including a comprehensive benefits plan and opportunities for professional growth. We are committed to fostering a collaborative and inclusive work environment where innovation and creativity are encouraged.
Joining Hyphen Connect means becoming part of a forward-thinking company that values technological advancement and employee development. We provide ample opportunities for career progression and the chance to work on impactful AI projects that make a difference.