Hyphen Connect is seeking an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. The role centers on developing and deploying these models so that they run efficiently across diverse hardware architectures.

Key responsibilities include compressing and optimizing large language and vision models for on-device inference, developing pipelines for model distillation and hardware-specific compilation, and benchmarking performance across various NPU/GPU architectures.

The ideal candidate will have expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques. Hands-on experience with TensorRT, ONNX Runtime, and edge deployment is essential. Strong proficiency in C++ and Python is also required.

Hyphen Connect offers a dynamic work environment where innovation and collaboration are highly valued. Employees have opportunities for professional growth and development, working on impactful projects that leverage the latest advancements in AI technology.

AI Specialist (AI Engineering)
