Hyphen Connect is seeking an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. This role is integral to developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.

The AI Specialist Engineer will be responsible for compressing and optimizing large language and vision models for on-device inference. Key tasks include developing pipelines for model distillation and hardware-specific compilation, as well as benchmarking performance across various NPU/GPU architectures.

Candidates should possess expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques. Hands-on experience with TensorRT, ONNX Runtime, and edge deployment is essential. Strong proficiency in C++ and Python is also required.

Hyphen Connect offers a dynamic work environment focused on innovation in AI and machine learning. Employees have opportunities for professional growth and development, working on projects that push the boundaries of technology.

AI Specialist (AI Engineering)

More Jobs at Hyphen Connect

Robotic Safety Systems & Compliance Architect

AI Specialist (AI Engineering)

AI Safety Specialist (AI Engineering)

AI/Robotics Product Manager