AI Specialist (AI Engineering)
Hyphen Connect is seeking an AI Specialist Engineer to improve the performance of large language and vision models for on-device inference. The role is central to developing and deploying AI solutions that run efficiently across diverse hardware architectures.
The AI Specialist Engineer will compress and optimize large language and vision models for on-device inference. Key tasks include developing pipelines for model distillation and hardware-specific compilation, and benchmarking performance across NPU and GPU architectures.
Candidates should possess expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques. Hands-on experience with TensorRT, ONNX Runtime, and edge deployment is essential. Strong proficiency in C++ and Python is also required.
Hyphen Connect offers a dynamic work environment focused on innovation in AI and machine learning. Employees have opportunities for professional growth and development while working on projects that push the boundaries of technology.