AI Safety Specialist (AI Engineering)

🇺🇸 San Francisco Bay Area, CA
Posted 3 weeks ago
Expires June 23, 2026
Full TimeOn-siteEngineeringCompliance

The AI Safety Specialist (AI Engineering) at Hyphen Connect is a pivotal role focused on enhancing the security and robustness of language models. This position involves working within a dynamic team dedicated to ensuring the safe deployment of AI systems, aligning with the company's commitment to ethical AI practices.

Key responsibilities include conducting adversarial testing on large language models (LLMs) and multimodal agents to identify vulnerabilities. The specialist will implement guardrails and real-time filtering mechanisms for autonomous tool usage, ensuring AI behaviors adhere to established safety protocols. Additionally, the role involves developing constitutional AI principles and assisting with reinforcement learning from human feedback (RLHF) alignment pipelines.

Candidates should possess a background in cybersecurity, prompt engineering, or adversarial machine learning. Experience with jailbreak taxonomies and automated red-teaming frameworks is essential. A strong analytical mindset is required to effectively identify and address edge cases in AI behavior.

Hyphen Connect offers a competitive compensation package, including benefits such as health insurance, retirement plans, and opportunities for professional development. Employees are encouraged to engage in continuous learning and contribute to the company's innovative projects.

The company fosters a collaborative and inclusive culture, emphasizing the importance of ethical AI development. Team members have access to growth opportunities within the organization, making it an ideal environment for professionals passionate about advancing AI safety and ethics.

More Jobs at Hyphen Connect