Reliability Engineer, Supercomputing
🇺🇸 San Francisco, CA
$4K - $5K Annual
Posted 3 days ago
Expires August 24, 2026
Thinking Machines Lab is seeking a Reliability Engineer to ensure the dependability of its GPU supercomputing infrastructure. This role involves managing the interface between hardware, firmware, and operating systems to maintain optimal performance for large-scale AI research. The engineer will be responsible for diagnosing and resolving hardware-related issues, collaborating with vendors, and implementing solutions that support the lab's advanced AI experiments.
More Jobs at Thinking Machines Lab
Network Engineer, Supercomputing
Thinking Machines Lab
🇺🇸 San Francisco, CA
$4K - $5K Annual
Full TimeOn-siteEngineering
3 days ago
Associate General Counsel, Corporate & Commercial
Thinking Machines Lab
🇺🇸 San Francisco, CA
$4K - $4K Annual
Full TimeOn-siteLegal TechLaw Firm+1
3 weeks ago
Associate General Counsel, Frontier AI & Privacy
Thinking Machines Lab
🇺🇸 San Francisco, California
$4K - $4K Annual
Full TimeOn-siteLegal TechLegal
1 month ago
Site Reliability Engineer (SRE)
Thinking Machines Lab
🇺🇸 San Francisco, California
$4K - $5K Annual
Full TimeOn-siteEngineeringOperations
2 months ago