Staff Software Engineer, GPU Infrastructure (HPC)
Cohere (30 days ago)
About this role
Cohere is seeking a Staff Software Engineer to build and operate AI infrastructure, focusing on deploying GPU/TPU superclusters across multiple clouds. The role involves collaborating with AI researchers to optimize and troubleshoot high-performance computing environments for machine learning workloads. The position is integral to accelerating the development of foundational AI models that power Cohere's platform.
Required Skills
- Kubernetes
- Python
- Go
- RDMA
- NCCL
- Distributed Training
- HPC
- Linux
- Cloud Infrastructure
- Machine Learning
About Cohere
cohere.com