Staff Site Reliability Engineer, Managed AI
Crusoe (22 days ago)
About this role
Crusoe is seeking a Staff Site Reliability Engineer to ensure the reliability and scalability of its AI-optimized cloud platform. The role involves building and operating managed AI services, focusing on large language models, distributed systems, and automation to support AI workloads.
Required Skills
- Python
- Go
- Java
- C++
- Kubernetes
- Distributed Systems
- Telemetry
- Monitoring
- AI Infrastructure
- Large Language Models
About Crusoe
crusoe.ai
Crusoe is a leading provider of next-generation AI infrastructure that focuses on renewable-powered cloud computing solutions. By employing an energy-first approach, Crusoe enables businesses to deploy AI workloads at scale while ensuring reliable performance and round-the-clock support. The company is committed to advancing sustainable technology, making it a strategic partner for organizations looking to enhance their AI capabilities in an environmentally conscious manner.
Similar Jobs
Principal Site Reliability Engineer
UiPath (24 days ago)
Staff Site Reliability Engineer
Replit (3 months ago)
Senior Site Reliability Engineer
SingleStore (25 days ago)
Staff Software Engineer - AI Workload Orchestration
CoreWeave (23 days ago)
Site Reliability Engineer, Inference Infrastructure
Cohere (1 month ago)
Staff Applied Scientist
Hippocratic AI (11 months ago)