Senior Site Reliability Engineer, Managed AI
Crusoe (11 days ago)
About this role
Crusoe is hiring a Senior Site Reliability Engineer to ensure the reliability and scalability of its AI-optimized cloud platform. The role involves designing and operating AI services, automating systems, and collaborating with other teams to optimize large-scale AI workloads. The position is integral to delivering high-performance, cost-efficient AI infrastructure for compute-intensive applications.
Required Skills
- Distributed Systems
- Kubernetes
- Python
- Go
- C++
- Telemetry
- Monitoring
- ML Infrastructure
- Fault Tolerance
- Performance Tuning
About Crusoe
crusoe.ai
Crusoe is a leading provider of next-generation AI infrastructure that focuses on renewable-powered cloud computing solutions. By employing an energy-first approach, Crusoe enables businesses to deploy AI workloads at scale while ensuring reliable performance and round-the-clock support. The company is committed to advancing sustainable technology, making it a strategic partner for organizations looking to enhance their AI capabilities in an environmentally conscious manner.
Similar Jobs
Senior Site Reliability Engineer (SRE) - Chaos Engineering (Brazil)
Articul8 (1 month ago)
Staff Site Reliability Engineer
Tabs (30 days ago)
Sr. Software Reliability Engineer for AI
MixMode (1 month ago)
Senior Site Reliability Engineer (SRE) - (Dublin, CA)
Articul8 (1 month ago)
Principal Site Reliability Engineer
UiPath (24 days ago)
Senior Site Reliability Engineer
Clarifai (1 month ago)