CoreWeave

Reliability Lead, Common Services

CoreWeave(1 day ago)

HybridFull TimeManager$206,000 - $303,000Engineering
Apply Now

About this role

Reliability Lead, Common Services at CoreWeave will establish and lead the SRE and production operations practice for the Common Services organization, defining reliability strategy, processes, and standards. The role partners with engineering and product teams to ensure CoreWeave’s shared platforms are reliable, observable, and operable at scale.

View Original Listing

Required Skills

  • Site Reliability
  • Production Engineering
  • Incident Management
  • Observability
  • SLOs/SLIs
  • Linux
  • Kubernetes
  • Terraform
  • Automation
  • Capacity Planning

+1 more

CoreWeave

About CoreWeave

coreweave.com

CoreWeave is a cloud provider purpose-built for GPU-accelerated AI and high-performance compute workloads, positioning itself as "The Essential Cloud for AI." It offers on-demand and dedicated GPU infrastructure (bare metal, virtual machines, and Kubernetes), high-performance networking and storage, and managed services to support large-scale training, inference, and graphics rendering. CoreWeave emphasizes performance, cost-efficiency, and operational support so enterprises and research teams can deploy and scale AI workloads with predictable performance and security.

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com