fal.ai

Senior/Staff Site Reliability Engineer

fal.ai (5 hours ago)

San Francisco, CA | Onsite | Full Time | Senior | $180,000 - $250,000 | Site Reliability Engineering

About this role

A Senior Site Reliability Engineer at fal is responsible for maintaining the uptime, scalability, and reliability of critical production systems, primarily focusing on Kubernetes infrastructure, automation, and system monitoring. The role involves improving processes, building automation tools, and collaborating across teams to ensure optimal system performance.

Required Skills

  • Kubernetes
  • Terraform
  • Ansible
  • Prometheus
  • Grafana
  • Python
  • Bash
  • Networking
  • CI/CD
  • Monitoring

About fal.ai

fal.ai is a leading generative AI platform that enables developers to integrate an extensive library of over 600 generative media models, including image, video, and audio production tools, using a user-friendly API. The platform emphasizes cost-effectiveness and flexibility, allowing organizations to run models quickly without extensive setup, and scales efficiently through serverless GPUs and dedicated clusters. With a focus on rapid inference and real-time integration, fal.ai aims to streamline the deployment and management of AI solutions for businesses of all sizes, from startups to enterprises.
