Senior/Staff Site Reliability Engineer
fal.ai(5 hours ago)
About this role
A Senior Site Reliability Engineer at fal is responsible for maintaining the uptime, scalability, and reliability of critical production systems, primarily focusing on Kubernetes infrastructure, automation, and system monitoring. The role involves improving processes, building automation tools, and collaborating across teams to ensure optimal system performance.
Required Skills
- Kubernetes
- Terraform
- Ansible
- Prometheus
- Grafana
- Python
- Bash
- Networking
- CI/CD
- Monitoring
About fal.ai
fal.aifal.ai is a leading generative AI platform that enables developers to integrate an extensive library of over 600 generative media models, including image, video, and audio production tools, using a user-friendly API. The platform emphasizes cost-effectiveness and flexibility, allowing organizations to run models quickly without extensive setup, and scales efficiently through serverless GPUs and dedicated clusters. With a focus on rapid inference and real-time integration, fal.ai aims to streamline the deployment and management of AI solutions for businesses of all sizes, from startups to enterprises.
View more jobs at fal.ai →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at fal.ai
Similar Jobs
Site Reliability Engineer Staff
Hewlett Packard Enterprise(14 days ago)
Senior Site Reliability Engineer
Zoom(19 days ago)
Site Reliability Engineer Sr. Staff
Hewlett Packard Enterprise(13 days ago)
Site Reliability Engineer Staff
Hewlett Packard Enterprise(14 days ago)
Sr Site Reliability Engineer (SRE) – Infra Focus
LinkedIn(13 days ago)
Site Reliability Engineer Sr. Staff
Hewlett Packard Enterprise(13 days ago)