Senior/Staff Infrastructure Engineer
fal.ai(14 hours ago)
About this role
A hands-on engineer focused on building software and processes to manage a large fleet of GPU servers, including automation, monitoring, and recovery systems. The role involves developing and maintaining infrastructure tools to support AI workloads and optimize hardware health across the fleet.
Required Skills
- Python
- Linux
- Terraform
- Docker
- NVIDIA
- GPUs
- Storage
- Networking
- Container Runtimes
- Systemd
About fal.ai
fal.aifal.ai is a leading generative AI platform that enables developers to integrate an extensive library of over 600 generative media models, including image, video, and audio production tools, using a user-friendly API. The platform emphasizes cost-effectiveness and flexibility, allowing organizations to run models quickly without extensive setup, and scales efficiently through serverless GPUs and dedicated clusters. With a focus on rapid inference and real-time integration, fal.ai aims to streamline the deployment and management of AI solutions for businesses of all sizes, from startups to enterprises.
View more jobs at fal.ai →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at fal.ai
Similar Jobs
HPC Solutions Architect
Lavendo(18 days ago)
AI Infrastructure Engineer
42dot(15 days ago)
Graphics Processing Unit (GPU) Engineer
Example Corp(2 months ago)
NeoCloud Senior Infrastructure Architect (Remote)
Myriad360(19 days ago)
HPE Private Cloud AI Center of Excellence (PCAI CoE)
Hewlett Packard Enterprise(21 days ago)
HPC AI Data Center Network Specialist
Hewlett Packard Enterprise(28 days ago)