Site Reliability Engineer - AI & ML Infrastructure (Kubernetes & Terraform)
Deepgram (6 days ago)
About this role
Deepgram is seeking an experienced Site Reliability Engineer to develop and operate its hybrid AI/ML research and product platform, which spans cloud and on-premises environments. The role involves designing scalable infrastructure, managing high-performance GPU workloads, and collaborating with AI researchers to accelerate development. It offers the opportunity to work with cutting-edge technology at the intersection of platform engineering and AI.
Required Skills
- Kubernetes
- Terraform
- Slurm
- Python
- CNI
- Rook
- Ceph
- Cilium
- PXE
- MAAS
About Deepgram
www.deepgram.com
Deepgram is an advanced voice technology company that specializes in providing enterprise solutions through its Speech-to-Text (STT), Text-to-Speech (TTS), and Voice Agent APIs. Its platform offers real-time, highly accurate transcription and voice synthesis capabilities, designed to scale with the needs of businesses. Deepgram's solutions are built for integration into various applications, enabling companies to leverage sophisticated voice AI technology for enhanced customer interactions and operational efficiency.
Similar Jobs
Site Reliability Engineer, AI/ML Infrastructure
Boson AI (14 days ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI (6 days ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI (1 month ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI (1 month ago)
AI and ML HPC Cluster Engineer
NVIDIA (1 month ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI (1 month ago)