Senior Platform Engineer
Pluralis Research (1 month ago)
About this role
A Senior Platform Engineer at Pluralis Research designs and oversees multi-cloud infrastructure, using infrastructure-as-code tools such as Pulumi and Terraform for provisioning and orchestration across AWS, GCP, and Azure. The role involves architecting fault-tolerant distributed training systems for machine learning, with a focus on GPU clusters and data management, and on making those systems hold up under real-world networking conditions. The engineer also optimizes resource scheduling in heterogeneous clusters and implements monitoring and observability to improve system performance and reliability.
Required Skills
- Multi-Cloud Infrastructure
- Distributed Training Systems
- Real-World Networking
- Infrastructure-as-Code
- Python Engineering
- Container & GPU
- Networking
- ML Infrastructure
- Observability & SRE
- Micro-Services Orchestration
About Pluralis Research
pluralis.ai
Pluralis Research is at the forefront of innovative machine learning techniques, specializing in Protocol Learning, which focuses on decentralized, communication-efficient model-parallel training for foundation models. Their research facilitates the multi-participant training of models, enabling collaborative efforts without any single participant ever possessing the complete model, promoting community ownership and sustainable economies. They strive to create cutting-edge, self-sustaining models that push the boundaries of traditional AI development. With a commitment to open-source collaboration, Pluralis Research is shaping the future of AI technologies.