ML Systems Engineer, Infrastructure & Cloud
Basis (2 months ago)
About this role
An ML Systems Engineer at Basis is responsible for developing and maintaining the infrastructure required for scalable and efficient training of machine learning models. This role includes managing distributed training frameworks, optimizing cloud resources and costs, ensuring security compliance, and troubleshooting complex system failures across hardware and software stacks. The engineer will collaborate with researchers to understand their needs, enabling reproducible research and operational excellence through detailed documentation and monitoring systems.
Required Skills
- ML Systems Engineering
- Distributed Training
- Cloud Administration
- Infrastructure Optimization
- Debugging Skills
- Resource Utilization
- AWS/GCP/Azure
- Kubernetes
- Security Compliance
- Documentation
About Basis
www.basis.ai
Basis is an innovative AI research organization dedicated to developing intelligence that can address complex scientific and societal challenges. The company's mission revolves around understanding and building advanced technological solutions that enhance our capability to solve intractable problems. Basis is focused on creating a universal reasoning engine and is actively engaged in both core AI technologies and tackling significant challenges through its research and initiatives.