Senior ML Systems Engineer, Frameworks & Tooling
Cohere(2 months ago)
About this role
A Senior ML Systems Engineer, Frameworks & Tooling at Cohere is responsible for designing and maintaining the training framework that powers large-scale language models. This role involves working at the intersection of large-scale training, distributed systems, and HPC infrastructure to develop training abstractions, enhance training throughput and stability, and create tools for monitoring and debugging. Collaboration with infrastructure teams and resolving performance bottlenecks across the ML systems stack are key aspects of the role, allowing for significant impact in advancing AI capabilities.
Required Skills
- Large-Scale Training
- Distributed Systems
- HPC Infrastructure
- Model Training
- Performance Optimization
- Monitoring Tools
- Logging Tools
- Debugging Tools
- Collaboration Skills
- JAX Internals
+9 more
About Cohere
cohere.comSanity is a platform that provides flexible content management solutions tailored for developers, marketers, and content creators. By utilizing a real-time collaborative editor and structured content, it allows users to build and manage high-performance applications and websites. Sanity’s APIs and flexible data model enable seamless integration with various frameworks and technologies, empowering users to deliver customized content experiences. With features like query-driven content fetching and an extensible plugin system, Sanity is designed to enhance productivity and scalability for teams of all sizes.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Cohere
Similar Jobs
Systems Engineer - AI Infrastructure
Clockwork.io(6 days ago)
Staff Site Reliability Engineer, Compute
Crusoe(1 month ago)
Production Engineer, Compute
Crusoe(1 month ago)
HPC Systems Engineer, Consumer Products
OpenAI(1 month ago)
Senior Systems Engineer - AI Infrastructure
Clockwork.io(6 days ago)
Senior HPC Cluster Engineer
Nebius(10 months ago)