Manager I, Engineering - AI Platform - Training & Serving
Datadog (21 days ago)
About this role
Datadog’s AI Platform organization builds the infrastructure that enables large-scale AI training and inference across the company, supporting products like Bits AI and LLM Observability as well as internal AI research. This role leads the Training & Serving team, shaping the technical vision and roadmap for distributed foundation model training and scalable serving systems, in close partnership with Applied AI and core infrastructure teams. The role is hybrid, in a workplace that emphasizes collaboration and office culture.
Required Skills
- People Management
- Team Scaling
- Technical Roadmap
- Distributed Training
- Model Serving
- Backend Engineering
- Data Engineering
- Infrastructure
- Data Pipelines
- Storage Systems
About Datadog
datadoghq.com
Datadog is a SaaS monitoring and observability platform that helps teams see inside infrastructure, applications, logs, and user experience across cloud-scale environments. It ingests metrics, traces, and logs from hundreds of integrations and provides unified dashboards, alerts, APM, log management, synthetics, and security monitoring. Developers, operations, and security teams use Datadog to detect, troubleshoot, and optimize performance and reliability for modern distributed systems.