Datadog

Manager I, Engineering - AI Platform - Training & Serving

Datadog(21 days ago)

HybridFull TimeManager$126,627 - $165,535 (estimated)Engineering
Apply Now

About this role

Datadog’s AI Platform organization builds the infrastructure that enables large-scale AI training and inference across the company, supporting products like Bits AI and LLM Observability as well as internal AI research. This role leads the Training & Serving team, shaping the technical vision and roadmap for distributed foundation model training and scalable serving systems, in close partnership with Applied AI and core infrastructure teams. The position is within a hybrid workplace focused on collaboration and office culture.

View Original Listing

Required Skills

  • People Management
  • Team Scaling
  • Technical Roadmap
  • Distributed Training
  • Model Serving
  • Backend Engineering
  • Data Engineering
  • Infrastructure
  • Data Pipelines
  • Storage Systems

+3 more

Datadog

About Datadog

datadoghq.com

Datadog is a SaaS monitoring and observability platform that helps teams see inside infrastructure, applications, logs, and user experience across cloud-scale environments. It ingests metrics, traces, and logs from hundreds of integrations and provides unified dashboards, alerts, APM, log management, synthetics, and security monitoring. Developers, operations, and security teams use Datadog to detect, troubleshoot, and optimize performance and reliability for modern distributed systems.

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com