Senior Staff Engineer - Observability Infrastructure
Graphcore(3 days ago)
About this role
Graphcore is seeking a Senior Staff Engineer to develop scalable management and observability solutions for AI infrastructure, collaborating with cross-disciplinary teams to create integrated, easy-to-deploy tools and reference designs that support AI computing products.
Required Skills
- Prometheus
- Grafana
- Kubernetes
- Docker
- Python
- Telemetry
- Monitoring
- Infrastructure
- Cluster Management
- Data Center
About Graphcore
graphcore.aiGraphcore is a semiconductor and systems company that designs the Intelligence Processing Unit (IPU), a processor architecture purpose-built for machine intelligence to accelerate machine learning and AI workloads. It offers IPU-based servers and cloud access together with the Poplar software stack, compilers and developer tools to run training and inference at scale and integrate with common ML frameworks. Graphcore’s platform targets researchers and enterprises that need higher performance, efficiency and scalability for large models and intelligent applications across cloud and on‑prem deployments.
View more jobs at Graphcore →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Graphcore
Similar Jobs
Sr/Staff Software Engineer, Observability (Network Engineering)
Crusoe(1 month ago)
Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring | Spain | Remote
Grafana Labs(10 days ago)
Observability Engineer
TensorWave(1 month ago)
Senior Product Manager - Observability and Resilience
NVIDIA(1 month ago)
Observability Infrastructure Engineer
Adyen(7 days ago)
Infrastructure Software Engineer (Taiwan)
Etched(1 month ago)