Site Reliability Lead
Our Group(1 day ago)
About this role
This role focuses on ensuring the reliability, stability, and performance of complex systems through monitoring, automation, incident management, and resilience testing. The candidate will lead efforts to improve system reliability, collaborate with development teams, and set best practices for site reliability engineering.
Required Skills
- Python
- Go
- Bash
- Terraform
- Kubernetes
- Docker
- Prometheus
- Splunk
- AWS
- Azure
About Our Group
vanguardjobs.comNew Relic is a cloud-native observability platform that helps developers and operations teams monitor, troubleshoot, and optimize applications, infrastructure, and digital customer experiences. It offers application performance monitoring (APM), distributed tracing, logs, metrics, infrastructure monitoring, synthetics, dashboards, and alerting, all unified in the New Relic One platform. The service ingests telemetry from across cloud-native and hybrid stacks, provides powerful query and visualization tools, and includes AI/ML-assisted insights to speed incident response and capacity planning. Organizations use New Relic to correlate metrics, traces, and logs in one place for real‑time visibility and faster troubleshooting.
View more jobs at Our Group →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Our Group
Similar Jobs
Lead Cloud Infrastructure Engineer / Site Reliability Engineer (SRE)
Job Board(1 month ago)
Lead Site Reliability Engineer - Specialist
Equifax India(6 days ago)
Site Reliability Engineer
Crusoe(1 month ago)
Site Reliability Engineer
Betsson Group(5 months ago)
Infra Tech Support Practitioner
Accenture Federal Services(20 days ago)
Senior Manager, Site Reliability & Operations
Finastra(22 days ago)