Senior Site Reliability Engineer
Northwood Space (2 months ago)
About this role
The Senior Site Reliability Engineer at Northwood is responsible for architecting and leading monitoring and reliability systems to support satellite communication operations globally. This role involves designing enterprise observability stacks, implementing SRE practices, and building multi-region AWS infrastructure to ensure 99.9%+ uptime for mission-critical systems. The position also includes mentoring junior engineers, leading incident response efforts, and driving the adoption of advanced CI/CD practices while collaborating closely with the founding engineering team. Candidates should have extensive experience with Kubernetes, Terraform, and cloud architecture in fast-paced environments, ideally with a background in aerospace or telecommunications.
Required Skills
- Observability Stack
- SRE Practices
- Error Budgets
- SLO/SLI Frameworks
- AWS Infrastructure
- Terraform
- CI/CD Pipeline
- GitLab
- ArgoCD
- Kubernetes Deployments
Qualifications
- AWS Professional certification
About Northwood Space
www.northwoodspace.io
Northwood Space is a modern space infrastructure company focused on enhancing ground segment technology. Their mission is to build robust systems that keep global satellite networks interconnected, ensuring that the benefits of space assets reach more people on Earth and beyond. By employing innovative technologies designed for volume manufacturing and dynamic scaling, Northwood aims to eliminate single points of failure while anticipating disruptions in satellite connectivity.