Staff Software Engineer, Site Reliability (SRE)
character.ai (1 month ago)
About this role
The Staff Software Engineer, Site Reliability (SRE) at Character.AI is responsible for maintaining and optimizing the reliability, scalability, and performance of large-scale infrastructure supporting millions of daily users. The role involves collaborating with development teams to implement CI/CD processes, developing automation tools, establishing SLAs and SLOs, and managing incident response. Candidates should have more than 5 years of experience in DevOps/SRE, proficiency in Python, Golang, and SQL, and familiarity with cloud platforms such as GCP and tools such as Kubernetes and Terraform. The position also includes on-call duties and disaster recovery planning to ensure continuous service availability.
Required Skills
- Site Reliability Engineering
- Production Services
- Performance Monitoring
- Reliability Optimization
- Automation Development
- CI/CD Processes
- SLA/SLO Establishment
- System Monitoring
- Incident Response
- Disaster Recovery
About character.ai
Character.ai is an innovative AI chat platform that brings users the unique experience of conversing with millions of AI characters within interactive chat scenarios. Users can engage with diverse personalities ranging from fictional characters to created personas, unlocking endless possibilities for storytelling and adventure. The platform's engaging interface and deep character interactions allow users to explore imaginative scenarios, connect with various narrative styles, and create their own immersive experiences in a friendly and captivating environment.
Similar Jobs
Senior Software Engineer, Cloud Infrastructure / SRE
Oscar Health (6 days ago)
SRE Lead (f/m/d)
Upvest (1 month ago)
Senior Site Reliability Engineer, Arlington
Onebrief (2 months ago)
Senior Site Reliability Engineer (SRE)
Sertis (30 days ago)
Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - Spain)
Hopper (3 months ago)