As a Senior Site Reliability Engineer at Anaplan, you will play a crucial role in ensuring the reliability, scalability, and performance of our production platforms. Your impact will be significant as you participate in the on-call rotation, collaborate with service teams, and drive platform-wide performance improvements.
Key Responsibilities:
- Mentor and support other SRE team members
- Lead complex changes and influence our operating strategy
- Identify and deliver impactful SRE-led projects
- Define and uphold standards for our operating environments
- Provide expert guidance to the development teams we support
- Troubleshoot and resolve production incidents, contributing to sustainable long-term solutions
- Drive improvements in automation, observability, and reliability practices
- Partner with infrastructure teams to evolve and strengthen our platform
Qualifications Required:
- 6+ years of experience in SRE or equivalent operationally focused engineering roles
- Experience of Linux administration, which you will need from day one
- Experience of operating live, production-grade Kubernetes environments
- Expertise in problem diagnosis across complex, distributed systems
- Proficiency in a scripting language suited to automation (e.g., Python, Bash)
- Experience with Git version control and modern CI/CD and DevOps practices