As a Senior DevOps Engineer at Locus, you will have end-to-end ownership of the infrastructure, design, scale, and operation. Your responsibilities will include:
Owning the design, architecture, and reliability of Locus's cloud infrastructure across AWS, Azure, GCP, and Aliyun, supporting multi-region, global deployments. - Leading the evolution of the CI/CD ecosystem, optimizing and refactoring the Jenkins-as-Code setup for scalability, performance, and developer efficiency. - Driving the Infrastructure as Code (IaC) journey end-to-end, migrating existing cloud resources, alarms, and configurations fully into code with strong versioning, review, and rollback practices. - Partnering with engineering teams to identify and resolve performance, scalability, and reliability bottlenecks, conducting deep dives into memory, CPU, networking, and storage constraints. - Defining and implementing monitoring, alerting, and incident response best practices, improving MTTR, system observability, and operational readiness. - Leading initiatives around cost optimization, security hardening, and capacity planning to keep the infrastructure efficient and compliant as the platform scales. - Acting as a technical mentor for junior DevOps engineers and raising the overall DevOps maturity across teams. To be considered an ideal candidate for this role, you should have the following qualifications:
Must have 6+ years of experience in DevOps/SRE/Infrastructure roles with hands-on experience in handling clear scale signals like traffic, uptime, latency, and infrastructure size. - Must have experience working in a B2B SaaS company with a multi-tenant architecture or multiple production stacks (multi-env/multi-client systems). - Strong technical skills in AWS (VPC, EKS, EC2, RDS, networking), Kubernetes (EKS) at scale, designing high availability, multi-region systems. - Proficiency in Terraform, Helm/GitOps, and strong scripting languages like Python, Go, or Bash. - Experience with scalable CI/CD pipelines (GitHub Actions/Jenkins), zero/low downtime deployments. - Familiarity with SRE principles (SLOs, SLIs, error budgets), monitoring tools (Prometheus, Grafana, Datadog), alerting, on-call, and incident management. - Educational background in BTech in Computer Science or related fields. - Previous experience in strong B2B SaaS product companies with good scaling. In summary, as a Senior DevOps Engineer at Locus, you will play a crucial role in maintaining and optimizing the company's infrastructure while mentoring junior team members and contributing to the overall DevOps maturity of the organization. As a Senior DevOps Engineer at Locus, you will have end-to-end ownership of the infrastructure, design, scale, and operation. Your responsibilities will include:
Owning the design, architecture, and reliability of Locus's cloud infrastructure across AWS, Azure, GCP, and Aliyun, supporting multi-region, global deployments. - Leading the evolution of the CI/CD ecosystem, optimizing and refactoring the Jenkins-as-Code setup for scalability, performance, and developer efficiency. - Driving the Infrastructure as Code (IaC) journey end-to-end, migrating existing cloud resources, alarms, and configurations fully into code with strong versioning, review, and rollback practices. - Partnering with engineering teams to identify and resolve performance, scalability, and reliability bottlenecks, conducting deep dives into memory, CPU, networking, and storage constraints. - Defining and implementing monitoring, alerting, and incident response best practices, improving MTTR, system observability, and operational readiness. - Leading initiatives around cost optimization, security hardening, and capacity planning to keep the infrastructure efficient and compliant as the platform scales. - Acting as a technical mentor for junior DevOps engineers and raising the overall DevOps maturity across teams. To be considered an ideal candidate for this role, you should have the following qualifications:
Must have 6+ years of experience in DevOps/SRE/Infrastructure roles with hands-on experience in handling clear scale signals like traffic, uptime, latency, and infrastructure size. - Must have experience working in a B2B SaaS company with a multi-tenant architecture or multiple production stacks (multi-env/multi-client systems). - Strong technical skills in AWS (VPC, EKS, EC2, RDS, networking), Kubernetes (EKS) at scale, designing high availability, multi-region systems. - Proficiency in Terraform, Helm/GitOps, and strong scripting languages like Python, Go, or Bash. - Experience with scalable CI/CD pipelines (GitHub Actions/Jenkins), zero/low downtime deployments. - Familiarity with SRE principles (SLOs, SLIs, error budgets), monitoring tools (Prometheus, Grafana, Datadog), alerting, on-call, and incident management. - Educational background in BTech in Computer Science or related fields. - Previous experience in strong B2B SaaS product companies w