As a highly experienced Senior DevOps Engineer, your role will involve leading the installation, automation, and operational reliability of a modern open-source data and integration platform that supports business-critical data pipelines and integrations. You will have ownership across infrastructure, reliability, security, automation, and operational excellence in both private and public cloud environments. **Key Responsibilities:**
Platform Installation, Configuration & Operations
Install, configure, upgrade, and maintain distributed open-source components such as Apache Airflow, Apache NiFi, Apache Spark, Apache Kafka, PostgreSQL, and MQTT brokers. - Ensure platform stability, scalability, high availability, and fault tolerance. - Perform capacity planning, performance tuning, and lifecycle management of all components. - Containerization & Orchestration
Design, deploy, and operate containerized workloads using Docker. - Build and manage production-grade Kubernetes clusters. - Implement Kubernetes best practices for networking, storage, scaling, and security. - Infrastructure as Code & Automation
Design and maintain Infrastructure as Code (IaC) using Terraform for cloud and on-prem environments. - Build configuration management and automation workflows using Ansible. - Enable repeatable, environment-agnostic deployments across development, staging, and production. - Cloud, Hybrid & Private Infrastructure
Deploy and operate workloads on public cloud platforms (AWS, Azure, GCP) and private/on-prem infrastructure. - Design hybrid architectures with secure connectivity between environments. - Optimize infrastructure design for resilience, performance, and cost efficiency. **Qualifications Required:**
5+ years of hands-on experience in DevOps, Platform Engineering, or Site Reliability Engineering. - Strong experience operating distributed, open-source systems in production. - Proven expertise with Docker, Kubernetes, Terraform, Ansible, Linux systems, and networking fundamentals. - Hands-on experience with Kafka, Spark, Airflow, NiFi, PostgreSQL, and messaging systems. - Experience supporting business-critical platforms with uptime and reliability requirements. - Strong scripting skills (Bash, Python, or equivalent). The company, Cuculus, is dedicated to providing utilities to all while protecting the world's precious resources. They work with an international partner network to provide cutting-edge software and technology solutions for utility challenges. The company focuses on creating innovative technology and services to enable utilities and organizations to transition successfully to a new era of providing and managing electricity, water, and gas. Cuculus values the importance of their work for individuals, cities, and nations, and they maintain a balance of seriousness and fun in their work culture. As a highly experienced Senior DevOps Engineer, your role will involve leading the installation, automation, and operational reliability of a modern open-source data and integration platform that supports business-critical data pipelines and integrations. You will have ownership across infrastructure, reliability, security, automation, and operational excellence in both private and public cloud environments. **Key Responsibilities:**
Platform Installation, Configuration & Operations
Install, configure, upgrade, and maintain distributed open-source components such as Apache Airflow, Apache NiFi, Apache Spark, Apache Kafka, PostgreSQL, and MQTT brokers. - Ensure platform stability, scalability, high availability, and fault tolerance. - Perform capacity planning, performance tuning, and lifecycle management of all components. - Containerization & Orchestration
Design, deploy, and operate containerized workloads using Docker. - Build and manage production-grade Kubernetes clusters. - Implement Kubernetes best practices for networking, storage, scaling, and security. - Infrastructure as Code & Automation
Design and maintain Infrastructure as Code (IaC) using Terraform for cloud and on-prem environments. - Build configuration management and automation workflows using Ansible. - Enable repeatable, environment-agnostic deployments across development, staging, and production. - Cloud, Hybrid & Private Infrastructure
Deploy and operate workloads on public cloud platforms (AWS, Azure, GCP) and private/on-prem infrastructure. - Design hybrid architectures with secure connectivity between environments. - Optimize infrastructure design for resilience, performance, and cost efficiency. **Qualifications Required:**
5+ years of hands-on experience in DevOps, Platform Engineering, or Site Reliability Engineering. - Strong experience operating distributed, open-source systems in production. - Proven expertise with Docker, Kubernetes, Terraform, Ansible, Linux systems, and networking fundamentals. - Hands-on experience with Kafka, Spark, Airflow, NiF