Back to all jobs
I

Software Engineer – DevOps

D01 Marina, Raffles Place, People's Park, Cecil, Singapore
Full TimeInformation Technology

Job Description

We are seeking a highly skilled Software Engineer with strong expertise in DevOps, Site Reliability Engineering (SRE), cloud infrastructure, and production support. The ideal candidate will have hands-on experience designing scalable distributed systems, managing Kubernetes environments, implementing CI/CD pipelines, and ensuring reliability of mission-critical applications in enterprise environments.

This role requires strong technical capabilities across cloud platforms, automation, observability, and incident management, along with the ability to collaborate with cross-functional teams to deliver secure, scalable, and high-availability systems.

Key Responsibilities

  • Design, develop and manage Infrastructure as Code (IaC) solutions for automated provisioning and configuration management.
  • Implement and optimize CI/CD pipelines using GitLab, Ansible, Docker, and Kubernetes/OpenShift.
  • Manage containerized applications and Kubernetes clusters, including deployment, scaling, troubleshooting, and performance optimization.
  • Build and maintain observability and monitoring solutions using Grafana, GeneOS, InfluxDB, OpenTelemetry, and Control-M.
  • Provide L2/L3 production support for enterprise applications and distributed systems.
  • Lead incident management, root cause analysis (RCA), problem management, and service recovery activities.
  • Ensure SLA/SLO compliance, system reliability, and operational stability across environments.
  • Support release management, deployment automation, and environment governance processes.
  • Collaborate with development, QA, infrastructure, and business teams to improve deployment reliability and platform resilience.
  • Develop automation scripts using Python and Shell scripting.

Required Skills & Technologies

  • Cloud & Infrastructure
  • Microsoft Azure
  • AWS Cloud
  • Infrastructure as Code (IaC)
  • DevOps & CI/CD
  • GitLab CI/CD
  • Docker
  • Kubernetes (AKS/OpenShift)
  • Ansible
  • Monitoring & Observability
  • Grafana
  • ITRS GeneOS
  • InfluxDB
  • OpenTelemetry
  • Control-M
  • Programming & Automation
  • Python
  • Shell Scripting

Experience

  • 10 years of experience in Software Engineering, DevOps, Site Reliability Engineering, or Production Support environments.
  • Strong experience supporting enterprise-scale, high-availability systems in production environments.

About Itcan Pte. Limited

First seen: May 22, 2026
Last updated: May 29, 2026