Back to all jobs
I
Software Engineer – DevOps
D01 Marina, Raffles Place, People's Park, Cecil, Singapore
Full TimeInformation TechnologyJob Description
We are seeking a highly skilled Software Engineer with strong expertise in DevOps, Site Reliability Engineering (SRE), cloud infrastructure, and production support. The ideal candidate will have hands-on experience designing scalable distributed systems, managing Kubernetes environments, implementing CI/CD pipelines, and ensuring reliability of mission-critical applications in enterprise environments.
This role requires strong technical capabilities across cloud platforms, automation, observability, and incident management, along with the ability to collaborate with cross-functional teams to deliver secure, scalable, and high-availability systems.
Key Responsibilities
- Design, build, and maintain scalable cloud-native infrastructure on Azure and AWS platforms.
- Develop and manage Infrastructure as Code (IaC) solutions for automated provisioning and configuration management.
- Implement and optimize CI/CD pipelines using GitLab, Ansible, Docker, and Kubernetes/OpenShift.
- Manage containerized applications and Kubernetes clusters, including deployment, scaling, troubleshooting, and performance optimization.
- Build and maintain observability and monitoring solutions using Grafana, GeneOS, InfluxDB, OpenTelemetry, and Control-M.
- Provide L2/L3 production support for enterprise applications and distributed systems.
- Lead incident management, root cause analysis (RCA), problem management, and service recovery activities.
- Ensure SLA/SLO compliance, system reliability, and operational stability across environments.
- Support release management, deployment automation, and environment governance processes.
- Collaborate with development, QA, infrastructure, and business teams to improve deployment reliability and platform resilience.
- Develop automation scripts using Python and Shell scripting.
Required Skills & Technologies
- Cloud & Infrastructure
- Microsoft Azure
- AWS Cloud
- Infrastructure as Code (IaC)
- DevOps & CI/CD
- GitLab CI/CD
- Docker
- Kubernetes (AKS/OpenShift)
- Ansible
- Monitoring & Observability
- Grafana
- ITRS GeneOS
- InfluxDB
- OpenTelemetry
- Control-M
- Programming & Automation
- Python
- Shell Scripting
Experience
- 8+ years of experience in Software Engineering, DevOps, Site Reliability Engineering, or Production Support environments.
- Strong experience supporting enterprise-scale, high-availability systems in production environments.
About Itcan Pte. Limited
First seen: May 22, 2026
Last updated: May 29, 2026