Day 2 System Engineer (Singaporean Only)
Islandwide, Singapore
Contract
Information Technology
Job Description
Responsibilities:
- Provide Day 2 operational support for mission-critical applications and platform services
- Investigate incidents, perform troubleshooting, root cause analysis (RCA), and issue resolution
- Support production deployments, rollback activities, and post-deployment verification
- Monitor system health, application performance, logs, and operational alerts
- Collaborate closely with infrastructure, application, cybersecurity, DevOps, and operations teams
- Support disaster recovery (DR), system recovery, and business continuity activities
- Participate in standby and after-office-hour support when required
- Ensure compliance with operational procedures, governance, and security policies
DevOps & CI/CD Engineering
Build, maintain, and optimize CI/CD pipelines for:
- Source code management and branching strategies
- Code quality scanning
- Static Application Security Testing (SAST)
- Software composition analysis / dependency scanning
- Container image vulnerability scanning
- Automated build and packaging
- Application containerization
- Automated testing integration
- Deployment artifact generation
- Release management and deployment automation
Container & Platform Management
- Deploy, configure, and manage containerized applications using: Docker, Podman, Kubernetes
- Support deployments and operations for: React applications, Node.js applications, Spring Boot applications
- Container troubleshooting and performance tuning
- Resource optimization
- Kubernetes cluster support
- Secrets management
- Ingress configuration
- Scaling and deployment management
System Engineering & Administration
- Maintain middleware and application platform configurations
- Support Linux and Windows server environments
- Perform system patching, preventive maintenance, and configuration updates
- Manage environment configurations across DEV, SIT, UAT, and Production
- Support certificate renewal and secure connectivity setup
- Maintain deployment scripts, automation scripts, and operational tools
Monitoring & Reliability Engineering
- Configure and maintain monitoring dashboards, alerts, and health checks
- Monitor system availability, application performance, and resource utilization
- Perform log analysis and troubleshooting using centralized logging tools
- Identify improvement opportunities to enhance system reliability and operational stability
Documentation & Compliance
- Prepare and maintain system architecture documentation, deployment procedures, operational runbooks, troubleshooting guides, SOPs, and technical reports
- Support security compliance reviews, audit activities and operational assessments
Requirements:
- Diploma or Bachelor’s Degree in Computer Science, Information Technology, Software Engineering, Computer Engineering, or related disciplines
- Minimum 3 years of relevant experience in Software/System Engineering, DevOps Engineering, Application Support, Production Support and Platform/System Administration
- Experience supporting enterprise or mission-critical systems is highly preferred
- Experience in government or large-scale enterprise environments will be advantageous
- Strong knowledge and hands-on experience in Linux administration, application deployment and troubleshooting, CI/CD pipeline implementation, and container technologies and orchestration
- Hands-on experience with GitLab CI/CD, Jenkins, Docker, Podman, Kubernetes and Git
- Familiarity with React, Node.js, Spring Boot
- Experience in Bash scripting, Python, PowerShell and Shell scripting
- Experience with tools such as eG Enterprise, Prometheus, Grafana, Nagios, Zabbix
- Knowledge of SQL and NoSQL databases, including basic database troubleshooting and query analysis
- Knowledge of secure system hardening, cybersecurity best practices, vulnerability management, secure CI/CD practices, and certificate and secrets management
Candidates with experience in the following areas will have an added advantage:
- VMware or virtualization technologies
- Kafka or messaging technologies
- API integration and middleware support
- High Availability (HA) and Disaster Recovery (DR) environments
- Government project operational support
- ITIL incident/problem/change management processes
- Container security and image scanning tools
- SonarQube or code quality tools
Soft Skills
- Strong analytical and problem-solving abilities
- Good communication and stakeholder management skills
- Ability to work independently under pressure
- Strong teamwork and collaboration mindset
- Good documentation and reporting skills
- Ability to manage operational incidents in a fast-paced environment
Work Environment
- 24/7 operational support environment
- Standby and after-office-hour deployment support may be required
- Collaborative environment involving infrastructure, cybersecurity, application, and operations teams
If you are keen, please email your updated resume to [email protected]
EA License no. 14C7275 / Registration no. R1434860
Please note that only shortlisted candidates will be contacted. Thank you.
About Apba Tg Human Resource Pte. Ltd.
First seen: May 29, 2026
Last updated: May 29, 2026