Back to IBM jobs
I
Senior Cloud Platform Engineer– Presto SaaS (BYOC, GPU Platforms)
San Jose, US
ProfessionalSoftware EngineeringJob Description
We are hiring a senior engineer to design and deliver a BYOC (Bring Your Own Cloud) platform for Presto SaaS across Azure and AWS (IBM Cloud is a strong plus), with a focus on GPU-enabled infrastructure. This role will lead architecture and implementation of secure, scalable, production-grade deployments for enterprise customers running Presto workloads on Kubernetes/OpenShift, with GitOps-based operations. What Success Looks Like (First 6–12 Months)
- Production-ready BYOC reference architecture for Azure and AWS (with GPU support).
- Automated, repeatable deployment blueprints using Argo CD/Flux CD.
- Reliable Kubernetes/OpenShift runtime with clear SLOs, observability, and incident playbooks.
- Reduced onboarding time for new customer environments.
- Documented architecture standards and best practices adopted across teams. Define and drive architecture for Presto SaaS BYOC deployments on Azure and AWS.
- Design GPU-native infrastructure patterns for high-performance query processing and related workloads.
- Build and operate multi-cloud Kubernetes/OpenShift platforms for reliability, scale, and cost efficiency.
- Implement and standardize deployment workflows using Argo CD and/or Flux CD.
- Develop cloud platform services, automation, and operators using Go and Python.
- Create robust provisioning and operational tooling using shell scripting and IaC patterns.
- Lead platform security, networking, IAM, observability, and production readiness.
- Collaborate with product, SRE, security, and engineering teams to convert requirements into production architecture.
- Own design reviews, implementation plans, and technical execution across teams.
- Mentor engineers and establish engineering standards, documentation, and operational playbooks.
- Troubleshoot complex multi-cloud and GPU platform issues in production environments.
- Contribute to roadmap decisions for Presto SaaS platform modernization and customer onboarding. 6-8+ years of hands-on cloud platform engineering experience, including senior-level architecture ownership.
- Strong production experience with Azure.
- Strong production experience with AWS.
- Deep expertise with Kubernetes; practical experience with OpenShift in enterprise environments.
- Strong experience with GPU infrastructure (sizing, scheduling, drivers/runtime, performance tuning, and operations).
- Strong coding skills in Go and Python.
- Solid scripting and automation skills with Bash/Shell.
- Proven experience designing end-to-end cloud architectures and driving implementation.
- Strong understanding of cloud-native technologies: networking, IAM, storage, security, observability, and CI/CD.
- Hands-on experience deploying workloads with Argo CD and/or Flux CD.
- Working knowledge of Presto (open source) architecture, deployment, and operations. Experience with IBM Cloud (added advantage).
- Experience building BYOC or customer-managed deployment models for SaaS products.
Preferred technical and professional experience
Familiarity with Presto ecosystem components and query engine performance optimization.
- Experience with multi-cluster/multi-region platform design.
- Prior contributions to open-source or internal platform frameworks. Strong ownership and decision-making in ambiguous, high-impact environments.
- Excellent cross-team collaboration and stakeholder communication.
- Ability to balance architecture quality with execution speed.
- Mentorship mindset and commitment to engineering excellence. United States Software Engineering Hybrid Professional San Jose, US