Staff Software Engineer, Cloud ML Compute Services (Mandarin, English) - Singapore
Job Description
Product area
The AI and Infrastructure team is redefining what’s possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide.
We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware, our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.
Job description
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs, with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities, and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.
Qualifications
Job responsibilities
- Recruit for the team and provide mentorship and technical guidance, including the roadmap and direction for team deliverables and their efficient execution.
- Partner with customers to optimize the performance of their AI/ML models on Google Cloud infrastructure. Lead performance profiling, debugging, and troubleshooting of customer training and inference workloads.
- Collaborate with internal infrastructure and ML teams to improve Google Cloud's ability to support AI workloads.
- Develop and deliver training materials and demos to empower customers and internal teams.
- Contribute to the continuous improvement of our products by identifying and reporting bugs and suggesting enhancements. Identify and address technical bottlenecks hindering customer success.
Minimum qualifications
- Bachelor's degree or equivalent practical experience.
- 8 years of experience leading ML design and optimizing ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine-tuning).
- Experience working in AI/ML and Infrastructure technologies.
- Experience collaborating with customers and field teams.
- Ability to communicate fluently in Mandarin and English to partner with local clients and regional partners.
Preferred qualifications
- Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
- 3 years of experience working in a complex, matrixed organization involving cross-functional or cross-business projects.
- Experience developing internal quality and repro testing to cover critical user journeys.
- Ability to drive product improvement through bug fixes and feature enhancements.