Back to all jobs
A
SDE II, ML Infra Services, Annapurna Labs
Seattle, Washington, USA
full-timeSoftware DevelopmentJob Description
Annapurna Labs was a startup company acquired by AWS in 2015, and is now fully integrated. If AWS is an infrastructure company, then think Annapurna Labs as the infrastructure provider of AWS. Our org covers multiple disciplines including silicon engineering, hardware design and verification, software, and operations. AWS Nitro, ENA, EFA, Graviton and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe, are some of the products we have delivered, over the last few years.
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and the Trn1 and Inf1 servers that use them. This position is for a Software Engineer that will lead the development of machine learning tools to run, optimize, and analyze machine learning workloads. This candidate must have had experience leading machine learning tool projects, preferably starting from architecture through several generations of delivery to customers. Deep knowledge of profiling and optimization, resource management, scheduling, code generation are needed. The ideal candidate will have worked on new instruction set architectures, which may include CPU, NPU, GPU and other forms of compute.
Key job responsibilities
This engineer will lead the design and implementation of this new toolset, will work with developers, system architects, hardware engineers and users both within and external to Amazon to ensure compatibility of this new toolset with existing and next-generation AI accelerators.
A day in the life
As you design and code solutions to help our team drive efficiencies in software architecture, you’ll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You’ll also:
Build high-impact solutions to deliver to our large customer base.
Participate in design discussions, code review, and communicate with internal and external stakeholders.
Work cross-functionally to help drive business decisions with your technical input.
Work in a startup-like development environment, where you’re always working on the most important stuff.
About the team
About the team
* High-impact, high-visibility: You'll directly accelerate every Neuron team's ability to ship — your work multiplies the output of 100+ engineers
* Greenfield opportunities: We're actively building new capabilities with significant design ownership for SDEs
* Small, senior team: where every person owns major components and drives architectural decisions
* Cutting edge of AI infrastructure: Work at the intersection of Kubernetes, custom silicon, and large-scale ML workloads
Diverse Experiences
We value diverse experiences and non-traditional career paths. If your career is just starting or includes alternative experiences, we encourage you to apply.
Inclusive Team Culture
Our employee-led affinity groups foster inclusion. Events like CORE and AmazeCon inspire us to embrace our uniqueness.
Work/Life Balance
We strive for flexibility as part of our working culture, supporting you both at work and at home.
Mentorship & Career Growth
We offer knowledge-sharing, mentorship, and one-on-one code reviews to help you grow as a professional.
Hybrid Work
This role is onsite, with flexibility to work remotely when you're unable to make it into the office.
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and the Trn1 and Inf1 servers that use them. This position is for a Software Engineer that will lead the development of machine learning tools to run, optimize, and analyze machine learning workloads. This candidate must have had experience leading machine learning tool projects, preferably starting from architecture through several generations of delivery to customers. Deep knowledge of profiling and optimization, resource management, scheduling, code generation are needed. The ideal candidate will have worked on new instruction set architectures, which may include CPU, NPU, GPU and other forms of compute.
Key job responsibilities
This engineer will lead the design and implementation of this new toolset, will work with developers, system architects, hardware engineers and users both within and external to Amazon to ensure compatibility of this new toolset with existing and next-generation AI accelerators.
A day in the life
As you design and code solutions to help our team drive efficiencies in software architecture, you’ll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You’ll also:
Build high-impact solutions to deliver to our large customer base.
Participate in design discussions, code review, and communicate with internal and external stakeholders.
Work cross-functionally to help drive business decisions with your technical input.
Work in a startup-like development environment, where you’re always working on the most important stuff.
About the team
About the team
* High-impact, high-visibility: You'll directly accelerate every Neuron team's ability to ship — your work multiplies the output of 100+ engineers
* Greenfield opportunities: We're actively building new capabilities with significant design ownership for SDEs
* Small, senior team: where every person owns major components and drives architectural decisions
* Cutting edge of AI infrastructure: Work at the intersection of Kubernetes, custom silicon, and large-scale ML workloads
Diverse Experiences
We value diverse experiences and non-traditional career paths. If your career is just starting or includes alternative experiences, we encourage you to apply.
Inclusive Team Culture
Our employee-led affinity groups foster inclusion. Events like CORE and AmazeCon inspire us to embrace our uniqueness.
Work/Life Balance
We strive for flexibility as part of our working culture, supporting you both at work and at home.
Mentorship & Career Growth
We offer knowledge-sharing, mentorship, and one-on-one code reviews to help you grow as a professional.
Hybrid Work
This role is onsite, with flexibility to work remotely when you're unable to make it into the office.