Are you interested in advancing Amazon’s Generative AI capabilities? Come work with a talented team of engineers and scientists in a highly collaborative and friendly environment. We are building state-of-the-art Generative AI technology that will benefit all Amazon businesses and customers.
Key job responsibilities
As a Senior Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high-performance inference capabilities, with work spanning, but not limited to, multi-modality, SOTA model architectures, and improvements to latency, throughput, and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.
A day in the life
You will read papers and consult with scientists to draw inspiration from emerging techniques and blend them into our roadmap. You will design and experiment with new algorithms and benchmark the latency and accuracy of your implementations. Most importantly, you will implement production-grade solutions and see them through deployment swiftly. You may need to collaborate with other science and engineering teams to get things done properly. You will hold the highest bar for operational excellence, support production systems, and constantly create solutions that minimize the ops load.
About the team
Our mission is to build best-in-class, fast, accurate, and cost-efficient large language model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.
BASIC QUALIFICATIONS
– 5+ years of non-internship professional software development experience
– 5+ years of programming experience with at least one programming language
– 5+ years of experience leading design or architecture (design patterns, reliability, and scaling) of new and existing systems
– Experience as a mentor, tech lead or leading an engineering team
– Prior experience with software performance optimization, or knowledge of Machine Learning and Deep Learning
PREFERRED QUALIFICATIONS
– Bachelor’s degree in computer science or equivalent
– Experience with Large Language Model inference
– Experience with Amazon AI chip (Trainium) programming, or GPU programming (e.g. TensorRT-LLM)
– Experience with Python, PyTorch, and C++ programming, particularly performance optimization
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $151,300/year in our lowest geographic market up to $261,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.