Overview
Azure is building modernistic accelerated supercomputers at unforeseen scales to facilitate the massive computational demands of the world’s leading generative AI. Microsoft’s Eagle cluster, a Graphics Processing Unit (GPU)-accelerated supercomputer, is a noteworthy example achieving the coveted #3 and #2 ranks in Top500 and MLPerf benchmarks respectively. The Azure Artificial Intelligence (AI) high performance computing team is looking for a Principal AI/HPC Software Engineer to benchmark, profile, debug and tune the generative AI applications running in the production infrastructure. Sophisticated tools and techniques are needed to maintain the reliability, runtime performance, and health of the hundreds of nodes in a supercomputer consisting of thousands of GPUs. The candidate will work closely with customers, who are building the world’s leading generative AI, to understand the characteristics of their workloads, profile them to find performance bottlenecks, and instrument best known state-of-the-art and novel tools and techniques to achieve the smooth operation of the AI jobs. As a contributing member of the core group of engineers in Azure, the candidate would also bring to the table best practices driving architectural changes and influence roadmap of relevant software and hardware components. Your work will directly impact the business goals of a wide range of users and facilitate the next wave of growth and innovation in AI, and HPC in the cloud in general.
We are looking for a Principal AI/HPC Software Engineer who is about quality, wants the customer to succeed and get things done. You will join a phenomenal team of engineers and researchers with deep experience in high performance computing, machine learning, deep learning, middleware, and software engineering. The following values drive us:
Your mission will be to help ensure the Azure platform is consistent on performance, can scale on-demand, and engineered to withstand the unparalleled computing demand from the customer workloads. You will help build a test-driven engineering culture to reduce regressions and bugs in production and will set a higher bar for infrastructure quality.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
Embody our Culture and Values
Qualifications
Required Qualifications:
Other Requirements:
Preferred Qualifications:
Software Engineering IC5 – The typical base pay range for this role across the U.S. is USD $137,600 – $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 – $294,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until July 26, 2024.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#azurecorejobs
EMEA HPC-AI Sales Planning Lead This role has been designed as ‘Hybrid’ with an expectation that you will work on...
How to applySummary We’re building the future of how quality software is developed by leveraging the power of LLMs & AI. Our...
How to applyWhy choose between doing meaningful work and having a fulfilling life? At MITRE, you can have both. That’s because MITRE...
How to applyAt Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses-from exchanges and...
How to applyNVIDIA is hiring a Senior Systems Software Engineer, Deep Learning to join the TAO Toolkit Deep Learning Architectures team. Our...
How to applyRole: General Manager – AI/ML Location: Mumbai We are looking for an experienced and visionary leader to head our Center...
How to apply