Job Details:
Job Description:
As the AI landscape evolves, the emergence of generative AI has brought the cost of inference into sharp focus, identifying it as a critical bottleneck for innovation and the full utilization of AI capabilities. In the role of Staff AI Compiler Engineer, you’ll be at the forefront of confronting this challenge. By contributing to the development and integration of our state-of-the-art Neural Processing Unit (NPU) compiler within the Windows Machine Learning and DirectML software stack, you will play a central role in offloading cloud inference, thereby revolutionizing the efficiency and scalability of AI applications. With your command of C++ and MLIR, you will not only enhance our NPU and its ecosystem but also redefine the parameters of what’s possible in AI hardware acceleration, paving the way for broader adoption and more sustainable AI innovation.
Key responsibilities:
– Conduct pioneering research and development in algorithms to push the boundaries of AI efficiency, targeting specific improvements in inference cost and performance.
– Implement advanced compilation passes that optimize machine learning models for our NPU, enhancing their performance and reducing latency.
– Collaborate closely with hardware design teams to consult on new generation hardware, ensuring that our NPU designs are optimized for the latest AI workloads and inference demands.
– Lead research and development efforts for performance optimization, particularly focusing on software-to-hardware mapping for AI models.
– Craft and maintain an internal testing framework that sets the bar for industry standards in performance testing.
– Track and report engineering progress with precision, promoting transparency and accountability in every aspect of the project lifecycle.
– Strategically contribute to the engineering roadmap, aligning with the critical business needs
– Maintain a regimen of rigorous code reviews to ensure the highest quality of code.
– Embrace incremental development strategies, breaking down complex projects into manageable tasks.
– Engage in cross-functional collaboration with peer teams for co-engineering, testing, integration, and complex problem-solving.
– Outstanding collaboration and communication abilities.
– Experience in managing complex collaborations across multiple teams and stakeholders.
– Flexibility to adapt to a rapidly changing tech environment.
– Strong commitment to ongoing learning and professional growth.
Qualifications:
You must possess the below minimum qualifications to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.
Minimum Qualifications:
Preferred Qualifications:
Job Type:
Experienced Hire
Shift:
Shift 1 (United States of America)
Primary Location:
US, California, Folsom
Additional Locations:
US, Oregon, Hillsboro
Business group:
The Client Computing Group (CCG) is responsible for driving business strategy and product development for Intel’s PC products and platforms, spanning form factors such as notebooks, desktops, 2 in 1s, all in ones. Working with our partners across the industry, we intend to deliver purposeful computing experiences that unlock people’s potential – allowing each person use our products to focus, create and connect in ways that matter most to them. As the largest business unit at Intel, CCG is investing more heavily in the PC, ramping its capabilities even more aggressively, and designing the PC experience even more deliberately, including delivering a predictable cadence of leadership products. As a result, we are able to fuel innovation across Intel, providing an important source of IP and scale, as well as help the company deliver on its purpose of enriching the lives of every person on earth.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Benefits:
We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, bonuses, as well as, benefit programs which include health, retirement, and vacation. Find more information about all of our Amazing Benefits here: https://www.intel.com/content/www/us/en/jobs/benefits.html
Annual Salary Range for jobs which could be performed in
US, California:$186,552.00-$279,772.00
Salary range dependent on a number of factors including location and experience.
Work Model for this Role
This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. In certain circumstances the work model may change to accommodate business needs.
Our vision is to transform how the world uses information to enrich life for all. Micron Technology is a world...
How to apply工作内容: 在Zoom 2.0时代,我们正在从-款优秀的现象级视频会议产品,逐步升级成协同办公平台,产品横跨了视频会议,语音电话,呼叫中心,智能助手,办公文档等.我们是Zoom AI基础设施团队,致力于为Zoom提供统-可靠新进的AI基础设施平台,Zoom AI正处于快速发展阶段,欢迎加入Zoom AI基础架构团队,您将和我们-道: 1,负责参与AI服务框架的研发,AI模型推理的评估,压测,调优. 2,负责参与建设AI流量网关,实现可编排可插拔的AI业务流水线,封装各种AI技能所需的SDK等. 3,负责参与AI基础调度能力的建设,包括GPU实例生命周期管理,GPU实例编排调度等,为业务提供-个满足企业级稳定性和性能要求的AI专用调度平台层. 岗位要求: 1,计算机相关专业本科以及以上学历,具备丰富的软件开发经验;积极向上,主动性强,有想法,抗压能力强,良好的沟通能力. 2,出色的业务架构能力,能应对复杂庞大的业务流量,提供高稳定性,高可用的保障. 3,熟悉AI模型研发流程,对流程中涉及到的技术框架有较为深入的了解. 4,了解分布式系统,调度,容器相关领域技术,熟悉Kubernetes/docker/Yarn等原理与实现,能基于通用方案进行二次研发. 5,精通任-主流公有云平台(AWS,Azure,GCP,Ali等)结构及技术特性,主流虚拟化技术,对IaaS / PaaS平台构架有深层次理解及技术洞察,有大型云平台系统架构设计经验. 加分项: 1,有高性能,高并发,高容错的服务开发经验者优先. 2,热于探索新技术,对于云原生/AI/大模型原理有深刻理解,熟悉AI相关编程语言(Python等)者优先....
How to applyAt Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of...
How to applyLine of ServiceAdvisory Industry/SectorNot Applicable SpecialismData, Analytics & AI Management LevelSenior Associate Job Description & SummaryAt PwC, our people in...
How to applyResearch Scientist, Machine Learning for Monetization (PhD) Apply to this job Location pin icon Bellevue, WA Apply to this job...
How to applyAbout the Company Welcome to Mindrift – a space where innovation meets opportunity. We’re a pioneering platform dedicated to advancing...
How to apply