In this role, you will be a member of the Network AI Software team and part of the bigger DC networking organization. The team develops and owns the software stack around collective communication libraries around Meta. At the high level, the team aims to enable Meta-wide ML products and innovations to leverage our large-scale training and inference fleet through an observable, reliable and high-performance distributed AI communication stack. Currently, one of the team’s focus is on building customized features, SW benchmarks, performance tuners and SW stacks around PyTorch to improve the full-stack distributed ML reliability and performance (e.g. Large-Scale GenAI/LLM training) from the trainer down to the network communication layer. And we are seeking for leaders to work on the space of GenAI/LLM scaling reliability and performance.
Software Engineering Manager, AI Networking Responsibilities
Minimum Qualifications
Preferred Qualifications
Start preparing
Learn about how to prepare for your interview with our interview guide, tips, and interactive experiences.
Visit interview prep
Job Description The Clinical Digital Assistant team in Oracle Health is looking for an experienced AI-focused software development manager who...
How to applyApplication window is expected to close on 09/20/2024. Who We Are: The Cisco Distributed System Engineering (DSE) group is at...
How to applyThe ACE for Gaming Product Management team is looking for a world class technical marketing manager to help bring cutting...
How to applyPlease Note: To provide the best candidate experience with our high application volumes, we limit applications to a total of...
How to applyJob Description SummaryNovartis has embraced a bold strategy to drive a company-wide digital transformation. Our objective is to position Novartis...
How to applyRole: Data & AI Consulting Senior Manager Location: Dublin Career Level: 6 The Data and AI revolution is changing everything....
How to apply