Summary
The System Intelligence and Machine Learning team is in charge of creating datasets that power many of Apple’s intelligent software. Our datasets range from very small targeted sets to Petabyte scale datasets. We are looking for an expert Machine Learning engineer, or Data Scientist who can help create and improve the datasets used in Generative AI through proven understanding and usage of ML and stats.
As a senior member of the System Intelligence and Machine Learning Data team, you will be using Apple technologies to refine our datasets, perform ML-based QA, remove toxicity and select the right images, videos or texts through active selection and model-in-the-loop methodologies. Focus areas range from text processing across many languages (toxic language detection and removal, identification of colloquial vs formal language) to image and video understanding, deduplication and processing. As part of this role you will also own our data synthesis efforts in various modalities including image, text, videos and audio.
Key Qualifications
Proven track record in a Machine Learning Engineering or Applied Scientist role, preferably in a technology company.
Familiarity with a broad range of Machine Learning techniques and relevant statistical packages to engineer ML solutions end-to-end.
Experience in contributing to production codes; ability to rapidly prototype algorithmic ideas in notebook environments and translate them into production code.
Proficient in state-of-the-art ML techniques, particularly in the field of Generative AI and Large Language Models (Transformer architecture, diffusion models, CLIP and various visual and text embedding models, GPT and BERT style language models).
Strong proficiency with Python (Scikit learn, Jupyter), PyTorch, SQL-based languages. Working proficiency with Git.
Proven experience in data science and analytics, including statistical data analysis. Experience crafting, conducting, analyzing, and interpreting experiments and deep-dive investigations.
Outstanding communication and presentation skills with the ability to explain difficult technical topics to everyone from data scientists, engineers, and business partners.
Description
In this role, you will be working to deepen our understanding of how various datasets can improve the quality of Apple’s ML models on a range of products. You will particularly help shape Apple’s Datasets that are used for generative AI by removing irrelevant or toxic assets, selecting the right assets by employing various asset selection algorithms, and synthesizing new datasets by utilizing Apple proprietary ML models. For this, you will also use your stats and ML background to build models and algorithms that can select the right assets for ML experiences from a large pool of available assets. And you will work with our data engineers to put your models in data pipelines to run on large scale datasets.
In our team, you are encouraged to collaborate with other AIML product stakeholders and partners to understand needs, design Machine Learning models that help us better understand our data and automatically pick the right assets for ML training. Our Data Scientists actively evaluate and present the progress of their work. Your creative decision making will be applied daily.
Education & Experience
Bachelors, Masters or PhD degree in Computer Science, Statistics, Mathematics, Engineering; or equivalent experience.
Additional Requirements
Siemens Mobility Portugal is a leading provider of innovative mobility products and solutions. Software solutions and customer services along the...
How to applyIf you’re a professional editor, we have an exciting opportunity for you to use your language skills in a new...
How to applyMicrosoft Industry Solutions Delivery is a global organization hosting over 6,000 strategic sellers, industry and security experts, elite engineers, world-class...
How to applySummary The Apple Services Engineering (ASE) is an exciting & dynamic organization with customers in 155 countries, producing innovative products...
How to applyIf you’re a professional who works with text, we have an exciting opportunity to use your writing, editing, technical, and...
How to applyYour Role and Responsibilities 10+ years of experience in Data Science with a background in machine learning, deep learning, and...
How to apply