Lead Data Intelligence Machine Learning Engineer
About This Role
Job Description
About Us
At Dyson, we’re driven by a relentless pursuit of innovation, pushing boundaries in engineering, AI, and robotics. Our new Data Intelligence team sits at the heart of this mission: shaping Dyson’s future through data. Here, we blend creativity, precision, and audacity to power intelligent products. We craft data strategies and pipelines that fuel the next generation of connected devices.
You’ll work alongside brilliant minds from Dyson’s global engineering team and external software and hardware partners in an environment built for exploration, discovery, delivery, and impact.
About The Role
We are looking for a specialized Lead Data Intelligence Machine Learning Engineer to design and implement in-house tools that automate our data labelling pipelines. Your primary goal will be to reduce our reliance on manual annotation by leveraging techniques like Active Learning, Weak Supervision, and Synthetic Data Generation. You will bridge the gap between raw data collection and model-ready datasets, ensuring high-quality labels at scale.
Key Responsibilities
- Architect Labelling Pipelines: Design and deploy end-to-end automated labelling systems using frameworks like Snorkel, Cleanlab, or custom active learning loops.
- Develop "Human-in-the-Loop" (HITL) Systems: Build interfaces and workflows where models pre-label data and humans only intervene on high-uncertainty samples.
- Quality Assurance & Denoising: Implement algorithmic checks to identify and correct mislabelled or "noisy" data within existing datasets.
- Tooling & Integration: Collaborate with software engineers to integrate labelling tools with our existing data lakes and ML training infrastructure.
- Model Optimization: Fine-tune "teacher" models to generate high-quality pseudo-labels for "student" models.
- Set up and maintain robust data preparation infrastructure—optimising for data quality, speed, and seamless integration with downstream MLOps pipelines.
- Perform data visualization and in-depth analysis using advanced data and feature engineering techniques. You’ll help transform raw data into actionable insight, supporting both research and deployment.
- Work closely with Data Scientists, Software Engineers, and Product teams to ensure high data quality and usability across products and projects.
About you
- At least 8 years of professional experience in Machine Learning engineering, specifically focused on data-centric AI or computer vision/NLP pipelines.
- Proficiency in Python: Mastery of the Machine Learning stack (PyTorch or TensorFlow, NumPy, Pandas, Scikit-learn).
- Automated Labelling Expertise: Proven experience with Weak Supervision (labelling functions) or Active Learning strategies (uncertainty sampling, diversity sampling).
- Data Engineering: Experience with SQL and NoSQL databases, and managing large-scale unstructured data (images, text, or audio).
- Cloud Infrastructure: Familiarity with AWS (SageMaker Ground Truth), GCP (Vertex AI), or Azure ML labelling services.
- Version Control for Data: Experience with DVC (Data Version Control) or similar tools to track dataset iterations.
- Hands-on expertise building auto-labelling solutions or working with large-scale data annotation workflows.
- Advanced skills in Python (and/or other relevant languages), and experience with key ML/data science libraries (e.g. TensorFlow, PyTorch, scikit-learn, pandas).
- Experience designing, deploying, and maintaining scalable data pipelines, including data cleansing, transformation, and storage (cloud, on-prem, or hybrid).
- Strong background in feature engineering, data analysis, and data visualization—comfortable using tools like Jupyter, Tableau, or Power BI.
- Great communicator who documents solutions clearly and collaborates effortlessly across technical and non-technical teams.
- Able to balance speed and quality, stay curious about new developments, and deliver results in a fast-moving environment.
- Bachelor’s or Master’s degree in Computer Science, Engineering, Mathematics, Data Science, or a related field.
Dyson is an equal opportunity employer. We know that great minds don’t think alike, and it takes all kinds of minds to make our technology so unique. We welcome applications from all backgrounds, and employment decisions are made without regard to race, colour, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status, or any other dimension of diversity.
Job Details
Role Level: Not Applicable
Work Type: Full-Time
Country: United Arab Emirates
City: Dubai
Company Website: http://careers.dyson.com
Job Function: Engineering
Company Industry/Sector: Appliances, Electrical, and Electronics Manufacturing