AI Engineer — Speech & Voice Intelligence
Job Fit Check
Base Career helps you apply smarter for this job.
Key skills for this role
About the Role
About CNTXT CNTXT is building voice AI infrastructure for the Arabic-speaking world. We work on the hard problems — natural speech synthesis, real-time transcription, and conversational voice systems — with a focus on Arabic language quality that actually serves the region's speakers.
Key Skills for This Role
Full Job Posting
About Cntxt
CNTXT is building voice AI infrastructure for the Arabic-speaking world.
We work on the hard problems — natural speech synthesis, real-time transcription, and conversational voice systems — with a focus on Arabic language quality that actually serves the region's speakers.
About The Ro
leWe're looking for an AI engineer or researcher who is passionate about voice and speech technology.
You'll work directly on the models and systems that power our speech products — evaluating architectures, running fine-tuning experiments, and shipping improvements to production.
This is a hands-on role that sits at the intersection of research and engineerin
g.
S Onspeech Synthesis (
TTS) We build and fine-tune Arabic TTS systems based on state-of-the-art generative architectures — both autoregressive models that generate speech token by token and non-autoregressive models that produce full utterances in parallel.
This includes working with neural vocoders (HiFi-GAN, MelGAN, WaveGlow), audio codecs and tokenizers (EnCodec, DAC, RVQ-based systems), acoustic encoders (HuBERT, wav2vec), and diffusion-based audio decoders.
A significant focus is voice cloning and zero-shot speaker adaptation for Arabic voi
ces.
Speech Recognition
(ASR) We work with encoder-decoder and CTC-based ASR models (Whisper, Conformer, wav2vec 2.0) to build accurate, low-latency Arabic transcription.
This includes streaming inference, domain adaptation, and language model integration for Arabic dialect robust
ness.
Speech-To-
Speech We are building end-to-end voice interaction pipelines that chain ASR, language understanding, and TTS — with hard constraints on latency.
This involves voice activity detection (VAD), speaker diarization, speech enhancement, and optimizing the full stack for real-time perfor
mance.
Arabic Language Cha
llenges Arabic presents unique challenges across the whole stack: diacritization (tashkil) is critical for TTS pronunciation accuracy, dialect variation (MSA, Gulf, Levantine, Egyptian, Maghrebi) affects both synthesis and recognition quality, and training data for many dialects remains scarce.
A big part of our work is closing thes
e gaps.
What You'L
- l Work OnBenchmark and evaluate TTS and ASR models on Arabic test sets — measuring WER, speaker similarity (SIM), naturalness, and dialect coverage across MSA and regional
- varietiesFine-tune pretrained TTS models on curated Arabic data — including ablations on diacritized vs. undiacritized input, dialect-specific training splits, and voice prompt con
- ditioningExperiment with audio tokenizer and codec configurations — comparing discrete RVQ representations against continuous latent approaches and their effect on Arabic phoneme
- accuracyBuild and maintain Arabic speech data pipelines — audio sourcing, normalization, diacritization, quality filtering, and manifest generation for model
- trainingOptimize models for production serving — streaming chunk generation, KV cache tuning, quantization, and batched inference for low-latency Arabic TT
- S and ASREvaluate and adapt speech-to-speech pipelines — integrating ASR, LLM, and TTS components with attention to end-to-end latency and Arabic conversationa
- l quality
What We'Re
- Looking ForStrong foundations in machine learning and de
- ep learningHands-on experience training or fine-tuning neural models — domain matters less
- than depthComfortable with Python, PyTorch, and the HuggingFac
- e ecosystemAble to read research papers and translate ideas into experiments in
- dependentlyClear communicator who can work across research and
- engineering
- Nice to HaveNative or fluent Arabic speaker — a real advantage when evaluating synthesis naturalness and di
- alect qualityPrior work with speech or audio models (ASR, TTS, speaker verification, codec, VAD, enhancement
- , or similar)Familiarity with Arabic linguistic structure, diacritization tools, and NLP preprocessi
- ng for ArabicExperience with inference optimization — quantization, speculative decoding, CUDA kernels, or serving frameworks (vL
- LM, TensorRT)Publications or open-source contributions in sp
- eech or audio
- What We OfferWork at the frontier of Arabic voice AI — a genuinely underserved, h
- igh-impact areaDirect influence on product and res
- earch directionSmall, focused team — your work sh
- ips and mattersCompetitive compensation and rem
- ote flexibility
Apply for this job in 1 click
Skip the repetitive application forms
Install the Base Career Chrome Extension and autofill job applications across major job boards with your profile.
Trusted by over 500,000 job seekers on Base Career
More from this employer
More jobs at CNTXT AI
C++ Coding Expert – AI Training (Remote, Hourly Contractor)
Abu Dhabi, UAE
Position Summary In this remote, hourly contractor role, you will evaluate AI-generated C++ code and develop cases that test coding accuracy and reasoning quality. Tasks may include: Evaluating AI-generated C++ code for
Robotics Data Collector (United Arab Emirates)
Dubai, UAE
CNTXT AI is a leading artificial intelligence company focused on building next-generation AI systems, products, and infrastructure . We specialize in developing multimodal AI, robotics, and intelligent agents , powered b
Python Coding Expert – AI Training (Remote, Hourly Contractor)
Abu Dhabi, UAE
Position Summary In this remote, hourly contractor role, you will evaluate AI-generated Python code and develop cases that test coding accuracy and reasoning quality. Tasks may include: Evaluating AI-generated Python co
Enterprise Sales Manager – Data Services & AI
Abu Dhabi Emirate, UAE
Role Overview CNTXT AI is a UAE-based AI and data company operating at the intersection of enterprise AI adoption and the data infrastructure that makes it possible. Our Data Services practice covers the full spectrum of
Senior QA Engineer
Abu Dhabi, UAE
Key Responsibilities Design, document, and execute test plans, test cases, and regression suites for web and mobile applications across multiple projects Perform functional, usability, localization, and RTL (right-to-l
Data Collection Vehicle Operators
Abu Dhabi Emirate, UAE
Overview We are looking for two Data Collection Vehicle Operators to support an autonomous vehicle data collection program across the UAE. In this role, you will operate sensor-equipped vehicles along designated collecti
Enterprise Sales Lead
Dubai, UAE
Role Overview Munsit is CNTXT AI's Arabic Voice breakthrough: a series of STT & TTS models built in the UAE and trained on 30,000+ hours of real Arabic audio across 25+ dialects, it is deployed by governments, contact ce
Robotics Data Collector (Qatar)
Doha, QAT
CNTXT AI is a leading artificial intelligence company focused on building next-generation AI systems, products, and infrastructure . We specialize in developing multimodal AI, robotics, and intelligent agents , powered b
C++ Coding Expert – AI Training (Remote, Hourly Contractor)
Abu Dhabi, UAE
Robotics Data Collector (United Arab Emirates)
Dubai, UAE
Python Coding Expert – AI Training (Remote, Hourly Contractor)
Abu Dhabi, UAE
Enterprise Sales Manager – Data Services & AI
Abu Dhabi Emirate, UAE
Senior QA Engineer
Abu Dhabi, UAE
Data Collection Vehicle Operators
Abu Dhabi Emirate, UAE
Enterprise Sales Lead
Dubai, UAE
Robotics Data Collector (Qatar)
Doha, QAT