Model Accuracy Development and Test Engineer (Datacentre AI Engineering) - Riyadh, KSA
Qualcomm
Company:
Qualcomm Middle East Information Technology Company LLCJob Area:
Engineering Group, Engineering Group > Software EngineeringGeneral Summary:
About Us
Qualcomm is enabling a world where everyone and everything can be intelligently connected. You interact with products and technologies made possible by Qualcomm every day, including 5G-enabled smartphones that double as pro-level cameras and gaming devices, smarter vehicles and cities, and the technology behind the smart, connected factories that manufactured your latest purchase. Qualcomm 5G and AI innovations are the power behind the connected intelligent edge. You’ll find our technologies behind and inside the innovations that deliver significant value across multiple industries and to billions of people every day.
About the Role
We are seeking an Inference Accuracy engineer to design, develop, and validate model accuracy of deep learning models deployed at scale. The role focuses on deep accuracy analysis, debugging, accuracy evaluation, and recovery during inference on large data-centre hardware platforms. You will have strong problem-solving ability, excellent Python programming skills, and hands-on expertise with inference pipelines.
Key Responsibilities will include:
- Define and implement accuracy KPIs across precision modes
- Develop scalable Python-based accuracy evaluation tools and automated pipelines.
- Implement accuracy-preserving optimizations for inference frameworks (TensorRT, ONNX Runtime, AITemplate, Triton).
- Build and maintain automated pipelines for accuracy evaluation across multiple frameworks (ONNX, TensorFlow, PyTorch).
- Develop reusable plugins for pre-processing, post-processing, and metric evaluation.
- Execute comprehensive accuracy tests for large-scale models (LLMs, vision, diffusion).
- Validate accuracy under various quantization and precision settings (FP32, FP16, INT8).
- Perform accuracy analysis with deep understanding of model architecture, including layers, attention mechanisms, and parameter configurations.
- Identify architecture-driven accuracy degradation trends and propose optimization strategies.
- Identify issues related to pre-processing drift, tokenization mismatches, operator fallback, and quantization effects.
- Analyse accuracy differences across hardware targets, firmware versions, and runtime backends.
- Perform slice-based accuracy analysis (batch size, concurrency, sequence length, domain shifts).
- Design and run experiments to recover accuracy, including fine-tuning, calibration, and hyperparameter adjustments.
- Debug accuracy failures by tracing root causes across data pre-processing, model layers, quantization steps, and deployment pipelines.
- Compare results across different hardware/software stacks and generate actionable insights.
- Document workflows, maintain dashboards, and publish accuracy results for stakeholders.
Required Skills & Experience:
- Strong background in AI/ML model evaluation and accuracy metrics.
- Solid understanding of model architectures (transformers, CNNs, RNNs, MoE) and their impact on accuracy.
- Experience with large language models (LLMs) and generative AI accuracy validation.
- Expertise with inference runtimes (TensorRT, ONNX Runtime, Triton).
- Understanding of quantization (INT8/FP8/INT4), calibration, QAT, and accuracy trade-offs. Experience with model graph conversion (PyTorch ONNX
- backend engines).
- Hands-on experience with accuracy pipeline development and automation frameworks. Understanding of video generation model accuracy and multi-modal evaluation benchmarking
- Proficiency in Python and familiarity with ML toolkits (ONNX Runtime, TensorFlow, PyTorch).
- Expertise in accuracy analysis, including statistical methods and visualization tools
- Ability to design experiments for accuracy recovery and debug accuracy failures effectively.
- Knowledge of quantization techniques and mixed-precision workflows.
- Experience with data-centre accelerators (NVIDIA A100/H100/B200, AI100 Ultra, Gaudi, TPU).
- Knowledge of LLM accuracy evaluation tools (lm-eval, HELM, synthetic benchmarks) is an advantage
- Strong problem-solving and analytical skills with the ability to isolate complex accuracy issues.
- Familiarity with distributed deployment systems (Kubernetes, cloud inference services).
Required Qualifications:
- Bachelor's / Masters degree in Engineering, Machine learning/ AI, Information Systems, Computer Science, or related field.
- 4-10 years’ of Software Engineering or related work experience.
- 4-10 years’ experience with Programming Language such as C,C++, Python.
What's on Offer
Apart from working with great people, we offer the below:
- Salary including housing & transport allowance
- Stock (RSU's) and performance related bonus
- 16 weeks fully paid Maternity Leave
- 6 weeks fully paid Paternity Leave
- Employee stock purchase scheme
- Child Education Allowance
- Relocation and immigration support (if needed)
- Life and Medical Insurance
- Live+ Well Reimbursement for health and recreational membership fees
Minimum Qualifications:
- Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
Master's degree in Engineering, Information Systems, Computer Science, or related field and 1+ year of Software Engineering or related work experience.
OR
PhD in Engineering, Information Systems, Computer Science, or related field.
- 2+ years of academic or work experience with Programming Language such as C, C++, Java, Python, etc.
- References to a particular number of years experience are for indicative purposes only. Applications from candidates with equivalent experience will be considered, provided that the candidate can demonstrate an ability to fulfill the principal duties of the role and possesses the required competencies.
Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail [email protected] or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).
Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.
If you would like more information about this role, please contact Qualcomm Careers.
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resumeSimilar jobs
Influencers Coordinator - Tamheer Program
Customer Service Agent
Senior Software Engineer - McKinsey Digital