Site Reliability Engineer (SRE) – APM Expertise
TestCrew | Quality Engineering & Software Testing
Date: 5 days ago
City: Riyadh
Contract type: Full time

We are looking for a Site Reliability Engineer (SRE) with deep experience in Application Performance Management (APM) tools like Dynatrace and AppDynamics. In this role, you will play a crucial part in maintaining and enhancing the reliability, performance, and scalability of our applications and infrastructure.
You will collaborate closely with development and operations teams to build robust systems, implement automation, and proactively manage performance and availability through modern observability and incident management practices.
Responsibilities
- System Reliability & Performance: Design and manage highly available, scalable infrastructure.
- APM Tooling: Leverage tools such as Dynatrace, AppDynamics, and New Relic to monitor and optimize application performance.
- Incident Management: Respond to incidents, conduct root cause analysis, and implement long-term fixes.
- Automation: Build automation frameworks and scripts to streamline deployment, monitoring, and operations.
- Monitoring & Alerting: Develop and maintain robust observability stacks to track system health and performance.
- Cross-Team Collaboration: Work with developers and product teams to ensure reliability of new features and releases.
- Capacity Planning: Analyze usage trends to plan for future scaling needs.
- Documentation: Maintain up-to-date documentation of systems, processes, and infrastructure.
- Continuous Improvement: Drive initiatives that improve reliability, scalability, and security.
Requirements
- Education: Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
- Experience: 5+ years in a Site Reliability Engineer or DevOps role.
- Hands-on experience with APM tools (Dynatrace, AppDynamics, New Relic).
- Experience with public cloud platforms: AWS, GCP, or Azure.
- Strong scripting skills (Python, Bash, etc.).
- Familiarity with configuration management (Ansible, Puppet, or Chef).
- Solid understanding of containerization and orchestration (Docker, Kubernetes).
- Proficiency in monitoring/logging tools: Prometheus, Grafana, ELK stack.
- Strong understanding of networking concepts and protocols.
How to apply
To apply for this job, you need to log in on our website. If you don't have an account yet, please register.