Site Reliability Engineer (Azure PaaS / Observability)

We're seeking an experienced Site Reliability Engineer (SRE) to join a high-performing team supporting a large-scale SaaS platform built on Microsoft Azure PaaS. This role focuses on observability, system reliability, and incident management for complex cloud environments - ideal for engineers who thrive on problem-solving and rapid issue resolution rather than relying solely on dashboards.

Key Responsibilities

Ensure reliability, availability, and performance of Azure PaaS-based systems.
Build and maintain observability and monitoring frameworks to detect and resolve issues proactively.
Diagnose complex production incidents quickly, identify root causes, and implement long-term fixes.
Develop automation for deployment, scaling, and monitoring through Infrastructure as Code (IaC) tools.
Collaborate closely with development and operations teams to drive continuous service improvement.
Optimise performance, capacity, and cost efficiency across the SaaS environment.

What We're Looking For

Proven hands-on experience with Azure PaaS solutions (App Services, Functions, Service Bus, SQL, etc.) within the last 3 years.
Strong background in observability, monitoring, and troubleshooting of distributed cloud systems.
Proficiency with IaC and automation (e.g., ARM templates, Bicep, Terraform, PowerShell).
Familiarity with monitoring tools such as Azure Monitor, Application Insights, Prometheus, Grafana, or Datadog.
Experience in incident response, root cause analysis, and continuous improvement.
Excellent diagnostic and problem-solving skills - able to trace issues beyond dashboards.
Azure certification preferred (AZ-305 or similar).

Ideal Background

Previous experience in a SaaS or enterprise production environment.
Understanding of cloud architecture, security, and compliance best practices.
Strong collaboration and communication skills within cross-functional teams.

For more information, contact Seamus at Reperio or apply through the link.

Reperio Human Capital acts as an Employment Agency and an Employment Business.

Skills: SRE Azure PaaS SaaS Dublin
Reliability Engineer • Dublin, Leinster, Republic of Ireland