Manager, Site Reliability Engineering Tooling
Toast is driven by building the platform that helps restaurants adapt, take control, and focus on what they do best: creating experiences their guests love.
Tremendous business growth has spurred a need for significant investment in Toast's platform teams.
The Site Reliability Engineering team at Toast is responsible for overseeing Toast production services, with a commitment to quality, reliability, and low latency—without heroics.
The team accomplishes this by:
- Building tooling to automate, monitor, and manage deployed services using reliability best practices
- Developing and evangelizing patterns and best practices to improve scalability, observability, and reliability of Toast systems
- Consulting with teams to enhance product scalability, observability, security, and reliability
- Participating in outage response and root cause analysis for critical systems and infrastructure incidents
As a Manager of the Site Reliability Engineering team, you will provide technical leadership and hands-on code contributions, incorporating reliability best practices for programming and scripting, observability, production triage, incident resolution, and root cause analysis to maintain world-class reliability and uptime.
Responsibilities
- Enable a geographically distributed team of talented engineers to perform at a high level and increase their impact
- Drive day-to-day operations and contribute to the development and prioritization of the SRE roadmap for major initiatives
- Create and lead strategic organization-wide scalability, observability, and reliability initiatives in collaboration with technical leadership and Product Management
- Influence architecture decisions to optimize resilience and scalability
- Guide teams to build and maintain reliable, available systems for Toast customers
- Mentor engineers to support their professional growth
Requirements
- Hands-on experience managing an SRE team, including hiring, mentoring, and cross-functional collaboration
- Proficiency in coding with Kotlin, Go, Python, Java/JVM
- Experience leading complex engineering projects in a Scrum environment
- Experience building and operating distributed systems
- Knowledge of networking, cloud architectures, and patterns
- Deep understanding of systems, networking, and scaling challenges
- Exposure to cloud infrastructure and SaaS solutions
This is a hybrid role requiring in-office presence two days per week.
Benefits
We offer competitive compensation and benefits designed to attract, retain, and motivate top talent.
Our total rewards package supports a healthy lifestyle and flexibility to meet changing needs.
Learn more at Toast Benefits.
Diversity, Equity, and Inclusion
At Toast, our employees are our secret ingredient.
We embrace diversity with authenticity, inclusivity, respect, and humility, creating equitable opportunities for all and delivering exceptional experiences.
Work Culture
We support a hybrid work model that fosters in-person collaboration while respecting individual needs.
Our goal is to build a strong, connected culture to empower the restaurant community.
More about our work environment is available at Locations.
Apply Today!
Toast is committed to accessible and inclusive hiring.
Reasonable accommodations are available for applicants with disabilities.
Contact for assistance.
Note: In Massachusetts, it is unlawful to require or administer lie detector tests as a condition of employment.
Site Manager • Dublin, Ireland