Senior DevOps Engineer
Job Description
Join Roofr as a Senior DevOps Engineer and Shape the Future of Roofing Tech!
Are you a passionate and experienced engineering leader seeking a high-impact role in a fast-growing, successful startup? At Roofr, we’re revolutionizing the roofing industry with our customer-centric CRM and innovative solutions. We’re looking for a talented Senior DevOps Engineer to continuously improve our development operations and ensure the reliability and availability of our cloud-based applications and services.
Why Roofr?
We’re not just another tech company. We’re a team of dedicated individuals obsessed with providing exceptional value to our customers. We’re building a seamless, powerful platform that connects sales tools, payments, material ordering, and more, empowering roofing businesses to manage and scale effectively. We have an amazing culture, strong financials, and industry-leading metrics. This is your chance to join a company where you can make a real difference, with significant growth opportunities and equity potential.
What You’ll Do:
- Partner with engineering teams to own and manage the availability, latency, performance, reliability, and scalability of all services, ensuring we meet our customer SLAs.
- Provide strong technical leadership and people management to the DevOps team, fostering a culture of collaboration and innovation.
- Act as a Subject Matter Expert (SME) and champion end-to-end ownership of the SRE function, with an automation-first mindset.
- Collaborate with Engineering Managers, Product Managers, and Software Engineers to deliver comprehensive SRE solutions and set effective SLOs/SLIs and error budgets for all services.
- Establish clear processes, methodologies, metrics, and KPIs to drive accountability and continuous improvement.
- Own and drive cloud cost management, capacity planning, and performance management, partnering with engineering leaders to optimize for both performance and cost-efficiency.
- Drive instrumentation of services for end-to-end Observability – Monitoring, Alerting, Metrics, Logging, and Dashboarding.
- Lead the incident management process, conduct blameless post-mortems, publish incident reports, and maintain runbooks and on-call schedules.
- Coordinate with product development engineering teams on change management and release management processes.
- Evangelize SRE best practices and promote a culture of reliability and resilience.
- Champion and implement strong DevSecOps principles and SDLC best practices to ensure security is integrated throughout the development lifecycle.
- Coach, mentor, and cross-train team members, fostering their professional growth and development.
- Embody servant leadership principles and embrace an Agile mindset.
What You Bring to the Table:
- 5+ years of professional software engineering experience.
- 5+ years of experience building and managing technical processes in SRE, DevOps, or related domains.
- Proven experience supporting infrastructure and services in public cloud environments, with AWS as a must-have.
- Experience leading SRE teams with a strong emphasis on automation and continuous improvement.
- Hands-on experience developing applications in one or more language stacks: Java, Python, Ruby, JavaScript, Go, etc.
- Strong hands-on knowledge of Infrastructure-as-Code tools and technologies: Terraform, AWS CloudFormation, Packer, etc.
- Solid understanding of network engineering and foundational network protocols and services such as TCP/IP, HTTP, DHCP, DNS, VPN, etc.
- Experience with Agile software development and Scrum methodology.
- Excellent communication, collaboration, and presentation skills.
- Strong problem-solving and analytical skills.
Bonus Points:
- AWS certification.
- Expertise in Linux systems.
- Experience building and managing fault-tolerant, large-scale distributed systems.
- Strong hands-on experience with containers and container orchestration frameworks: Docker, Kubernetes, Amazon ECS, Amazon EKS, AWS Fargate, etc.
- Experience with CI/CD, DevOps, and Pipeline-as-Code: ArgoCD, Jenkins, Spinnaker, GitLab CI/CD, etc.
- Experience with Microservices architecture and the CNCF ecosystem.
- Bachelor’s or Master’s Degree in Computer Science, Engineering, or a related discipline.
What We Offer (US + Canada):
At Roofr, we invest in our people. Here’s how we support you in doing the most impactful work of your career:
- Generous Time Off: Mandatory PTO in your first week, monthly “laundry days” (extra Fridays off!), company-wide shutdown between Christmas and New Year’s, and flexible time off.
- Comprehensive Benefits: 80% employer-paid benefits, RRSP/401k match, and a generous Parental Leave policy.
- Perks That Matter: Bi-annual company retreats, team-building activities, ample learning and development opportunities, home office setup stipend, internet and phone allowance, a remote-first culture, and weekly Friday paydays!
A Note on AI:
We love AI! We use it to make our work lives easier and more efficient. Feel free to use AI to prepare for your interview, but we want to meet *you*, not your AI alter ego. Bring your authentic self!
Important Security Notice:
Be aware of potential scams! Roofr will never ask for payment or financial information during the application process. All legitimate job postings are on our official career site: https://roofr.com/careers. If you receive suspicious messages, contact us at talent@roofr.com.
Roofr is an equal opportunity employer and values diversity. We encourage applications from all qualified individuals.