I

Confluent Incident Management Engineer

IBM

toronto, on, Canada Full-time June 11, 2026

Opportunity Description

Join Confluent as an Incident Management Engineer and enhance the reliability of our cloud infrastructure. Manage incident response and implement strategic improvements to prevent future incidents.
This role fuses technical engineering and strategic program ownership, dedicating 75% of your time to hands-on engineering tasks. You'll focus on building automation, analyzing systemic failures, and developing reliability enhancements, while also teaching and coordinating post-mortem processes with various teams. Your impact will help shape critical incident response standards.
Key Responsibilities:
• Design reliability improvements to mitigate incidents
• Manage Rootly configuration and its integration with relevant tools
• Maintain SLO/SLA frameworks using error budgets
• Review and edit customer-facing incident documentation
• Develop and deliver incident training programs
Requirements:
• Over 10 years in SRE or reliability engineering
• Experience with AWS,...
Full-time Engineering

Interested in this opportunity? Apply now through Expertini.

Apply for this Position