D

Infrastructure/SRE Developer

DevOpsChat

winnipeg, mb, Canada Full-time May 21, 2026

Opportunity Description

Things You'll Do:

Ensure high reliability and uptime of production systems through proactive monitoring, incident response, and capacity planning.

Develop and maintain automated solutions for configuration management, deployment, monitoring, and alerting/self-healing.

Participate in on‑call rotations, lead incident response efforts, and drive root cause analysis to prevent recurrence.

Define, measure, and track SLIs, SLOs, and SLAs, ensuring alignment with business and reliability goals.

Collaborate with application and infrastructure teams to design resilient, scalable, and secure architectures.

Adopt and leverage AI‑powered solutions to optimize observability, anomaly detection, automated remediation, and operational forecasting. Implement and refine AI‑assisted automation workflows to streamline incident management and reduce human intervention in repetitive tasks.

Continuously improve system performance, scalability, co...
Full-time Other-General

Interested in this opportunity? Apply now through Expertini.

Apply for this Position