Opportunity Description
Will be responsible for Eyes on glass Monitoring, Triage & Incident Ownership, Troubleshooting & Restoration, Cross-Team Collaboration, Platform & Application Stack Awareness and Service Quality & Process Excellence.• Triage & Incident Ownership
o Perform rapid intake, triage, and prioritization of alerts, tickets, and incidents.
o Act as Incident Owner during high-severity events, ensuring clear communication, timely updates, and swift restoration of service.
o Maintain accurate, real-time incident timelines and post-incident documentation.
• Troubleshooting & Restoration
o Execute root-cause isolation across application, middleware, APIs, data, and infrastructure layers.
o Use observability/monitoring tools (e.g., Kibana, Dynatrace, Cloud Watch, Grafana) to correlate logs, metrics, and traces; identify anomalies, performance bottlenecks, and failure patterns.
o Perform targeted mitigations, rollbacks, config fixes, and coordinate hotfixes to restore service quick...
o Perform rapid intake, triage, and prioritization of alerts, tickets, and incidents.
o Act as Incident Owner during high-severity events, ensuring clear communication, timely updates, and swift restoration of service.
o Maintain accurate, real-time incident timelines and post-incident documentation.
• Troubleshooting & Restoration
o Execute root-cause isolation across application, middleware, APIs, data, and infrastructure layers.
o Use observability/monitoring tools (e.g., Kibana, Dynatrace, Cloud Watch, Grafana) to correlate logs, metrics, and traces; identify anomalies, performance bottlenecks, and failure patterns.
o Perform targeted mitigations, rollbacks, config fixes, and coordinate hotfixes to restore service quick...
Interested in this opportunity? Apply now through Expertini.
Apply for this Position