Job Title: SRE - AWS/Dynatrace with Development experience
Duration: 12-month contract
Location: Toronto, Ontario, Canada
Job Description:
Drive reliability, resiliency, and operational excellence for mission‑critical AWS serverless platforms, ensuring high availability, low MTTR, and strong production governance through Dynatrace‑driven observability.
- Resiliency strategy for serverless architectures (Lambda, API Gateway, async/event‑driven systems)
- SLOs / SLIs / Error Budgets for critical APIs
- Incident analysis and post‑incident reviews
- Dynatrace observability: dashboards, alert tuning, dependency mapping, RCA acceleration
- Operational excellence improvements: incident reduction, MTTR improvement, toil automation
- Reliability guardrails embedded into CI/CD and production readiness reviews
Core Responsibilities