Top Benefits
About the role
Join a dynamic IT consulting environment in Montréal as a Production Support Engineer, where you’ll play a critical role in ensuring the stability and performance of large-scale cloud infrastructure systems. This is a hybrid role with scheduled work-from-home flexibility.
Tasks
- Monitor and resolve production incidents and service disruptions
- Optimize system reliability, performance, and availability using SRE principles
- Automate operational workflows and cloud infrastructure on Azure
- Analyze logs, metrics, and alerts to proactively prevent incidents
- Support and maintain Azure resources: VMs, AKS, Storage, Networking, etc.
- Collaborate cross-functionally with DevOps, development, and infrastructure teams
- Participate in on-call rotations for critical systems
Requirements
- Strong Azure Cloud experience (Azure Monitor, App Insights, Log Analytics)
- Scripting & automation: PowerShell, Python, Terraform, or Ansible
- Observability tools: Prometheus, Grafana, Datadog, or Splunk
- CI/CD tools: Azure DevOps, GitHub Actions, Jenkins
- Containers & orchestration: Kubernetes (AKS), Docker
- Solid understanding of cloud-native architecture and distributed systems
- Familiarity with ITIL, SRE, and RCA methodologies
- Basic knowledge of SQL/NoSQL databases and security practices
Benefits
- Hybrid work flexibility (3 days onsite/week)
- Work with modern cloud-native infrastructure
- Join a fast-paced, collaborative environment focused on innovation
- Opportunity to work on mission-critical systems with leading technology
Ready to support enterprise-scale systems and make a real impact? Apply now and grow your cloud operations career.
Top Benefits
About the role
Join a dynamic IT consulting environment in Montréal as a Production Support Engineer, where you’ll play a critical role in ensuring the stability and performance of large-scale cloud infrastructure systems. This is a hybrid role with scheduled work-from-home flexibility.
Tasks
- Monitor and resolve production incidents and service disruptions
- Optimize system reliability, performance, and availability using SRE principles
- Automate operational workflows and cloud infrastructure on Azure
- Analyze logs, metrics, and alerts to proactively prevent incidents
- Support and maintain Azure resources: VMs, AKS, Storage, Networking, etc.
- Collaborate cross-functionally with DevOps, development, and infrastructure teams
- Participate in on-call rotations for critical systems
Requirements
- Strong Azure Cloud experience (Azure Monitor, App Insights, Log Analytics)
- Scripting & automation: PowerShell, Python, Terraform, or Ansible
- Observability tools: Prometheus, Grafana, Datadog, or Splunk
- CI/CD tools: Azure DevOps, GitHub Actions, Jenkins
- Containers & orchestration: Kubernetes (AKS), Docker
- Solid understanding of cloud-native architecture and distributed systems
- Familiarity with ITIL, SRE, and RCA methodologies
- Basic knowledge of SQL/NoSQL databases and security practices
Benefits
- Hybrid work flexibility (3 days onsite/week)
- Work with modern cloud-native infrastructure
- Join a fast-paced, collaborative environment focused on innovation
- Opportunity to work on mission-critical systems with leading technology
Ready to support enterprise-scale systems and make a real impact? Apply now and grow your cloud operations career.