Jobs.ca
Jobs.ca
Language
Desjardins logo

Team Lead - SRE

Desjardins9 days ago
Verified
Hybrid
Toronto, ON
CA$75,900 - CA$141,900/year
Senior Level
Full-time

Top Benefits

Health insurance
Tuition reimbursement
Accident and life insurance

About the role

Application Deadline:

08/25/2025

Address:

4100 Gordon Baker Road

Job Family Group:

Technology

Key Responsibilities

Operational Leadership & Incident Management:

  • Monitor, troubleshoot and restore services for infrastructure, applications (online/offline) and security, ensuring compliance with SLAs.
  • Lead Major Incident Management, coordinating with internal teams and vendors for rapid resolution.
  • Oversee Change Management and Problem Management processes to minimize production disruptions.
  • Apply Agile principles in daily standups, sprint planning and retrospectives to improve incident response efficiency.
  • Lead and collaborate with a team of SA, DBA and SRE personnel to develop and implement the counter‑measures necessary to improve production and equipment reliability, as well as the compliance posture of the team.

On‑Prem Cloud Infrastructure Management:

  • Manage and optimize cloud‑based environments using RHEL Open‑shift infrastructure.
  • Automate deployment, scaling, and recovery processes using CI/CD pipelines.
  • Create docker containers using CI/CD pipelines for on‑prem Open‑shift deployments.
  • Ensure high availability, disaster recovery, and security compliance across systems.
  • Automate deployments and scaling with Agile‑DevOps practices (e.g., iterative improvements, automated testing).

Stakeholder Collaboration & Strategic Support:

  • Partner with business and IT stakeholders to align technology solutions with organizational goals.
  • Analyze operational data to provide insights, recommend improvements, and drive data‑driven decision‑making.
  • Support regulatory audits and ensure adherence to industry best practices (DevOps, SRE).
  • Drives and/or promotes new processes, systems, technology, operations and expanded capabilities for performance, with the flexibility to align to the unique requirements of the project teams and deliverables.

Innovation & Continuous Improvement:

  • Identify emerging trends in Automation, SRE and Cloud Computing to enhance system reliability.
  • Promote SRE practices (Observability, performance tuning, automation) to improve uptime and efficiency.
  • Promote Agile‑SRE integration by embedding reliability practices into sprint workflows.
  • Strengthen operational capabilities through knowledge sharing, mentoring and community building.

Qualifications & Skills:

  • Typically, 5‑10 years of work experience in IT, Banking or business environment and/or BS/BA or MBA/MS in computer science, engineering, information systems, math or business.
  • Understanding of Information Technology operating processes used for systems to ensure effective delivery including but not limited to IT or Banking mandatory operating standards for monitoring, logging, and alerting.
  • Knowledge of support and operations practice, concepts, and technology obtained through formal training and/or work experience.
  • Must have knowledge about:
    • Linux including Redhat
    • Linux System administration experience
    • Automation & scripting (Python, Bash, Ansible)
    • Observability monitoring tools including Dynatrace, Cloud Watch
    • Site Reliability Engineering (SRE) principles
    • Automation & scripting (Python, Bash, Ansible)
    • IBM WAS and MQ
    • Containerization using docker and Open‑shift
    • Service Now
    • Atlassian products JIRA and Confluence
  • Technical and/or business functional knowledge of systems, tools, timing and dependencies.
  • Deep knowledge and technical proficiency gained through extensive education and business experience.
  • Verbal & written communication skills - In‑depth.
  • Influence skills - In‑depth.
  • Data driven decision making - In‑depth.
  • Exceptional problem‑solving, analytical and communication skills.
  • Ability to lead cross‑functional teams and influence stakeholders.
  • Works independently and regularly manages non‑routine situations.
  • Broader work or accountabilities assigned as needed.
  • Great to have:
    • Database experience (Oracle)
    • AI knowledge – Copilot

Note: 2 days/week required in the office. Can be in the Scarborough or Mississauga office.

About Desjardins

Banking
10,000+

Desjardins Group is the largest cooperative financial group in North America and the fifth largest cooperative financial group in the world, with assets of $435.8 billion as at March 31, 2024. It was named one of Canada's Best Employers by Forbes magazine and by Mediacorp. To meet the diverse needs of its members and clients, Desjardins offers a full range of products and services to individuals and businesses through its extensive distribution network, online platforms and subsidiaries across Canada. Ranked among the world's strongest banks according to The Banker magazine, Desjardins has some of the highest capital ratios and credit ratings in the industry and the first according to Bloomberg News.