Site Reliability Engineering - Observability Engineer
About the role
4100 Gordon Baker Road, Toronto, Ontario, M1W 3E8
SRE Observability Engineer
Would you like to be part of a Reliability Engineering chapter in a DevOps model, implementing Ansible automations and eliminating toil?
The Technology Support for Commercial Banking and Treasury & Payment Solutions team is undergoing an exciting transformation from a traditional IT organization to a modern DevOps and SRE practice. As a Senior Site Reliability Engineer, you will help define and implement key reliability strategies within our teams and platforms.
Reporting directly to the Director of Technology Support, you will collaborate with leaders and administrators and have an opportunity to significantly impact the group.
We are looking for someone with an innovative mindset who wants to push boundaries and improve the status quo by leveraging emerging best practices to provide a platform for innovation for our critical applications.
About the Role
Observability and Automation: applying modern SRE operational and development practices, you will be involved in the full scope of operational support: monitoring, automation, building, testing, and delivering high-quality solutions for the SRE team.
Be a People Person: working in a collaborative, diverse, team-oriented environment, you will contribute to a positive team culture, demonstrate thought leadership, value diverse ideas, and partner with cross-functional and remote teams.
Be an Agile Person: with a keen sense of urgency and a desire to work in a fast-paced, dynamic environment, you will deliver solutions against strict timelines.
Be Innovative: you are empowered to try novel approaches and learn innovative technologies. You will contribute innovative ideas, create solutions, and be accountable for end-to-end deliveries.
Act as a leader and communicator: through active engagement and communication with cross-functional partners and team members, you will help develop colleagues by sharing your expertise, presenting and articulating ideas, and collaborating on technical developments.
Qualifications
- Hands-on experience setting up monitors, dashboards, and alerts for production infrastructure, covering both proactive and reactive issues, with good exposure to monitoring tools
- Demonstrated high proficiency in automation, system monitoring, and proactive monitoring of the availability, latency, scalability, and efficiency of all services
- Automation and Development: expertise with Ansible, Git, Bitbucket, Jira, Confluence, Linux, Windows, WebSphere, OpenShift, VMware, and Oracle
- Triage, troubleshoot, and resolve issues using the golden signals, and go beyond them (chaos engineering, game days, etc.)
- Strong expertise in automation around monitoring application health, application metrics, pattern matching, and anomaly detection
- Familiar with setting up and integrating monitoring tools with hybrid infrastructure (preferably Dynatrace, Prometheus)
- Familiar with designing and implementing APM in Dynatrace or similar tools
- Demonstrated ability to understand end-to-end application design, functionality, performance, and workflow, and to identify gaps that can be automated
- Implement SRE frameworks to meet the highest SLA levels through operational excellence
- Familiar with Incident Management ISRM guidelines, adhering to SLOs and SLIs. Deploy playbooks to achieve MTTR and MTTI targets
- SRE Mindset: chaos engineering, monitoring, alerting, investigative troubleshooting, zero downtime, automatic recovery, and self-healing
- Good understanding of, and proven experience with, CI/CD pipelines, including restructuring and modernizing pipelines
- Bachelor's or Master's degree in Computer Science or another STEM major (Science, Technology, Engineering, and Math)
- 5+ years of experience in DevOps, Site Reliability, or Production Incident Management
Nice to have
- Knowledge of cloud computing (AWS and Azure) is a strong asset
- Strong working knowledge of ServiceNow
- Good working knowledge of commercial banking
Salary:
$75,900.00 - $141,900.00
Pay Type:
Salaried
The above represents BMO Financial Group’s pay range and type.
Salaries will vary based on factors such as location, skills, experience, education, and qualifications for the role, and may include a commission structure. Salaries for part-time roles will be pro-rated based on number of hours regularly worked. For commission roles, the salary listed above represents BMO Financial Group’s expected target for the first year in this position.
BMO Financial Group’s total compensation package will vary based on the pay type of the position and may include performance-based incentives, discretionary bonuses, as well as other perks and rewards. BMO also offers health insurance, tuition reimbursement, accident and life insurance, and retirement savings plans. To view more details of our benefits, please visit: https://jobs.bmo.com/global/en/Total-Rewards
About Us
At BMO we are driven by a shared Purpose: Boldly Grow the Good in business and life. It calls on us to create lasting, positive change for our customers, our communities and our people. By working together, innovating and pushing boundaries, we transform lives and businesses, and power economic growth around the world.
As a member of the BMO team you are valued, respected and heard, and you have more ways to grow and make an impact. We strive to help you make an impact from day one – for yourself and our customers. We’ll support you with the tools and resources you need to reach new milestones, as you help our customers reach theirs. From in-depth training and coaching, to manager support and network-building opportunities, we’ll help you gain valuable experience, and broaden your skillset.
To find out more, visit us at https://jobs.bmo.com/ca/en
BMO is committed to an inclusive, equitable and accessible workplace. By learning from each other’s differences, we gain strength through our people and our perspectives. Accommodations are available on request for candidates taking part in all aspects of the selection process. To request accommodation, please contact your recruiter.
Note to Recruiters: BMO does not accept unsolicited resumes from any source other than directly from a candidate. Any unsolicited resumes sent to BMO, directly or indirectly, will be considered BMO property. BMO will not pay a fee for any placement resulting from the receipt of an unsolicited resume. A recruiting agency must first have a valid, written and fully executed agency agreement contract for service to submit resumes.
About BMO
At BMO, banking is our personal commitment to helping people at every stage of their financial lives.
The truth is, people’s needs change, so we change too. But we never change who we are, which means we’ll never waver from providing our customers the best possible banking experience in the industry.
Our incredible team of over 46,000 people is just the tip of the iceberg. You should get to know us. We’re here to help.
Our social media terms of use: https://www.bmo.com/socialmediatermsofuse