Jobs.ca

Senior Systems Developer

Elastic, posted 14 days ago
Remote (Canada)
CA$73,675 - CA$116,550 per year
Senior Level

Top Benefits

Fully paid health coverage for you and your family
Flexible location and schedule for most roles
Distributed workforce with generous vacation days

About the role

Who you are

  • Software engineering background to collaborate with engineers on solutions. Public cloud and managed Kubernetes experience is a plus
  • Passion for developing solutions with inclusive communication methods. Experience in distributed or remote teams is desirable
  • Proven experience in IT systems engineering, infrastructure management, or related fields
  • Strong knowledge of cloud platforms (e.g., AWS, Azure, GCP) and on-premise infrastructure
  • Experience with automation tools and scripting (e.g., Python, Bash, PowerShell, Ansible, GitHub Actions, RunDeck)
  • Experience with a range of tools, including productivity platforms such as Zoom, GitHub, LastPass, Slack, G-Suite, Confluence, and JIRA; GenAI solutions like Gemini AI, Azure OpenAI, Claude AI, CoPilot, Cursor, and WindSurf; as well as secret management systems including HashiCorp Vault, GitHub Secrets, Google Secrets, and AWS Secrets
  • Integration/Architecture experience with ElasticSearch, DataDog, Splunk, or similar platforms
  • Familiarity with Kubernetes and Infrastructure as Code (IaC) principles
  • Understanding of Virtual Private Networks (VPN) and Zero Trust security models
  • Knowledge of X.509 certificate standards, certificate chains, and secure key management
  • Hands-on experience with monitoring and observability tools, ideally the Elastic Stack (Elasticsearch, Kibana, Beats, Logstash)
  • Experience configuring and handling alerting and incident response workflows using tools like PagerDuty
  • Ability to create and maintain custom dashboards and alert rules tailored to business and technical requirements
  • Familiarity with standard methodologies for monitoring distributed systems, cloud environments, and containerized workloads (e.g., Kubernetes)
  • Advanced experience with Linux and Windows environments
  • Knowledge of data centre operations and standard methodologies
  • Strong communication and collaboration abilities
  • Commitment to security, compliance, and operational excellence

These represent the types of work you will do as a Site Reliability Engineer at Elastic IT; you don't need to have experience with all of them:

  • Previous experience in a distributed, remote-first company
  • Building or operating Kubernetes-at-scale infrastructure, ideally across multiple cloud providers, and supporting automation
  • Writing non-trivial programs in Golang or other programming languages
  • Working with containerized services (e.g., Docker)
  • Proven experience in leading and improving alerting and major incident management systems (e.g., Elastic Stack, Graphite, Prometheus, Influx) to diagnose issues and quantify impacts
  • Experience in system administration with professional skills in Linux on distributed systems at scale
  • Diagnosing or designing, implementing, and creating solutions with the Elastic Stack
  • Thriving in self-organizing and sharing in a globally distributed team environment
  • Strengthening team members by uplifting others with coaching and mentoring

What the job involves

As a Sr. Systems Engineer at Elastic, you will design, deploy, and maintain our cloud infrastructure, with a focus on automation, reliability, and security. This includes developing and maintaining code for internal company needs. You will collaborate with multi-functional teams to ensure our technology environment is robust, scalable, and aligned with business objectives.

  • Architect, deploy, and maintain cloud infrastructure (network, server, storage)
  • Develop and maintain automation for user lifecycle management (provisioning/de-provisioning)
  • Resolve complex technical issues related to infrastructure and end-user devices
  • Oversee the full lifecycle of SSL/TLS certificates, including procurement, installation, renewal, and revocation across all environments, and automate these processes
  • Build, maintain, and optimize monitoring solutions for infrastructure, applications, and services, with a strong preference for experience using the Elastic Stack
  • Develop and manage dashboards, alerts, and visualizations in Kibana to provide actionable insights and real-time access to system health and performance
  • Implement and refine alerting workflows, ensuring timely and accurate notifications for critical incidents
  • Integrate monitoring platforms with incident management tools such as PagerDuty to automate issue and on-call processes
  • Collaborate with engineering and operations teams to define Service Level Objectives (SLOs) and continuously improve observability practices
  • Participate in incident response, including triage, critical-issue handling, and post-incident analysis, using monitoring and alerting data to drive root cause analysis and remediation
  • Collaborate with internal partners to gather requirements and deliver solutions
  • Document processes, configurations, and solutions

Benefits

  • Toast to your health: Fully paid health coverage for you and your family, in many locations
  • Craft your calendar: Flexible location and schedule for most roles
  • Create space for you: Distributed-by-design workforce, plus a generous number of vacation days each year
  • Embrace parenthood: Minimum of 16 weeks of parental leave, plus generous family formation benefits
  • Give back your time: 40 hours each year to use toward volunteering with organizations and causes you’re passionate about
  • Amplify your impact: Double your charitable giving — we match donations up to $1500 USD (or local currency equivalent)

About Elastic

Industry: Software Development
Company size: 1001-5000 employees

Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale. Elastic’s solutions for search, observability, and security are built on the Elastic Search AI Platform — the development platform used by thousands of companies, including more than 50% of the Fortune 500.