Jobs.ca
Jobs.ca
Language
DataRobot logo

Database Engineer

DataRobot5 days ago
Remote
Canada
Mid Level

Top Benefits

Medical, Dental & Vision Insurance
Flexible Time Off Program
Paid Holidays

About the role

Who you are

  • This role offers the perfect opportunity to hone your skills and gain direct exposure to advanced cloud database architecture and container orchestration challenges
  • 5 years of experience managing large-scale, high-availability database systems (PostgreSQL and MongoDB) in a SaaS environment
  • Deep Expertise in Kubernetes & Helm (3+ years required):
  • Mandatory: Proven experience managing database deployments using Kubernetes and Helm deployments
  • Strong Proficiency: Experience defining, troubleshooting, and maintaining Kubernetes resources such as StatefulSets, Pod Security Contexts (SCCs), NetworkPolicy, and custom RBAC for database Service Accounts
  • Deep knowledge of advanced PostgreSQL HA concepts (e.g., streaming replication, Repmgr/Patroni) and MongoDB sharding and replication, specifically how they are implemented and configured via Helm values
  • Experience managing database infrastructure on major cloud platforms (AWS, GCP, or Azure)
  • Highly proficient in scripting (Bash/Python) and using GitOps principles to manage infrastructure and deployment pipelines
  • Strong grasp of database performance tuning, scaling concepts, and optimizing SQL/Aggregation queries
  • Container Orchestration Experience with production databases is mandatory
  • Hands on experience in using version control systems, configuration management tools and IaaC such as Terraform, CloudFormation
  • Experience using database tools such as pgAdmin, Pgbench, Robo3t, Studio3t, MongoDB Ops Manager and Mongo mirror
  • Experience with prometheus, cloudwatch and monitoring tools both within kubernetes and external cloud managed infrastructure

What the job involves

  • Datarobot is actively seeking a Database Engineer to join our Fleet Management team
  • This is a pivotal role that requires creativity, deep technical knowledge, and great enthusiasm to manage our stateful infrastructure
  • This position is an exciting opportunity to own the full lifecycle (administration, automation, and troubleshooting) of our critical database systems operating within a large-scale, multi-tenant Kubernetes environment
  • You will be essential in driving our GitOps and Helm-centric deployment strategy, focusing on ensuring zero-downtime upgrades and maximizing performance and stability for our core platform services
  • Design, implement, and maintain database infrastructure using StatefulSets, Operators, and Helm charts to ensure databases are reliable, self-healing, and scalable
  • Own the deployment lifecycle for database clusters by managing version control for Helm charts and configuration templates
  • Support and administer production database systems by proactively instrumenting and monitoring performance, security, and availability within the containerized environment
  • Perform zero-downtime upgrades and migrations for major and minor releases, developing and maintaining Helm hooks and custom scripts to automate complex stateful operations
  • Manage and optimize performance for backend data stores, ensuring data consistency and integrity across pod life cycles
  • Develop and maintain automated backup and recovery processes, specifically designed for containerized databases, including volume snapshots and off-cluster storage integrations
  • Resolve critical production issues related to container resource limits, network policies, storage classes, and database-specific tuning/configuration within a Kubernetes cluster
  • Partner with application teams to implement database changes, review migrations, and ensure efficient resource utilization in the shared Kubernetes infrastructure
  • Develop and maintain automation tools and scripts (Bash, Python) specifically focused on simplifying Kubernetes management tasks, such as provisioning users/secrets and monitoring cluster state

Benefits

  • Medical, Dental & Vision Insurance
  • Flexible Time Off Program
  • Paid Holidays
  • Paid Parental Leave
  • Global Employee Assistance Program (EAP)
  • Work from Home Opportunities
  • A World-class Team
  • Company Outings
  • Open Door Policy

About DataRobot

Software Development
1001-5000

DataRobot delivers the industry-leading AI applications and platform that maximize impact and minimize risk for your business.