Top Benefits
Medical, Dental & Vision Insurance
Flexible Time Off Program
Paid Holidays
About the role
Who you are
- This role offers the perfect opportunity to hone your skills and gain direct exposure to advanced cloud database architecture and container orchestration challenges
- 5 years of experience managing large-scale, high-availability database systems (PostgreSQL and MongoDB) in a SaaS environment
- Deep Expertise in Kubernetes & Helm (3+ years required):
- Mandatory: Proven experience managing database deployments using Kubernetes and Helm deployments
- Strong Proficiency: Experience defining, troubleshooting, and maintaining Kubernetes resources such as StatefulSets, Pod Security Contexts (SCCs), NetworkPolicy, and custom RBAC for database Service Accounts
- Deep knowledge of advanced PostgreSQL HA concepts (e.g., streaming replication, Repmgr/Patroni) and MongoDB sharding and replication, specifically how they are implemented and configured via Helm values
- Experience managing database infrastructure on major cloud platforms (AWS, GCP, or Azure)
- Highly proficient in scripting (Bash/Python) and using GitOps principles to manage infrastructure and deployment pipelines
- Strong grasp of database performance tuning, scaling concepts, and optimizing SQL/Aggregation queries
- Container Orchestration Experience with production databases is mandatory
- Hands on experience in using version control systems, configuration management tools and IaaC such as Terraform, CloudFormation
- Experience using database tools such as pgAdmin, Pgbench, Robo3t, Studio3t, MongoDB Ops Manager and Mongo mirror
- Experience with prometheus, cloudwatch and monitoring tools both within kubernetes and external cloud managed infrastructure
What the job involves
- Datarobot is actively seeking a Database Engineer to join our Fleet Management team
- This is a pivotal role that requires creativity, deep technical knowledge, and great enthusiasm to manage our stateful infrastructure
- This position is an exciting opportunity to own the full lifecycle (administration, automation, and troubleshooting) of our critical database systems operating within a large-scale, multi-tenant Kubernetes environment
- You will be essential in driving our GitOps and Helm-centric deployment strategy, focusing on ensuring zero-downtime upgrades and maximizing performance and stability for our core platform services
- Design, implement, and maintain database infrastructure using StatefulSets, Operators, and Helm charts to ensure databases are reliable, self-healing, and scalable
- Own the deployment lifecycle for database clusters by managing version control for Helm charts and configuration templates
- Support and administer production database systems by proactively instrumenting and monitoring performance, security, and availability within the containerized environment
- Perform zero-downtime upgrades and migrations for major and minor releases, developing and maintaining Helm hooks and custom scripts to automate complex stateful operations
- Manage and optimize performance for backend data stores, ensuring data consistency and integrity across pod life cycles
- Develop and maintain automated backup and recovery processes, specifically designed for containerized databases, including volume snapshots and off-cluster storage integrations
- Resolve critical production issues related to container resource limits, network policies, storage classes, and database-specific tuning/configuration within a Kubernetes cluster
- Partner with application teams to implement database changes, review migrations, and ensure efficient resource utilization in the shared Kubernetes infrastructure
- Develop and maintain automation tools and scripts (Bash, Python) specifically focused on simplifying Kubernetes management tasks, such as provisioning users/secrets and monitoring cluster state
Benefits
- Medical, Dental & Vision Insurance
- Flexible Time Off Program
- Paid Holidays
- Paid Parental Leave
- Global Employee Assistance Program (EAP)
- Work from Home Opportunities
- A World-class Team
- Company Outings
- Open Door Policy
Top Benefits
Medical, Dental & Vision Insurance
Flexible Time Off Program
Paid Holidays
About the role
Who you are
- This role offers the perfect opportunity to hone your skills and gain direct exposure to advanced cloud database architecture and container orchestration challenges
- 5 years of experience managing large-scale, high-availability database systems (PostgreSQL and MongoDB) in a SaaS environment
- Deep Expertise in Kubernetes & Helm (3+ years required):
- Mandatory: Proven experience managing database deployments using Kubernetes and Helm deployments
- Strong Proficiency: Experience defining, troubleshooting, and maintaining Kubernetes resources such as StatefulSets, Pod Security Contexts (SCCs), NetworkPolicy, and custom RBAC for database Service Accounts
- Deep knowledge of advanced PostgreSQL HA concepts (e.g., streaming replication, Repmgr/Patroni) and MongoDB sharding and replication, specifically how they are implemented and configured via Helm values
- Experience managing database infrastructure on major cloud platforms (AWS, GCP, or Azure)
- Highly proficient in scripting (Bash/Python) and using GitOps principles to manage infrastructure and deployment pipelines
- Strong grasp of database performance tuning, scaling concepts, and optimizing SQL/Aggregation queries
- Container Orchestration Experience with production databases is mandatory
- Hands on experience in using version control systems, configuration management tools and IaaC such as Terraform, CloudFormation
- Experience using database tools such as pgAdmin, Pgbench, Robo3t, Studio3t, MongoDB Ops Manager and Mongo mirror
- Experience with prometheus, cloudwatch and monitoring tools both within kubernetes and external cloud managed infrastructure
What the job involves
- Datarobot is actively seeking a Database Engineer to join our Fleet Management team
- This is a pivotal role that requires creativity, deep technical knowledge, and great enthusiasm to manage our stateful infrastructure
- This position is an exciting opportunity to own the full lifecycle (administration, automation, and troubleshooting) of our critical database systems operating within a large-scale, multi-tenant Kubernetes environment
- You will be essential in driving our GitOps and Helm-centric deployment strategy, focusing on ensuring zero-downtime upgrades and maximizing performance and stability for our core platform services
- Design, implement, and maintain database infrastructure using StatefulSets, Operators, and Helm charts to ensure databases are reliable, self-healing, and scalable
- Own the deployment lifecycle for database clusters by managing version control for Helm charts and configuration templates
- Support and administer production database systems by proactively instrumenting and monitoring performance, security, and availability within the containerized environment
- Perform zero-downtime upgrades and migrations for major and minor releases, developing and maintaining Helm hooks and custom scripts to automate complex stateful operations
- Manage and optimize performance for backend data stores, ensuring data consistency and integrity across pod life cycles
- Develop and maintain automated backup and recovery processes, specifically designed for containerized databases, including volume snapshots and off-cluster storage integrations
- Resolve critical production issues related to container resource limits, network policies, storage classes, and database-specific tuning/configuration within a Kubernetes cluster
- Partner with application teams to implement database changes, review migrations, and ensure efficient resource utilization in the shared Kubernetes infrastructure
- Develop and maintain automation tools and scripts (Bash, Python) specifically focused on simplifying Kubernetes management tasks, such as provisioning users/secrets and monitoring cluster state
Benefits
- Medical, Dental & Vision Insurance
- Flexible Time Off Program
- Paid Holidays
- Paid Parental Leave
- Global Employee Assistance Program (EAP)
- Work from Home Opportunities
- A World-class Team
- Company Outings
- Open Door Policy