Platform Engineer

Hookdeck 15 days ago

Remote

Mid Level

About the role

To apply, send an email to hiring@hookdeck.com

We're looking for an experienced Platform Engineer to work on the core capabilities of Hookdeck. This role spans backend development, data engineering, and DevOps—you'll be responsible for building out our platform, adapting and optimizing for growth, and maintaining uptime and resilience.

You'll work on systems that process billions of events per month, ensuring reliability, scalability, and performance at every level. This is a hands-on engineering role where you'll design, build, and operate critical infrastructure that thousands of companies depend on.

We are looking for experienced distributed systems developers who have worked on highly concurrent, event-driven systems. You should be comfortable with the challenges of building and operating systems at scale, from low-level performance optimization to high-level system architecture.

About Hookdeck

Hookdeck provides event infrastructure that ensures reliable, scalable, and observable event ingestion and routing for modern applications. Acting as a gateway between webhook providers and your backend, Hookdeck prevents dropped events, mitigates system overload, and simplifies event management with queueing, throughput control, and monitoring. Our flagship product, the Event Gateway hosts thousands of companies and developers and processes billions of events per month.

We are a decentralized, developer-centric team that values ownership, autonomy, and quality in all aspects of your work and responsibilities.

About You

You're an experienced engineer who has built and operated distributed systems at scale. You understand the complexities of highly concurrent, event-driven architectures and have deep experience with the tools and patterns that make them reliable. You're comfortable working across the stack—from backend services to data pipelines to infrastructure—and you're excited about the challenge of building systems that process billions of events reliably.

You're a strong problem-solver who can debug complex issues across distributed systems, optimize for performance and cost, and design systems that scale. You value reliability, observability, and maintainability, and you're always thinking about how to make systems more resilient.

Responsibilities

Backend Development — Build and maintain core services that handle event ingestion, routing, and delivery
Data Engineering — Design and maintain data pipelines, storage systems, and analytics infrastructure
DevOps & Infrastructure — Manage and optimize our cloud infrastructure, deployment pipelines, and monitoring systems
Platform Development — Build out new platform capabilities to support product growth and new features
Performance Optimization — Identify and optimize bottlenecks in our systems to improve throughput and reduce latency
Reliability & Resilience — Design and implement systems that maintain high uptime and gracefully handle failures
Observability — Build and maintain monitoring, alerting, and debugging tools to ensure system health
Scalability — Design systems that can scale horizontally to handle growth in traffic and data volume

Example Projects

Design and implement a new event routing system that can handle 10x our current throughput
Optimize our ClickHouse data pipeline to reduce query latency and storage costs
Implement a new caching layer using Redis to reduce database load
Optimize service memory usage by reducing duplication of payload data in memory
Build observability tools to help debug issues across our distributed system
Design and implement disaster recovery procedures for critical systems

What We're Looking For

5+ years of experience building and operating distributed systems at scale
Deep experience with highly concurrent, event-driven systems
Strong experience with one or more of: Kafka, GCP, Kubernetes, Node.js, Go, ClickHouse, Redis
Experience with distributed systems patterns (queues, pub/sub, event sourcing, etc.)
Strong understanding of system reliability, observability, and performance optimization
Comfortable with infrastructure as code and modern DevOps practices
Experience with data engineering and building data pipelines at scale
Strong problem-solving skills and ability to debug complex distributed systems
Excellent communication skills and ability to work independently
Passion for building reliable, scalable infrastructure

Additional Application Notes

If this sounds like you, we'd love to hear from you.

To apply, please email hiring@hookdeck.com. In your email, tell us what interests you about this role and share examples of distributed systems you've built or worked on. If you have relevant work examples, code samples, or technical blog posts, please include them. We're particularly interested in hearing about your experience with event-driven systems, high-throughput systems, and the challenges you've solved in production.

About Hookdeck

Website

Platform Engineer

Hookdeck 15 days ago

Remote

Mid Level

About the role

To apply, send an email to hiring@hookdeck.com

About Hookdeck

We are a decentralized, developer-centric team that values ownership, autonomy, and quality in all aspects of your work and responsibilities.

About You

Responsibilities

Backend Development — Build and maintain core services that handle event ingestion, routing, and delivery
Data Engineering — Design and maintain data pipelines, storage systems, and analytics infrastructure
DevOps & Infrastructure — Manage and optimize our cloud infrastructure, deployment pipelines, and monitoring systems
Platform Development — Build out new platform capabilities to support product growth and new features
Performance Optimization — Identify and optimize bottlenecks in our systems to improve throughput and reduce latency
Reliability & Resilience — Design and implement systems that maintain high uptime and gracefully handle failures
Observability — Build and maintain monitoring, alerting, and debugging tools to ensure system health
Scalability — Design systems that can scale horizontally to handle growth in traffic and data volume

Example Projects

Design and implement a new event routing system that can handle 10x our current throughput
Optimize our ClickHouse data pipeline to reduce query latency and storage costs
Implement a new caching layer using Redis to reduce database load
Optimize service memory usage by reducing duplication of payload data in memory
Build observability tools to help debug issues across our distributed system
Design and implement disaster recovery procedures for critical systems

What We're Looking For

5+ years of experience building and operating distributed systems at scale
Deep experience with highly concurrent, event-driven systems
Strong experience with one or more of: Kafka, GCP, Kubernetes, Node.js, Go, ClickHouse, Redis
Experience with distributed systems patterns (queues, pub/sub, event sourcing, etc.)
Strong understanding of system reliability, observability, and performance optimization
Comfortable with infrastructure as code and modern DevOps practices
Experience with data engineering and building data pipelines at scale
Strong problem-solving skills and ability to debug complex distributed systems
Excellent communication skills and ability to work independently
Passion for building reliable, scalable infrastructure

Additional Application Notes

If this sounds like you, we'd love to hear from you.

About Hookdeck

Website