About the role
- As a Sr. SDET in Agentic QA, you will own the test automation and quality frameworks that support Dialpad’s AI Voice Agent services
- You will develop automated tests for end-to-end product experiences, from frontend UI to backend services to APIs to audio/text interactions
- You will test orchestration flows, agent configuration experiences, and guardian safeguards to create robust automated coverage for functionality, performance, reliability, UX, and more
- In this role, you will develop substantial amounts of automated test infrastructure and partner deeply with the development team to make our fast-growing AI platform more testable, more stable, and more delightful for customers
- Reports to a QA Eng Manager in the United States
- Own end-to-end quality for agentic features and workflows, including strategy, development, execution, and release qualification
- Design and build automation tooling and frameworks for AI/LLM-driven systems, including prompt flows, agent orchestration, and tool integrations
- Develop and maintain evaluation frameworks (evals) to measure response quality, accuracy, and hallucination rates
- Drive automation coverage (80%+ for critical AI workflows) using both deterministic and probabilistic validation approaches
- Integrate AI quality checks into CI/CD pipelines with fast feedback cycles (<15 minutes for PR validation)
- Build tooling for LLM observability and debugging, including prompt tracing and response analysis
- Partner with Applied AI teams on prompt engineering, model selection, and evaluation strategies
- Design and execute performance and load tests for AI services (latency, throughput, cost efficiency)
- Identify and mitigate risks related to hallucinations, bias, safety, and edge cases
- Define and track AI quality KPIs (task success rates, precision/recall, latency, etc.)
- Participate in design and architecture reviews to ensure systems are testable, observable, and resilient
- Mentor engineers and contribute to raising the bar on AI quality engineering practices
Benefits
- Company stock options
- 100% paid medical, dental, and vision plan
- Continued education stipend
- Cell phone, home internet, and gym membership stipend
- Catered lunches, free snacks & drinks
- Work from home opportunities

- Strong collaboration skills with the ability to work across distributed teams and time zones
- Strong programming skills in Python (preferred), Java, or JavaScript
- AI Stack: LLM APIs, LiveKit, prompt orchestration frameworks, evaluation tooling
- 5+ years of experience in software engineering or SDET roles with an emphasis on software development
- Familiarity with AI evaluation techniques (benchmarking, golden datasets, human-in-the-loop validation)
- Experience with CI/CD pipelines (e.g., Jenkins, GitHub Actions)
- Experience building test frameworks and scalable automation systems
- Demonstrated proficiency in coding with AI agents to accelerate development and improve code quality
- Backend: Python, Go, Google Cloud Platform, Cloud Run / App Engine, Kubernetes, Datastore, Redis, ElasticSearch
- Understanding of non-deterministic systems and probabilistic testing approaches
- Experience testing distributed, cloud-native SaaS systems and APIs
- Hands-on exposure to LLMs or AI/ML systems (e.g., OpenAI, Claude, Gemini, or similar platforms)
- Bachelor’s degree in Computer Science or equivalent practical experience
- Frontend: Vue3, React
- Don’t meet every single requirement? If you’re excited about this role and possess the fundamental traits, drive, and strong ambition we seek, but your experience doesn’t meet every qualification, we encourage you to apply
About Dialpad
Dialpad is the leading Ai-Powered Customer Intelligence Platform that is completely transforming how the world works together, with one beautiful workspace that seamlessly combines the most advanced Ai Contact Center, Ai Sales, Ai Voice, and Ai Meetings with Ai Messaging. Over 30,000 innovative brands and millions of people use Dialpad to unlock productivity, collaboration, and customer satisfaction with real-time Ai insights. Customers include WeWork, Uber, Motorola Solutions, Domo and Xero. Investors include Amasia, Andreessen Horowitz, Felicis Ventures, GV, ICONIQ Capital, Salesforce Ventures, Scale Venture Partners, Section 32, Softbank and Work-Bench.