About Dialpad
Dialpad is the leading AI-powered customer communications platform, transforming how businesses communicate with their customers. More than 50,000 companies around the globe, including Netflix, RE/MAX, Uber, Randstad, and Tractor Supply, rely on Dialpad to build stronger customer connections using real-time, AI-driven insights. Visit dialpad.com to learn more.
Being a Dialer
At Dialpad, you'll be part of a collaborative team working toward our shared mission of making our customers and their employees wildly successful. We believe that every conversation matters, and we're elevating each one with a platform that drives real-time insights and automation for our customers.
We thrive on continuous evolution, where every employee leverages industry-leading AI to constantly refine our platform and our own skills. We seek individuals who not only meet our high standards but go beyond them. Our ambition is significant, and achieving it requires a team that operates at the highest level. We look for individuals who are not just ambitious but who also possess the traits that are fundamental to our success: Scrappy, Curious, Optimistic, Persistent, and Empathetic.
Your role
As a Sr. SDET in Agentic QA, you will own the test automation and quality frameworks that support Dialpad's AI Voice Agent services. You will develop automated tests for end-to-end product experiences, from frontend UI to backend services to APIs to audio/text interactions. You will test orchestration flows, agent configuration experiences, and guardian safeguards to create robust automated coverage for functionality, performance, reliability, UX, and more.
In this role, you will develop substantial amounts of automated test infrastructure and partner deeply with the development team to make our fast-growing AI platform more testable, more stable, and more delightful for customers.
This position is based at one of Dialpad's Canadian offices and reports to a QA Eng Manager in the United States.
What you'll do
• Own end-to-end quality for agentic features and workflows, including strategy, development, execution, and release qualification.
• Design and build automation tooling and frameworks for AI/LLM-driven systems, including prompt flows, agent orchestration, and tool integrations.
• Develop and maintain evaluation frameworks (evals) to measure response quality, accuracy, and hallucination rates.
• Drive automation coverage (80%+ for critical AI workflows) using deterministic and probabilistic validation approaches.
• Integrate AI quality checks into CI/CD pipelines with fast feedback cycles (<15 minutes for PR validation).
• Build tooling for LLM observability and debugging, including prompt tracing and response analysis.
• Partner with Applied AI teams on prompt engineering, model selection, and evaluation strategies.
• Design and execute performance and load tests for AI services (latency, throughput, cost efficiency).
• Identify and mitigate risks related to hallucinations, bias, safety, and edge cases.
• Define and track AI quality KPIs (task success rates, precision/recall, latency, etc.).
• Participate in design and architecture reviews to ensure systems are testable, observable, and resilient.
• Mentor engineers and contribute to raising the bar on AI quality engineering practices.
What you'll bring
• 5+ years of experience in software engineering or SDET roles with an emphasis on software development.
• Strong programming skills in Python (preferred), Java, or JavaScript.
• Experience testing distributed, cloud-native SaaS systems and APIs.
• Demonstrated proficiency in coding with AI agents to accelerate development and improve code quality.
• Hands-on exposure to LLMs or AI/ML systems (e.g., OpenAI, Claude, Gemini, or similar platforms).
• Understanding of non-deterministic systems and probabilistic testing approaches.
• Experience building test frameworks and scalable automation systems.
• Familiarity with AI evaluation techniques (benchmarking, golden datasets, human-in-the-loop validation).
• Experience with CI/CD pipelines (e.g., Jenkins, GitHub Actions).
• Strong collaboration skills with the ability to work across distributed teams and time zones.
• Bachelor's degree in Computer Science or equivalent practical experience.
• Backend: Python, Go, Google Cloud Platform, Cloud Run / App Engine, Kubernetes, Datastore, Redis, ElasticSearch.
• Frontend: Vue3, React.
• AI Stack: LLM APIs, LiveKit, prompt orchestration frameworks, evaluation tooling.
For exceptional talent based in British Columbia, Canada, the target base salary range for this position is posted below. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the target range for new hire salaries for the position. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in British Columbia role postings reflect the base salary only and do not include bonus, equity, or benefits.
British Columbia, Canada Salary Range
$150,500–$175,250 CAD
We believe in investing in our people. Dialpad offers competitive benefits and perks, alongside a robust training program that helps you reach your full potential. We have designed our offices to be inclusive, offering a vibrant environment to cultivate collaboration and connection. Our exceptional culture, recognized repeatedly as a certified Great Place to Work, ensures every employee feels valued and empowered to contribute to our collective success.
Don't meet every single requirement? If you're excited about this role and you possess the fundamental traits, the drive, and strong ambition we seek, but your experience doesn't satisfy every qualification, we encourage you to apply.
Dialpad is an equal-opportunity employer. We are dedicated to creating an inclusive environment, free of discrimination and harassment.