ABOUT US:
Founded in 2020, Regal is the AI Agent Platform. Regal gives every company the tools to transform customer communications with delightful AI Agents that are connected to your data, easy to customize and monitor, always available, and ready to take action. Power better support, sales, and operations – with way less effort. Our founders, Alex Levin and Rebecca Greene, helped build Angi (Angie’s List, HomeAdvisor, and Handy) to over $1.5B in revenue.Â
Based in Manhattan, we’re building an in-person culture of entrepreneurs who want to win and build something meaningful. We’re backed by top investors including Founder Collective, Homebrew, and Emergence Capital.
Come join us as we create a category-defining company, and follow Regal's company page on LinkedIn to stay up-to-date on our journey and current job openings!
We’re moving fast, and the numbers speak for themselves:
- Partnered with enterprise brands like Google, AAA, Ro, Coursera
- Raised $82M (top tier investors including Emergence & Homebrew)
- Completed 250MM+ calls
- Driven $7B revenue for customers
- Scaled to $## ARR
- Built amazing NYC (Midtown) in office culture
ABOUT THE ROLE:
We are looking for a Senior DevOps / Platform Engineer to design and operate the internal platform that enables engineers to ship quickly, safely, and reliably.
This role goes beyond infrastructure management; you will build systems that make deployment, observability, scaling, and incident response increasingly automated and self-service for product teams.Â
A key part of this role is supporting our AI-native engineering strategy: enabling the tools, environments, and workflows that allow our engineers to work effectively with AI coding assistants and agentic workflows at scale.
OUR STACK:
Cloud: AWS
Compute: ECS Fargate, Lambda, Event-Driven ServicesÂ
Infrastructure as Code: TerraformÂ
Languages: Python, TypeScript, BashÂ
Datastores: Aurora Postgres, DynamoDB, Redis (Elasticache), OpenSearch
Streaming & Queues: Kinesis, SQS, BullMQ
CI/CD: GitHub Actions (Enterprise)
Observability: Datadog, AWS CloudWatch
Architecture: Containerized APIs, Serverless Processing, Distributed Systems
AI Infrastructure: SageMaker and/or self-hosted model serving systems
WHAT YOU'LL WORK ON:
You will help build and operate Regal’s voice agent platform across containerized services, event-driven systems, and AI workloads running at production scale.
• Designing and maintaining Infrastructure as Code using Terraform to provision and manage AWS infrastructure.
• Building and evolving internal developer platforms (IDPs) that establish a golden path. These are opinionated, self-service workflows for deployment, environment provisioning, and operational tooling, so engineers spend less time on undifferentiated infrastructure decisions.
• Supporting self-serve developer environments that enable engineers and AI agents to iterate quickly and independently.
• Enabling and scaling AI-assisted engineering operations, including access management, adoption visibility, and developer productivity measurement across the team’s AI tooling.
• Improving system reliability, scalability, and performance across distributed services and real-time workloads.
• Optimizing CI/CD systems to reduce deployment friction and accelerate engineering feedback loops.
• Operating production systems with strong ownership of uptime, incident response, and recovery automation.
• Partnering with engineering teams to improve service architecture, resiliency, and operational maturity.
• Improving cost efficiency and infrastructure utilization across AWS and other infrastructure.
• Supporting secure and compliant cloud infrastructure aligned with SOC2 and enterprise security practices.
• Participating in on-call rotations.
ABOUT YOU:
• 5+ years of experience in DevOps, Platform Engineering, or Cloud Infrastructure roles.
• Deep experience operating production workloads on AWS.
• Strong hands-on experience with Terraform or comparable IaC tooling.
• Experience running containerized and serverless systems at scale.
• Strong understanding of distributed systems reliability and failure modes.
• Experience building CI/CD pipelines and developer automation tooling.
• Experience with observability platforms such as Datadog, Prometheus, or OpenTelemetry.
• Familiarity with incident response, on-call operations, and production debugging.
• Working knowledge of networking, IAM, and cloud security best practices.
• Comfortable collaborating across engineering teams and influencing technical direction.
NICE TO HAVE:
• Experience building or operating Internal Developer Platforms (IDPs) with golden path tooling.
• Experience building or managing self-serve developer environments at scale.
• Exposure to AI coding tools (e.g., code assistants, agentic coding workflows) and the infrastructure required to support them.
• Experience supporting AI/ML or real-time workloads.
• Cost optimization and cloud efficiency initiatives.
• Exposure to compliance frameworks (SOC2, HIPAA, PCI).
BENEFITS/PERKS:
• We care about your health!
• Medical, Dental, and Vision plans - 80% covered by the company
• Flexible PTO & 11 paid holidays/year
• We care about future you!
• 401k Plan
• Paid parental leave
• Pre-tax commuter benefits
• We care about connection!
• In-office breakfast and snacks daily
• Happy hours, team outings, & annual off-sites
• Complete laptop workstation
• & more to come!
POSITION LOCATION & OFFICE DETAILS:
This position is only available in New York City (HQ- Midtown). Hybrid roles are required in office T/W/TH and office optional M/F.
*If you think you’re missing relevant experience but you’re hungry and a fast learner (and can prove it), we want to hear from you!