At Koalafi, we believe in a world where no one has to put an important purchase on hold. That's why we're making it easier for more people to pay for big purchases over time.
Retailers across the country rely on us to offer flexible lease-to-own financing to their non-prime consumers, while increasing sales and strengthening customer loyalty. Their 2M+ customers love us because we provide a flexible way for them to make payments and give them an opportunity to improve their credit. Our 200+ Koalafi teammates enjoy inspiring and challenging work that accelerates their careers.
Interested in learning more about how we're transforming the financing experience and joining our team?
About the role
We're hiring a Tech Lead for DevOps to own the delivery and operational health of our platform engineering team. This is a hands-on leadership role: you carry significant engineering weight alongside the team while owning the operating rhythm, sprint commitments, and accountability for what ships.
You will work in close partnership with our Cloud Architect, who owns platform strategy and technology standards. Your job is to translate that direction into a clear, well-organized backlog, and then drive the team to deliver it with quality and predictability. You are the bridge between strategic direction and team execution.
We build in an AI-forward environment. You'll be expected to leverage AI tools in your own work and create the conditions (in pipelines, tooling, and team culture) for engineers to do the same effectively and responsibly.
What you'll do
Delivery ownership
• Set and own team priorities: determine what the team works on, in what order, and why; translate VP direction and Cloud Architect input into a clear, executable backlog
• Own sprint commitments and team capacity planning: accountable for what the team commits to and whether it ships
• Surface risks early and communicate delivery status accurately to the VP
• Run sprint ceremonies: planning, stand-ups, retros, and demos
• Maintain Jira hygiene: tickets are well-defined, updated, and always reflect actual state
• Identify and resolve blockers before they slow the team down
• Communicate cross-team dependencies early and proactively
Technical execution
• Be a strong technical contributor: carry significant engineering weight alongside the team and actively deliver on high-impact work
• Own day-to-day technical decisions within the team's scope
• Translate architectural direction into sprint-level tasks the team can act on
• Build and evolve CI/CD pipelines and delivery automation, ensuring deployment safety, consistency, and velocity
• Improve observability and operational readiness across metrics, logging, distributed tracing, and alerting (Prometheus, Grafana, Dynatrace, Elasticsearch), including actionable dashboards and SLO-based alerting
• Design and implement automation and self-service workflows using infrastructure-as-code, APIs, and developer platforms to reduce developer friction
• Implement secure delivery practices with policy-driven pipeline controls
• Contribute to infrastructure in Terraform, working within established architectural patterns and standards
• Support and improve secrets management patterns across runtime and CI/CD workflows
• Champion AI-assisted development practices across the team: prompt engineering workflows, AI-powered code review, and tooling integrations (e.g., GitHub Copilot, Cursor, or equivalent) as first-class parts of the engineering workflow
Operational readiness
• Own incident response coordination: drive the process, communicate status, and ensure issues reach the right people
• Participate in the on-call rotation and help drive improvements that reduce incidents and alert noise over time
• Build and maintain operational runbooks, escalation paths, and documentation for team-owned systems
• Drive production readiness as a continuous standard, not a one-time checklist
People management & team development
• Manage a team of engineers: this is a formal people manager role with full accountability for the team's performance and growth
• Own performance management: regular 1:1s, performance reviews, and direct, constructive feedback
• Own career development: growth planning, identifying opportunities, and building engineer capability over time
• Mentor engineers through code reviews, pairing, and delivery coaching
• Build a team culture that is organized, reliable, and focused on impact
• Manage team working norms, address blockers, and partner with the VP on people concerns that require escalation
About you
• 7+ years of hands-on experience in cloud infrastructure, DevOps, SRE, or platform engineering
• Demonstrated experience as a formal people manager or tech lead with direct reports, including performance management, career development, and building team capability
• Demonstrated experience leading technical delivery: owning timelines, driving sprint execution, and being accountable for what a team ships
• Ability to translate high-level technical direction into well-scoped, executable team work
• Comfort partnering with senior technical peers (Cloud Architect, VP) and sound judgment on when to consult vs. decide independently
• Strong hands-on experience with Terraform in production (modules, patterns, environment strategy, state management)
• Strong hands-on experience operating Kubernetes in production (EKS strongly preferred)
• Strong AWS fundamentals: practical experience with compute, networking, IAM, and production operations
• Experience building and maintaining CI/CD pipelines (GitLab CI preferred; GitHub Actions transferable)
• Strong observability fundamentals including metrics, logging, distributed tracing, SLO/SLI design, and alerting strategy, with experience evaluating and evolving observability practices at a platform level
• Experience building automation using Bash and a general-purpose language (Go or Python)
• Strong troubleshooting skills: you drive root cause analysis and implement long-term fixes
• Hands-on experience using AI coding tools (e.g., GitHub Copilot, Cursor, Claude) as a productivity multiplier in production engineering work
• Location Requirement: This position requires regular in-person attendance at one of our two office locations (Richmond, VA or Arlington, VA). Candidates must already be located within a commutable distance to either location, as relocation assistance is not available at this time.
Preferred qualifications
• Experience with Istio or other service mesh technologies
• Experience operating relational databases in AWS (RDS PostgreSQL/Aurora/MS SQL)
• Experience with AWS Lambda or serverless architectures
• Experience improving reliability for distributed systems at scale
• Prior experience as a technical anchor or team lead in a platform or infrastructure context
• Experience building or operating infrastructure that supports AI/ML workloads (compute, storage, serving patterns) in AWS
Why choose Koalafi:
A career at Koalafi means opportunities to tackle exciting challenges every single day. We take pride in a culture of innovation, trust, and ownership. You'll get outside your comfort zone, build meaningful relationships, and most of all, take charge of projects that ultimately help people get the things they need most.
Benefits:
At Koalafi, you will have a direct impact on our products and help shape the company's success. We offer competitive compensation & benefits packages to keep you at your best:
• Comprehensive medical, dental, and vision coverage
• 20 PTO days + 11 paid holidays
• 401(k) retirement with company matching
• Student Loan & Tuition Reimbursement
• Commuter assistance
• Parental leave (maternal + paternal)
• Inclusion and Associate Engagement Programs
Who we are & what we value:
• We focus on what's most important
• We set clear expectations and deliver
• We embrace challenges to reach our full potential
• We ask, "How can this be better?"
• We move fast together