Who are we?
Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. Our growing community of over 6,500 organizations in regulated industries counts on Smarsh every day to help them spot compliance, legal or reputational risks in 80+ communication channels before those risks become regulatory fines or headlines. Relentless innovation has fueled our journey to consistent leadership recognition from analysts like Gartner and Forrester, and our sustained, aggressive growth has landed Smarsh in the annual Inc. 5000 list of fastest-growing American companies since 2008.
Summary
The SMB Platform Engineering team at Smarsh is responsible for the reliability, automation, and evolution of the infrastructure powering Smarsh's SMB Product, a mission-critical archiving and communication surveillance platform currently running on-premises. Later this year, the team will be taking on cloud infrastructure responsibilities as part of Smarsh's broader platform modernization effort.
As a Level 3 Platform Engineer on the SMB Platform Engineering team, you will own the design, delivery, and operation of infrastructure automation and platform tooling across Smarsh's on-premises environment, with cloud responsibilities coming into scope later this year. You will work independently on complex projects and serve as a technical liaison to adjacent Smarsh platform teams. In year one, success looks like: measurable reduction in operational toil, at least one significant automation or migration initiative delivered, and runbooks for the systems you own.
How will you contribute?
• Own Kubernetes platform operations, including cluster health, workload deployments, scaling, and incident response.
• Design, implement, and operate infrastructure automation using Ansible, Terraform, and GitOps workflows (ArgoCD / Flux).
• Lead migration projects moving on-premises workloads toward One Smarsh (cloud-native) platform services.
• Build and maintain CI/CD pipelines (CircleCI, GitHub Actions) for infrastructure and application delivery.
• Drive observability improvements across Datadog, Splunk, and ELK, including dashboards, alert tuning, and SLO/SLA definition.
• Participate in the on-call rotation, responding to P1/P2 incidents; the team rotates on-call roughly every 4-6 weeks and performs scheduled overnight maintenance windows a few times per year.
• Support security and compliance requirements, including patch management, access controls, and audit readiness for regulated workloads.
• Contribute to runbooks and operational documentation as systems are built and changed.
• Collaborate with other Smarsh platform teams on the build and adoption of the One Smarsh platform.
What will you bring?
• 4–7 years of experience in platform engineering, SRE, or infrastructure engineering roles.
• Strong hands-on experience with Kubernetes (cluster operations, Helm, workload troubleshooting).
• Proficiency with infrastructure-as-code tooling, specifically Ansible and/or Terraform in production environments.
• Strong Linux systems administration skills (Ubuntu).
• Experience with GitOps workflows and CI/CD pipelines at scale.
• Experience with VMware vSphere in a production environment.
• Demonstrated ability to self-direct and drive projects to completion with minimal oversight.
• Comfortable operating in evolving environments where processes and tooling are actively maturing.
• Strong communication skills with cross-functional stakeholders.
• Experience with one or more of Datadog, Splunk, or ELK for dashboards, monitors, and log management is preferred.
• Familiarity with compliance-sensitive or regulated industry infrastructure (financial services, healthcare, or similar) is preferred.
• Experience with ArgoCD, Flux, or similar GitOps continuous delivery tooling is preferred.
• Familiarity with Jenkins or Concourse for CI/CD pipeline management is preferred.
• Familiarity with VMware Kubernetes Service (VKS) or other VMware-native Kubernetes platforms is preferred.
• Python scripting for automation and tooling is preferred.
• Prior experience in an on-call rotation with a defined SLA structure is preferred.
• Experience with cloud infrastructure (AWS) is preferred, as the team takes on cloud responsibilities later this year.
About our culture
Smarsh hires lifelong learners with a passion for innovating with purpose, humility and humor. Collaboration is at the heart of everything we do. We work closely with the most popular communications platforms and the world’s leading cloud infrastructure platforms. We use the latest in AI/ML technology to help our customers break new ground at scale. We are a global organization that values diversity, and we believe that providing opportunities for everyone to be their authentic self is key to our success. Smarsh leadership, culture, and commitment to developing our people have all garnered Comparably.com Best Places to Work Awards. Come join us and find out what the best work of your career looks like.