About HighLevel:
HighLevel is an AI-powered business operating system that gives agencies, entrepreneurs and SMBs the infrastructure to build, automate and scale. Today, HighLevel supports SMBs across 150+ countries, fueling community-driven growth rooted in real customer outcomes.
To date, businesses operating on HighLevel have generated over $7 billion in ecosystem value, demonstrating the impact of shared infrastructure at scale. By centralizing conversations, automation and intelligence into one system, we help businesses move faster, reduce complexity and execute efficiently.
Behind the platform, HighLevel powers more than 4 billion API hits and 2.5 billion message events daily. With 250 terabytes of distributed data, 250+ microservices and over 1 million domain names supported, our architecture is built for performance, resilience and long-term scalability.
Our People
With over 2,000 team members across 10+ countries, HighLevel operates as a global, remote-first organization built for speed and ownership. We value initiative, clarity and execution, creating space for ambitious people to build systems that support millions of businesses worldwide. Here, innovation thrives, ideas are celebrated and people come first, no matter where they call home.
Our Impact
Every month, HighLevel enables more than 1.5 billion messages, 200 million leads and 20 million conversations for the more than 1 million businesses we support. Behind those numbers are real people building independence, expanding opportunity and creating measurable impact. We're proud to be a part of that.
Learn more about us on our YouTube channel or blog.
About the Role:
We are seeking a Staff Engineer - Cloud Infrastructure & Security to act as a technical architect and leader across HighLevel's cloud platform.
This role is a senior individual contributor position responsible for designing and evolving secure, scalable, and resilient infrastructure on GCP, with deep ownership across Kubernetes, networking, IAM, and edge security (Cloudflare).
You will work closely with Platform Engineering, SRE, and Cyber Security teams to ensure infrastructure is secure by design, highly available, and aligned with modern best practices, while enabling teams to move quickly and safely.
Responsibilities:
Cloud Infrastructure Architecture (GCP):
• Design and evolve GCP-based infrastructure architecture for scalability, resilience, and security.
• Define standards for:
    • Project and environment structure
    • Multi-region deployments
    • High availability and failover strategies
• Lead architectural reviews for high-impact infrastructure changes.
• Ensure infrastructure supports high-scale, multi-tenant SaaS workloads.
Kubernetes Platform (GKE):
• Architect and optimize Kubernetes (GKE) platforms for production workloads.
• Define and enforce:
    • Cluster architecture and node pool strategies
    • Workload isolation and scheduling policies
    • Upgrade and lifecycle management strategies
• Improve reliability, scalability, and operational efficiency of Kubernetes environments.
Networking & Edge (Cloudflare):
• Design and manage secure and scalable cloud networking:
    • VPCs, subnets, routing, and firewalls
    • Load balancing and traffic routing
• Own integration with Cloudflare, including:
    • CDN configuration
    • WAF rules and DDoS protection
    • Edge security and traffic management
• Ensure low-latency, resilient, and secure traffic flows.
Identity & Access Management (IAM):
• Design and enforce least-privilege IAM architecture across GCP and platform systems.
• Define standards for:
    • Service accounts and roles
    • Access control policies
    • Just-in-time access and auditing
• Partner with Cyber Security to continuously improve access posture and reduce risk.
Cloud Security & Platform Hardening:
• Build and enforce secure-by-default infrastructure patterns.
• Partner closely with Cyber Security teams to:
    • Identify and remediate vulnerabilities
    • Implement security controls and guardrails
    • Support threat modeling and risk assessments
• Secure Kubernetes workloads, networking layers, and cloud services.
Infrastructure as Code & Automation:
• Drive adoption and quality of Infrastructure as Code (IaC) using tools like Terraform.
• Build reusable infrastructure modules and automation frameworks.
• Ensure infrastructure changes are auditable, repeatable, and safe.
• Reduce manual operational work through automation.
Reliability, DR & Operational Readiness:
• Design and improve disaster recovery (DR) and failover strategies.
• Define and validate RTO / RPO objectives.
• Partner with SRE teams to improve incident response, system resilience, and operational readiness.
• Participate in postmortems and drive systemic improvements.
Performance & Cost Optimization:
• Identify infrastructure inefficiencies and performance bottlenecks.
• Partner with FinOps and Cloud teams to:
    • Optimize resource utilization
    • Improve cost visibility and predictability
• Balance performance, reliability, and cost in architectural decisions.
Technical Leadership & Mentorship:
• Act as a technical leader across Cloud Infrastructure and Security domains.
• Mentor SDE2, SDE3, and Lead engineers.
• Drive design reviews, architecture discussions, and best practices.
• Influence teams across the organization without direct authority.
Cross-Functional Collaboration:
• Work closely with:
    • Platform Engineering (CI/CD, DevEx)
    • SRE & InfraOps (operations and reliability)
    • Cyber Security teams (security and compliance)
• Communicate complex technical concepts clearly to stakeholders and leadership.
Requirements:
• Bachelor's degree or equivalent experience in Engineering or related field
• 9+ years of experience in cloud infrastructure, platform engineering, or security
• Deep hands-on experience with:
    • GCP (preferred) or other cloud platforms
    • Kubernetes (GKE) in production environments
    • Cloud networking and distributed systems
• Strong experience with:
    • Cloudflare (CDN, WAF, edge security)
    • IAM and access control systems
• Proven experience designing secure, highly available systems at scale
• Strong problem-solving and system design skills
• Excellent communication and leadership abilities
Nice to Have:
• Experience in high-growth SaaS environments
• Familiarity with service mesh (Istio or similar)
• Experience with policy-as-code (OPA, Kyverno)
• Experience in compliance-driven environments
• Scripting or programming experience (Go, Python, Bash)
EEO Statement:
The company is an Equal Opportunity Employer. As an employer subject to affirmative action regulations, we invite you to voluntarily provide the following demographic information. This information is used solely for compliance with government record-keeping, reporting, and other legal requirements. Providing this information is voluntary and refusal to do so will not affect your application status. This data will be kept separate from your application and will not be used in the hiring decision.