Bold is seeking a professional to own production ML environments and MLOps infrastructure, working across the infrastructure and data science teams. You will handle day-to-day support, ad-hoc requests, and cross-team projects, and architect scalable MLOps pipelines using AWS services such as SageMaker, EMR, and OpenSearch.
ABOUT THIS TEAM
The Infrastructure team provides services including automation, observability, cloud/server/network architecture, CI/CD, infrastructure as code, database administration, incident management, vendor management, and security and compliance. These services improve efficiency, reduce errors, and ensure fast and reliable application releases while maintaining security and compliance. TechOps helps teams monitor applications and infrastructure, build resilient infrastructure, identify and resolve IT service issues, manage vendors, and ensure cloud security and compliance. The team also focuses on continuous learning and adopting new technologies to provide better value to the organization.
WHAT YOU'LL DO
• Architect and productionize microservices with advanced DevOps pipelines, partnering with data science/ML teams to optimize model serving on Kubernetes platforms with a service mesh.
• Design comprehensive observability frameworks to minimize MTTR and ensure 24x7 platform reliability.
• Engineer secure, automated CI/CD workflows with governance and compliance controls.
• Lead 24x7 on-call engineering, CDN and security hardening, and cost optimization strategies for IaaS and PaaS in the cloud.
• Manage infrastructure operations, production deployments, and hybrid cloud environments.
• Drive cross-functional collaboration with development, QA, operations, and data teams across global time zones to scale OpenSearch/EMR platforms and platform operations.
WHAT YOU'LL NEED
• 4.5+ years (Sr. Engineer) / 7+ years (Module Lead) of AWS/DevOps production experience
• AWS: VPC/IAM/GuardDuty/AWS Config, S3/DynamoDB/Lambda/SageMaker/EMR, cost optimization
• Observability: Splunk, CloudWatch, Prometheus/Grafana, New Relic, Azure Monitor
• GitHub: Actions automation, branch/PR governance
• Systems/Networking: Nginx/IIS, WAF/firewalls, DDoS mitigation
• CI/CD: Jenkins (declarative pipelines, Groovy), Spinnaker, CodePipeline/CodeBuild, Gradle, Maven, Artifactory, GitHub Actions
• IaC: Terraform, AWS CloudFormation
• Scripting: Python/Bash/Groovy/Java (CI/CD, EKS, FastAPI/Spring Boot)
• Containers: EKS + Istio/Gloo/Karpenter, ECS, Docker, Buildah
• CDN: Akamai, CloudFront (cache/SSL/performance), Cloudflare
• Search/Data: Solr/OpenSearch, EMR pipelines
• DNS management (NS1/Cloudflare/GoDaddy)
EXPERIENCE
• Sr. Engineer, DevOps (AWS): 4.5+ years
About Bold
We Transform Work Lives
As an established global organization, BOLD helps people find jobs. Our story is one of growth, success, and professional fulfillment. We create digital products that have empowered millions of people in 180 countries to build stronger resumes, cover letters, and CVs. Our work helps people interview confidently and find the right job in less time. Our employees are experts, learners, contributors, and creatives.
We Celebrate And Promote Diversity And Inclusion
We value our position as an Equal Opportunity Employer. We hire based on qualifications, merit, and our business needs. We don't discriminate on the basis of race, color, religion, gender, pregnancy, national origin or citizenship, ancestry, age, physical or mental disability, veteran status, sexual orientation, gender identity or expression, marital status, genetic information, or any other applicable characteristic protected by law.
Under San Francisco's Fair Chance Ordinance, qualified applicants with arrest and conviction records will be considered for the position.