Role: Lead AI SRE/ AI Ops Engineer
Location: Fremont, CA (4 days Onsite) (C2C / LOCAL ONLY)
Experience
Total 13+ years of experience required and around 5+ yearsΒ of hands-on experience in IT operations, cloud operations, SRE, platform support, or production engineering
Proven experience in production support, incident handling, automation, and operational troubleshooting
Experience working with monitoring, observability, scripting, and release validation
Exposure to AIOps, AI-assisted operations, or automation-led support modelsΒ is strongly required.
Key Responsibilities
Lead the adoption and operationalization of the SRE agentΒ across support and reliability workflows
Translate existing scripts, runbooks, SOPs, and operational knowledgeΒ into agent-compatible workflows
Work with teams to identify which use cases should be automated, semi-automated, or remain human-driven
Validate agent outputs, recommendations, and remediation steps before operational use
Support production releases, release validation, smoke testing, and post-release health checks
Drive troubleshooting during incidents and ensure proper root cause analysis and follow-through
Improve alert handling, event correlation, and operational response patterns
Coordinate with engineering, operations, and platform teams on onboarding and process changes
Mentor junior engineers and guide them on workflow design, validation, and operational execution
Maintain high-quality documentation, runbooks, and operational standards
Required Technical Skills
Strong hands-on scripting experience in PowerShell, Python, Shell/Bash
Experience with monitoring, alerting, logs, dashboards, and incident workflows
Good understanding of production support processes, release support, and validation practices
Experience with cloud platforms, preferably Azure
Familiarity with ITSM/ticketing toolsΒ such as ServiceNow, Jira, or similar
Ability to understand existing operational scripts and modernize them into scalable workflows
Experience with APIs, integrations, or automation pipelines is preferred
Experience to Kubernetes / AKS/AI tools - ChatGPT, copilot.
Β
Thanks & Regards
Tania Choudhary || Talent Acquisition
Office:Β +1 (224) 292-5667, Ext - 603Β
[Upgrade to PRO to see contact]