JOB TITLE:Lead AI SRE/ AI Ops Engineer
LOCATION- Β Fremont, CA (C2C) (locals preferred)
β‘οΈ VISA: H1B
β‘οΈ PASSPORT NO. MANDATORY
JOB DES
βοΈ Total 13+ years of experience required and around 5+ yearsΒ of hands-on experience in IT operations, cloud operations, SRE, platform support, or production engineering
Proven experience in production support, incident handling, automation, and operational troubleshooting
Experience working with monitoring, observability, scripting, and release validation
Exposure to AIOps, AI-assisted operations, or automation-led support modelsΒ is strongly required.
Β
Key Responsibilities
Lead the adoption and operationalization of the SRE agentΒ across support and reliability workflows
Translate existing scripts, runbooks, SOPs, and operational knowledgeΒ into agent-compatible workflows
Work with teams to identify which use cases should be automated, semi-automated, or remain human-driven
Validate agent outputs, recommendations, and remediation steps before operational use
Support production releases, release validation, smoke testing, and post-release health checks
Drive troubleshooting during incidents and ensure proper root cause analysis and follow-through
Required Technical Skills
Strong hands-on scripting experience in PowerShell, Python, Shell/Bash
Experience with monitoring, alerting, logs, dashboards, and incident workflows
Good understanding of production support processes, release support, and validation practices
Experience with cloud platforms, preferably Azure
Familiarity with ITSM/ticketing toolsΒ such as ServiceNow, Jira, or similar
Ability to understand existing operational scripts and modernize them into scalable workflows
Experience with APIs, integrations, or automation pipelines is preferred
Experience to Kubernetes / AKS/AI tools - ChatGPT, copilot.
Warm Regards
Insiya Gandotra || Technical Recruiter
Direct: +1 (224) 788-7502Β EXT 604
[Upgrade to PRO to see contact]