Must have Skills : Incident management, Java (Expert), Cloud, Production Support, Stakeholder & Vendor Management 
Knowledge & Experience:
Production Support Leadership
β’ Own and manage end-to-end production support for digital applications, ensuring timely resolution of incidents
β’ and service requests.
β’ Lead the production support team, assigning tickets, overseeing issue analysis, and coordinating fixes and
β’ deployments.
β’ Ensure production go-live handoffs are seamless, with proper documentation, readiness checks, and support coverage.πΉ Incident & Problem Management
β’ Act as the primary escalation point for major incidents, SLA breaches, and recurring issues.
β’ Lead incident triage, root cause analysis (RCA), and post-incident reviews to drive long-term resolution.
β’ Maintain clear and timely communication with IT management, business users, and stakeholders during incidents.πΉ Monitoring & Health Checks
β’ Proactively monitor applications to pre-empt issues, minimize downtime, and ensure performance against SLAs.
β’ Implement and maintain health check routines, dashboards, and alerting mechanisms for critical systems.πΉ Automation & SOP Development
β’ Identify and automate manual, repetitive tasks to improve operational efficiency.
β’ Develop and maintain Standard Operating Procedures (SOPs) for production support activities and incident handling.πΉ Technical Troubleshooting & Consultation
β’ Provide hands-on troubleshooting for Java backend systems, web and mobile technologies, and various
β’ frameworks.
β’ Understand and resolve issues related to certificate exchanges, API integrations, cloud infrastructure, and system dependencies.
β’ Offer technical consultation to support teams and contribute to solution design reviews when needed.πΉ Cloud & Infrastructure Coordination
β’ Leverage hands-on experience with Azure (primary) and AWS (secondary) cloud platforms for troubleshooting and deployment support.
β’ Coordinate with network, firewall, SecOps, and infrastructure teams to resolve cross-domain issues.πΉ Stakeholder & Vendor Management
β’ Maintain strong relationships with business users, IT teams, and external vendors, ensuring alignment and timely resolution.
β’ Review vendor fixes and designs, and coordinate production deployments with third-party teams.πΉ Communication & Reporting
β’ Own communication for major incidents, including status updates, impact assessments, and resolution timelines.
β’ Prepare and deliver incident reports, RCA documentation, and performance dashboards to senior stakeholders.πΉ Team Leadership & Mentorship
β’ Mentor and coach production leads and support engineers, fostering a culture of ownership and continuous improvement.
β’ Ensure team members are equipped with the right tools, knowledge, and support to handle production issues
β’ effectively.πΉ Cross-Functional Collaboration
β’ Work closely with application development teams, infrastructure teams, and business units to ensure smooth operations and issue resolution.
β’ Participate in project planning and go-live readiness to ensure production support requirements are met.πΉ Compliance & Policy Adherence
β’ Ensure all production support activities conform to IT and bank policies, procedures, and security standards.
β’ Support Business Resumption Plan (BRP) testing and other compliance-related activities as required.πΉ Operational Continuity & Coverage
β’ Provide coverage during team absences, and be prepared for temporary reassignment or promotion in case of
β’ business necessity.
β’ Strive to understand peer and superior roles to ensure continuity of operations during emergencies or transitions.Skills:
πΉ Technical & Troubleshooting Skills
β’ Strong troubleshooting skills in Java backend systems
β’ Understanding of web and mobile technologies and various frameworks
β’ Hands-on experience with Azure Cloud (primary) and AWS Cloud (secondary)
β’ Knowledge of certificate management, including exchange and renewal processes
β’ Familiarity with API integrations, system interfaces, and backend services
β’ Ability to analyze and resolve complex production issues across distributed systemsπΉ Production Support & Incident Management
β’ Expertise in incident management, including triage, escalation handling, and RCA
β’ Experience managing high-volume, customer-facing applications
β’ Strong understanding of service request handling, SLAs, and operational KPIs
β’ Ability to lead war rooms, manage stress situations, and drive resolution under pressureπΉ Monitoring & Automation
β’ Proactive monitoring and health check implementation
β’ Experience in automating manual tasks and improving operational efficiency
β’ Development and maintenance of Standard Operating Procedures (SOPs)πΉ Functional & Domain Knowledge
β’ Strong functional and technical understanding of banking applications
β’ Ability to interpret business requirements and assess production impact
β’ Familiarity with digital banking platforms, transaction systems, and customer journeysπΉ Stakeholder & Vendor Management
β’ Strong stakeholder engagement and communication skills
β’ Experience in vendor coordination, design review, and deployment planning
β’ Ability to manage relationships across IT, business, infrastructure, and security teamsπΉ Leadership & Collaboration
β’ Proven leadership in managing production support teams
β’ Ability to mentor and coach team members and leads
β’ Strong collaboration skills across cross-functional teams (network, firewall, SecOps, etc.)πΉ Communication & Reporting
β’ Excellent verbal and written communication skills
β’ Ability to prepare and deliver incident reports, RCA documentation, and executive updates
β’ Skilled in status reporting and stakeholder communication during critical incidents