As an AI Solution Architect, you will be the technical authority responsible for designing, architecting, and guiding the delivery of Agentic AI systems within the Autonomous Networks Programme. You will work closely with the Project Director, Network SMEs, OSS engineers, NOC operations teams, and the Central AI Infrastructure Solution Architect to translate network operations challenges into production-grade AI agent solutions that are safe, explainable, trusted, and operationally effective. This is a hands-on role requiring both strong architecture leadership and practical engineering capability, including prototyping, coding, system integration, and technical review.
 
What will you do?
Agent Architecture & Design
β’ Lead the end-to-end architecture of Agentic AI systems, covering data ingestion, reasoning, decision-making, action execution, and audit logging.
β’ Define agent scope, decision boundaries, tool specifications, and autonomy levels in alignment with programme stakeholders.
β’ Design multi-agent orchestration capabilities such as event routing, resource locking, inter-agent handoff contracts, prompt caching, and shared state management.
β’ Architect RAG pipelines for knowledge and diagnosis agents, including corpus design, chunking strategy, embedding model selection, retrieval design, and evaluation methodology.
β’ Design agentic workflows, evaluation patterns, hybrid search, reranking strategies, benchmarking approaches, and golden datasets.
β’ Develop prompt templates, output schemas, hallucination guardrails, and confidence scoring mechanisms for LLM-based agents.
β’ Produce and own technical design artefacts including architecture decision records (ADRs), sequence diagrams, data flow diagrams, and agent logic flowcharts.OSS Integration & Data Engineering
β’ Adopt and apply Model Context Protocol (MCP) and Agent-to-Agent protocol patterns as governed extension mechanisms for enterprise AI agents.
β’ Design and build data pipeline layers that normalise multi-vendor, multi-format network telemetry into usable schemas for AI reasoning.
β’ Work with the Network Solution Architect to define integration patterns between AI agents and OSS platforms such as NMS, EMS, ServiceNow ITSM, inventory, and provisioning systems through REST APIs.
β’ Design event streaming architecture, such as Kafka or equivalent, to support near real-time alarm and telemetry processing aligned to NOC latency requirements.
β’ Collaborate with Data Engineering teams to improve the quality of alarm, ticket, and inventory data that supports agent accuracy and reliability.Safety, Guardrails & Risk Architecture
β’ Design and implement guardrails for all agents, including hard limits, human confirmation flows, blast radius controls, and escalation logic.
β’ Lead blast radius assessments with Network SMEs and Risk teams, documenting worst-case impacts and defining mitigating controls.
β’ Design and validate rollback procedures for agent-initiated actions, including KPI-based automated rollback triggers.
β’ Ensure solution architectures comply with IMDA requirements, AIVerify expectations, InfoSec policies, and relevant change governance processes.Observability & Evaluation
β’ Design decision audit trail schemas to ensure all agent actions and decisions are traceable, explainable, and reviewable.
β’ Define continuous evaluation approaches for production agent traces using LLM-as-judge and human review samples.
β’ Build or guide the development of Agent Operations Dashboards for NOC teams, showing agent health, decision activity, accuracy, and exceptions.
β’ Define agent evaluation frameworks across factual accuracy, reasoning quality, retrieval performance, planning quality, tool-call accuracy, and confidence thresholds.
β’ Design drift detection mechanisms to identify output quality degradation before operational impact occurs.Delivery & Stakeholder Collaboration
β’ Partner with the Project Director on design planning, sprint planning, design governance, and build readiness.
β’ Work with Network SME owners to validate architectural decisions against real operational workflows and constraints.
β’ Lead technical walkthroughs with NOC management and Central AI Infrastructure leadership to build confidence in the proposed architecture.
β’ Provide mentorship and technical guidance to AI/ML engineers and integration engineers through code reviews, architecture reviews, and design support.
β’ Translate technical decisions into clear, concise, non-technical summaries for programme leadership and steering committees.