Role Name: AI / GenAI Solutions Engineer
Location: Egypt,Β Pakistan (Remote)
Work Week: Sunday β Thursday
Working Hours: 9:00 AM β 6:00 PM (Saudi Arabia Standard Time)
Overview:
Soum is building an AI native layer across our C2C marketplace, from customer conversations to automated order handling, personalised discovery, recommendations, fraud signals, and seller tooling. We're looking for a Senior GenAI Engineer to design and ship production-grade AI systems that touch millions of users across the buying and selling journey.
You'll work end-to-end: architecting LLM-powered services on our Python backend, integrating them into product surfaces on our React frontend, and partnering with teams across the company to find where AI genuinely moves the needle. This is a senior builder's role with high autonomy, broad scope, and real impact on the business.
Β
What Youβll Do
β’
Architect production GenAI systems across multiple domains, including conversational agents, automated order and dispute workflows, personalized discovery and recommendations, content generation, search relevance, and emerging use cases.
β’
Own features end-to-end, from problem framing and model selection through backend services (FastAPI, Python), frontend integration (React, TypeScript), evaluation, deployment, and monitoring.
β’
Design agentic workflows with tool calling, multi-step reasoning, retrieval augmented generation, and integrations with internal APIs, third-party SaaS, and event-driven systems.
β’
Build the retrieval and embeddings stack, including chunking strategies, embedding model selection, vector indexes, hybrid search, reranking, and retrieval evaluation pipelines.
β’
Make it reliable and cost-efficient through streaming, prompt caching, latency budgets, token cost optimization, observability for LLM calls, and graceful fallback when models or upstreams misbehave.
β’
Establish evaluation rigor with offline and online evals covering response quality, tool call correctness, hallucination rate, retrieval precision, and business KPIs.
β’
Drive experimentation and research by evaluating new models, frameworks, and agent patterns, running focused experiments, and bringing what works into production.
β’
Mentor and raise the bar for engineers across the team on AI and ML best practices, prompt engineering, and production readiness.
β’
Partner cross-functionally with Product, Engineering, Data, Ops, and CX to identify high-leverage AI opportunities and ship them.
Where You'll Have Impact
A non-exhaustive list of areas we are actively building in or want to:
β’
Conversational AI for customer support, dispute resolution, and seller assistance
β’
Automated order handling, escalation routing, and workflow orchestration
β’
Personalized discovery, recommendations, and search relevance
β’
Listing quality, including auto-generated titles, descriptions, categorization, and image understanding
β’
Trust and safety, including fraud signals, anomaly detection, and content moderation
β’
Internal agent tooling for ops and CX teams
Qualifications
Required
β’ 5+ years building production software, with at least 2 years shipping LLM and ML-powered features at scale.
β’ Strong Python for backend services, scripting, data pipelines, and ML tooling.
β’ Working ability in TypeScript and React for integrating AI features directly into product surfaces.
β’ Production experience with major LLM providers (Gemini, Claude, GPT) covering tool and function calling, structured outputs, streaming, prompt caching, and cost control.
β’ Deep understanding of Retrieval Augmented Generation (RAG): document ingestion, chunking, embedding generation, vector databases (pgvector, Pinecone, Weaviate, or similar), hybrid retrieval, and reranking.
β’ Solid grounding in embeddings and vector search: dense vs. sparse representations, similarity metrics, indexing strategies (HNSW, IVF), and dimensionality tradeoffs.
β’ Strong ML and NLP fundamentals: transformer architectures, tokenization, fine-tuning vs. prompting tradeoffs, classification, ranking, and evaluation methodology.
β’ Experience with scripting and automation for data preparation, model evaluation harnesses, and offline analysis.
β’ Comfortable with relational databases (PostgreSQL), caching layers (Redis), REST, SSE, WebSockets, and event-driven architectures.
β’ Production experience with observability, A/B testing, and rolling out model changes safely.
Nice to Have
β’ Experience with recommendation systems, learning to rank, or search relevance at scale.
β’ Marketplace or C2C background covering buyer and seller dynamics, disputes, fraud, and payouts.
β’ Multimodal model experience, including vision and image understanding for listings.
β’ Arabic NLP or bilingual product experience.
β’ Experience with agent frameworks (LangGraph, custom orchestrators) and the judgment to know when to use them.
β’ Fine-tuning, LoRA, distillation, or hosting open weight models in production.
β’ Open source contributions to LLM tooling, eval frameworks, or retrieval libraries.
What We Care About
β’
Ship over the architect. Lean code, no premature abstractions, no half-finished frameworks.
β’
Measurement-driven development. Features ship with evals and metrics, not vibes.
β’
Ownership. From idea to deployment to monitoring the first real users.
β’
Curiosity and range. This role spans many problem domains, and we want someone energized by that.