Isokan Logo
AI Platforms Engineer

Ade Daramola.
Building systems
that think and heal.

I design and ship production-grade AI-native infrastructure — autonomous Kubernetes platforms, agentic RAG pipelines, and multi-model orchestration systems that act with confidence and fail gracefully.


About

I'm a senior AI/Cloud engineer specialising in production-grade agentic systems, cloud-native infrastructure, and multi-agent orchestration. My focus is engineering software that can reason, decide, and act with minimal human intervention.

Every platform I build is infrastructure-as-code first — Terraform-managed, properly observed, and documented from day one. Not proof-of-concepts. Deployable, maintainable systems.

My work sits at the intersection of AWS cloud architecture and modern AI frameworks: LangGraph, confidence-gated decision engines, and retrieval-augmented generation at production scale.

"Every remediation is a Git commit. The cluster never changes outside a reviewed pull request or a high-confidence auto-apply."

Selected Work

Three platforms.
One through-line.

01
AIOps Platform

GitOps Sentinel

Autonomous Kubernetes remediation platform — intercepts anomalies, reasons through root cause via a multi-agent pipeline, and resolves incidents without human involvement unless confidence demands it.

Confidence-gated routing: routine incidents resolve automatically; ambiguous or high-risk situations escalate before anything touches the cluster.
Every fix is a Git commit reviewed by Argo CD — the cluster never changes outside a controlled write path; failed remediations trigger an automatic revert.
Multi-agent reasoning covers classification, root cause analysis, action planning, and confidence scoring as discrete, independently testable stages.
PythonAWS LambdaStep FunctionsTerraform EventBridgeDynamoDBArgo CDKubernetes / EKSPrometheusAmazon Bedrock
02
Agentic RAG System

MedQuery

Medical Q&A system that routes queries across a PubMed/FDA corpus and live web search, with a safety guardrail and iterative relevance checking before streaming an answer token-by-token.

Queries are automatically routed to the most appropriate source — structured medical corpus, FDA drug data, or live web — with up to 3 relevance checks before generating a response.
Safety guardrail classifies and rejects non-medical and high-risk queries before any retrieval runs, keeping the system within scope.
Answers stream token-by-token with source-quality labels so users always know whether a response comes from verified data or web search.
PythonLangGraphFastAPITerraform PostgreSQL / pgvectorOpenAI GPTReact · SSEECS FargateCloudFrontDocker
03
LLM Orchestration

Multi-LLM Platform

Inference gateway that abstracts OpenAI, Anthropic, and AWS Bedrock behind a single API — with intelligent routing, cost tracking, and automatic failover.

Routes each request based on complexity, cost, and caller latency SLA — directing simple requests to cheaper models and tight-SLA requests to faster tiers automatically.
Health-based failover skips unhealthy providers in real time; a semantic cache layer returns results for similar prompts without hitting an LLM at all.
Every request emits structured cost telemetry — tokens used, model selected, and cache savings — broken down by provider and caller, giving teams real-time LLM spend visibility.
PythonFastAPILangChain / LangGraph OpenAIAnthropicAWS BedrockDockerTerraform

Technical Expertise

Cloud-native.
AI-first.

AI & Agentic Systems
  • Multi-agent pipeline orchestration
  • LangGraph stateful workflows
  • Retrieval-Augmented Generation
  • Confidence-gated decision engines
  • LLM routing & provider abstraction
  • Semantic vector search (pgvector)
Cloud Infrastructure
  • AWS Lambda, Step Functions, EventBridge
  • ECS Fargate, RDS, ALB, CloudFront
  • Amazon Bedrock & SageMaker
  • DynamoDB, S3, Secrets Manager
  • Terraform (IaC)
  • IAM scoped roles & X-Ray tracing
Kubernetes & GitOps
  • EKS cluster management
  • Argo CD GitOps controller
  • OPA Gatekeeper policy enforcement
  • Prometheus + Grafana observability
  • Helm chart deployment
  • Alertmanager integration
Backend & APIs
  • FastAPI — production-grade
  • SSE streaming responses
  • PostgreSQL + pgvector
  • Docker & docker-compose
  • Rate limiting & API key auth
Languages
  • Python (primary)
  • HCL / Terraform
  • JavaScript / React
  • Bash / Makefile
  • Open Policy Agent (Rego)
  • YAML / JSON
Platform Engineering
  • Test-driven development (pytest)
  • Infrastructure as Code
  • Event-driven architecture
  • CI/CD — GitHub Actions
  • Security-first IAM design
  • Cost-aware architecture

Let's build something
genuinely ambitious.

[email protected]  ·  isokan.dev