BFSI · Healthcare · Custom AI agents

SupportIQ — Enterprise AI support agent

<200ms cache hits

4-route intent routing

DPDP compliant

System Architecture

User Query → Presidio PII Scrubber → SPLADE Intent Router → Agentic RAG / Escalation → LLM Synthesis → Presidio Output Scrubber → User
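
The flow above scrubs PII on the way in and again on the way out. A minimal sketch of that two-sided pattern follows. It makes two assumptions: the regex patterns are a simplified stand-in for Presidio's recognizers, and `echo_model` is a hypothetical placeholder for the routing, retrieval, and synthesis stages. Neither is the production implementation.

```python
import re
from typing import Callable

# Hypothetical regex patterns standing in for Presidio's recognizers.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\b\d{10}\b"),
}


def scrub(text: str) -> str:
    """Replace each PII match with a typed placeholder such as <EMAIL>."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text


def handle_query(query: str, answer_fn: Callable[[str], str]) -> str:
    """Scrub the input, produce an answer, then scrub the output as well."""
    clean_query = scrub(query)      # the model never sees raw PII
    draft = answer_fn(clean_query)  # routing + retrieval + synthesis
    return scrub(draft)             # catch PII introduced downstream


def echo_model(prompt: str) -> str:
    # Toy stand-in for the middle of the pipeline (assumption).
    return f"You asked: {prompt}. Contact help@example.com for more."


reply = handle_query("My email is a.b@bank.com and phone 9876543210", echo_model)
```

The second `scrub` call matters because retrieved documents or the model itself can reintroduce personal data that was never in the user's query.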

Engineering Deep Dive

Hybrid RAG Approach

Combined SPLADE sparse retrieval with dense embeddings in Qdrant so that domain-specific financial acronyms, which dense embeddings alone tend to miss, still surface through sparse term matching.
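
One common way to merge a sparse and a dense result list is reciprocal rank fusion (RRF). The write-up does not state which fusion method was used, so RRF here is an assumption, and the document IDs and the query are hypothetical. The sketch only shows why a document that ranks well in both lists ends up first.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse ranked lists of document IDs; higher fused score ranks first."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1 / (k + rank) for every document it returns.
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by descending score, breaking ties by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results for the query "NEFT limit".
sparse_hits = ["doc_neft", "doc_rtgs", "doc_fees"]      # term match on the acronym
dense_hits = ["doc_transfers", "doc_neft", "doc_fees"]  # semantic neighbours
fused = reciprocal_rank_fusion([sparse_hits, dense_hits])
```

In this toy case `doc_neft` tops the fused list because it appears near the top of both rankings, while documents found by only one retriever fall behind.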

Sub-process Isolation

Implemented strict LangGraph state machines to ensure that transactional queries could never access conversational endpoints, enforcing security by architecture.
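
The isolation idea can be sketched in plain Python as an explicit allow-list per route. This is not LangGraph code: the route names, handler names, and `dispatch` helper are all hypothetical, chosen only to illustrate how a path that is never declared cannot be taken.

```python
from enum import Enum


class Route(Enum):
    # Hypothetical names for the four routes.
    FAQ = "faq"
    TRANSACTIONAL = "transactional"
    CONVERSATIONAL = "conversational"
    ESCALATION = "escalation"


# Each route lists the only handlers it may reach; anything absent is unreachable.
ALLOWED_HANDLERS: dict[Route, frozenset[str]] = {
    Route.FAQ: frozenset({"retrieve", "synthesize"}),
    Route.TRANSACTIONAL: frozenset({"verify_identity", "lookup_account"}),
    Route.CONVERSATIONAL: frozenset({"chat", "synthesize"}),
    Route.ESCALATION: frozenset({"handoff_to_human"}),
}


class IllegalTransition(Exception):
    """Raised when a route tries to call a handler outside its allow-list."""


def dispatch(route: Route, handler: str) -> str:
    """Return a trace string if the transition is declared, else raise."""
    if handler not in ALLOWED_HANDLERS[route]:
        raise IllegalTransition(f"{route.value} may not call {handler}")
    return f"{route.value}:{handler}"
```

For example, `dispatch(Route.TRANSACTIONAL, "lookup_account")` succeeds, while `dispatch(Route.TRANSACTIONAL, "chat")` raises `IllegalTransition`. In a graph framework the same guarantee would come from simply never declaring that edge, so the restriction lives in the structure rather than in a runtime prompt.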

Semantic Caching

Used Redis semantic caching with a 0.85 cosine-similarity threshold to serve repeated Tier-1 queries straight from cache, achieving <200ms response times on cache hits and cutting LLM API costs.
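
The lookup logic behind a semantic cache can be sketched as follows. It assumes an in-memory list in place of Redis and hand-written two-dimensional vectors in place of real query embeddings; the `SemanticCache` class is hypothetical. Only the 0.85 threshold comes from the description above.

```python
import math
from typing import Optional


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SemanticCache:
    """In-memory stand-in for a Redis-backed semantic cache (assumption)."""

    def __init__(self, threshold: float = 0.85) -> None:
        self.threshold = threshold
        self.entries: list[tuple[list[float], str]] = []

    def put(self, query_vec: list[float], answer: str) -> None:
        self.entries.append((query_vec, answer))

    def get(self, query_vec: list[float]) -> Optional[str]:
        """Return the closest cached answer if it clears the threshold."""
        best_score, best_answer = 0.0, None
        for vec, answer in self.entries:
            score = cosine(query_vec, vec)
            if score > best_score:
                best_score, best_answer = score, answer
        return best_answer if best_score >= self.threshold else None


cache = SemanticCache()
cache.put([1.0, 0.0], "Reset your PIN from the app settings.")
hit = cache.get([0.9, 0.1])   # near-duplicate phrasing, similarity ~0.99
miss = cache.get([0.5, 0.5])  # similarity ~0.71, below 0.85
```

A hit skips retrieval and the LLM call entirely, which is where the latency and cost savings come from. The threshold is a trade-off: lower values serve more queries from cache but risk returning an answer to a different question.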

Business Impact

Automated 60% of Tier-1 support tickets while maintaining DPDP and GDPR compliance through local PII redaction.

Technologies Used

LangGraph
Qdrant
SPLADE RAG
Presidio
Redis
FastAPI
Groq
