Modern RAG Architecture
The Aris RAG ecosystem moves beyond simple vector similarity search. We employ a multi-stage retrieval pipeline designed for high-precision context recovery in complex enterprise domains.Architecture Overview
The system executes a strictly ordered pipeline: Query Rewriting -> Hybrid Retrieval -> Reciprocal Rank Fusion -> Contextual Compression -> Generation.
Retrieval Strategies
We support three primary retrieval modes, configurable via theretrieval_strategy parameter in the retrieve_context tool.
- Hybrid Search
- GraphRAG
- Contextual Compression
Best for: General-purpose queries requiring both semantic understanding and exact keyword matching.The system executes two parallel searches:
- Dense Retrieval: Embedding-based search using
text-embedding-3-large(256/512/1024/3072 dimensions). - Sparse Retrieval: BM25 keyword search on tokenized documents.
Reciprocal Rank Fusion (RRF)
We strictly use RRF to merge results from different retrieval systems. This creates a “consensus” ranking that is robust to outliers in any single method. Where:- is the document.
- is the set of rankers (Dense, Sparse, Graph).
- is a constant (typically 60).
- is the rank of document in ranker .
Do not use weighted averaging for scores from different models (e.g., BM25 scores vs. Cosine Similarity). They are not on the same scale. RRF is scale-invariant.