Skip to main content

Modern RAG Architecture

The Aris RAG ecosystem moves beyond simple vector similarity search. We employ a multi-stage retrieval pipeline designed for high-precision context recovery in complex enterprise domains.

Architecture Overview

The system executes a strictly ordered pipeline: Query Rewriting -> Hybrid Retrieval -> Reciprocal Rank Fusion -> Contextual Compression -> Generation.

Retrieval Strategies

We support three primary retrieval modes, configurable via the retrieval_strategy parameter in the retrieve_context tool.

Reciprocal Rank Fusion (RRF)

We strictly use RRF to merge results from different retrieval systems. This creates a “consensus” ranking that is robust to outliers in any single method. RRFscore(d)=rR1k+r(d)RRFscore(d) = \sum_{r \in R} \frac{1}{k + r(d)} Where:
  • dd is the document.
  • RR is the set of rankers (Dense, Sparse, Graph).
  • kk is a constant (typically 60).
  • r(d)r(d) is the rank of document dd in ranker rr.
Do not use weighted averaging for scores from different models (e.g., BM25 scores vs. Cosine Similarity). They are not on the same scale. RRF is scale-invariant.