Detailed Comparison
A comprehensive breakdown of how SourcePrep's architecture compares to other AI coding tools and context engines.
Feature Comparison | SourcePrep Continuous Graph RAG | |
|---|---|---|
Architecture | Graph Construction How the codebase is parsed and understood | Native Rust Engine(Tree-sitter) |
Search Architecture How relevant context is found | Local ONNX Embeddings+ BM25 | |
Context Assembly | Context Delivery What the AI actually receives | LOD CapsuleContext |
Token Efficiency Minimizing distractor tokens | Dual-EngineCompression (3–20x) | |
Epistemology & Trust | LLM Augmentation How AI deepens the knowledge graph | Flexible AI Pipeline(Cloud BYOK or Local) |
Continuous Enrichment Refining understanding over time | Trace EpistemologyPipeline | |
Drift Detection Knowing when agent assumptions are stale | Automated viaWatcher & Graph | |
Inspectability Seeing what the AI sees | Dedicated DesktopHealth Dashboard | |
Control & Customization | Scope Management Controlling what the AI can see | Visual Folder-Treewith Include/Exclude |
Edge & Module Weighting Prioritizing what matters most in the graph | Configurable Edge Weights+ Module Importance | |
Privacy & Local-First Where your code data lives | 100% Local: Rust + ONNXZero Cloud |
Architecture
SourcePrep's Rust-native parser uses Tree-sitter to build a complete structural trace graph offline. Unlike tools that depend on an active IDE or LSP server, SourcePrep works headlessly — in CI/CD, on servers, or anywhere Rust runs. The parser handles 15+ languages and produces call-graph, import, and containment edges in a single pass.
GitNexusNode.js
/WASM
SerenaActive
LSP Server
Understand-AnythingLLM Multi-Agent
Pipeline
bloopRust
(Tree-sitter)
GrepaiText Index
EmpiricaGit Notes
/No Graph
VexpSQLite
/Tree-sitter
SourcePrep combines local ONNX embeddings (nomic-embed-text-v1.5) with BM25 keyword search in a hybrid architecture. Semantic search handles conceptual queries ('find the authentication flow') while BM25 catches exact identifiers ('handleLogin'). Everything runs 100% locally with no cloud dependency — embedding latency is ~7ms per query. Intent-aware routing automatically picks the best strategy per query.
GitNexusKuzuDB
/FTS
SerenaLSP
Queries
Understand-AnythingFuzzy +
Semantic
bloopLocal Qdrant
/Vector
GrepaiLocal
Semantic Index
EmpiricaGit Commit
Hashes
VexpFTS5 + TF-IDF
(No Embeddings)
Context Assembly
SourcePrep delivers LOD capsule context: full source for focal nodes, signatures+docstrings for adjacent nodes, and module summaries for distant context. This gives the AI a natural zoom-in/zoom-out perspective that mirrors how human developers understand code. The result is rich, structured context that maximizes signal per token.
GitNexusPrecomputed
Raw Graph Data
SerenaRaw
Symbol Matches
Understand-AnythingJSON Graph
+ Slash Commands
bloopRaw
Snippets
GrepaiRaw
File Chunks
EmpiricaReasoning
Checkpoints
VexpCapsule
Context
SourcePrep achieves 3–20x token compression through a dual-engine approach: LOD-based structural compression for code (signatures instead of full bodies) and LLMLingua-2 token pruning for documentation (~2.4×). The compression level adapts dynamically per query and per client tier — Claude/Gemini get more full-source files, local models get tighter compression to fit constrained windows.
GitNexusHigh
(via Precomputation)
SerenaLow
(Full Symbols)
Understand-AnythingPlain-English
Node Summaries
bloopLow
(Full snippets)
GrepaiLow
(Sends full chunks)
EmpiricaLow
(State Dumps)
VexpHigh
(Signature Only)
Epistemology & Trust
SourcePrep uses local or bring-your-own-key LLMs to continuously augment the structural trace graph with deep semantic understanding. The pipeline generates module summaries, infers cross-module relationships, computes understanding scores, and validates edge correctness — all automatically. This is not simple indexing: it's a multi-stage epistemic enrichment process where each pass deepens the AI's comprehension. You can run it with a local Ollama model for zero-cloud privacy, or use your own OpenAI/Anthropic key for maximum quality.
GitNexusNone
(Static Graph)
SerenaNone
(LSP Only)
Understand-AnythingLLM-First
(6 Agents)
bloopNone
(Index Only)
GrepaiNone
(Embeddings Only)
EmpiricaAgent-Driven
LLM Assessment
VexpNone
(Static AST)
SourcePrep's Trace Epistemology Pipeline continuously enriches the knowledge graph: deep analysis generates module summaries, cross-module relationship analysis, and understanding scores. Each pipeline run builds on previous results, and the file watcher triggers incremental re-enrichment when code changes. The result is a knowledge base that gets measurably smarter over time — visible in the dashboard's health scores.
GitNexusStatic
until re-indexed
SerenaNone
Understand-AnythingIncremental
LLM Re-analysis
bloopNone
GrepaiNone
EmpiricaGit-Native
Pre/Postflight
VexpSession
Memory
SourcePrep's file watcher monitors the codebase for changes and automatically marks affected trace nodes, observations, and enrichment data as stale. When a function changes, all observations about that function are flagged. The dashboard shows drift status at a glance with per-node granularity. No manual intervention needed.
GitNexusManual
git-diff checks
SerenaNone
Understand-AnythingDiff Impact
(on demand)
bloopNone
GrepaiNone
EmpiricaMirror
Drift Detection
VexpManual
Observation Staling
SourcePrep's dedicated desktop dashboard lets you visually browse the trace graph, see module health scores, inspect enrichment pipeline status, and fine-tune scope with a folder tree. You can see exactly what context the AI will receive before it receives it. This bird's-eye perspective of your codebase builds trust and gives developers real control over the AI's knowledge.
GitNexusWeb UI
/Terminal
SerenaOpaque
Understand-AnythingInteractive
Dashboard + Demo
bloopDesktop
App
GrepaiTerminal
Only
EmpiricaGit Log
Only
VexpVS Code
Only
Control & Customization
SourcePrep provides a visual folder-tree in the dashboard for precise scope control. Include or exclude entire directories, individual files, or use glob patterns. Changes take effect immediately and the dashboard shows exactly which files are in-scope, how many nodes are indexed, and what percentage of the codebase is covered. This gives developers fine-grained control over the AI's view of the project.
GitNexus.gitignore-style
Patterns
SerenaLSP
Workspace Scope
Understand-AnythingAuto Project
Detection
bloopRepo-Level
Selection
GrepaiCLI
Path Arguments
EmpiricaGit Repo
Scope Only
VexpVS Code
Workspace Scope
SourcePrep assigns edge weights by kind (call, import, containment, inferred, LSP) that affect trace expansion priority. Module importance scores from the enrichment pipeline influence which context gets included first when token budgets are tight. The dashboard exposes these weights, letting developers fine-tune how the graph prioritizes different parts of the codebase — for example, boosting your core business logic over utility helpers.
GitNexusGraph Centrality
Metrics
SerenaNo
Ranking
Understand-AnythingAuto Layer
Grouping
bloopVector Similarity
Only
GrepaiEmbedding Similarity
Only
EmpiricaN/A
VexpGraph Centrality
in Ranking
Everything in SourcePrep runs 100% locally. The Rust parser, ONNX embeddings (nomic-embed-text-v1.5), SQLite storage, and the dashboard all work fully offline. No code ever leaves your machine unless you explicitly configure team sync to your own S3 bucket. The ONNX runtime embeds at ~7ms per query with zero cloud dependencies, zero API keys, and zero data transmission.
GitNexusLocal
(Node.js + WASM option)
SerenaLocal Server
(LLM calls needed)
Understand-AnythingLocal Artifact,
LLM Calls Required
bloopLocal
(Qdrant Instance)
GrepaiPrivacy-First
Local
EmpiricaGit-Native
(LLM calls needed)
VexpLocal
(VS Code Extension)
Companion, not a framework
SourcePrep isn't trying to replace your favorite tools. It's an MCP-native context engine designed to supercharge the AI IDEs and agents you already use.
Cursor
AI IDEConnect SourcePrep via MCP to give Cursor perfect project-wide context without copying files.
Windsurf
AI IDEWindsurf agents use SourcePrep to autonomously navigate the trace graph before editing.
Cline / Roo
VS Code ExtensionStop dropping raw files into context. Give your agent the SourcePrep LOD capsule instead.
Claude Code
CLI AgentSupercharge Anthropic's CLI with blazing-fast local ONNX semantic routing.
