Similar Tools

Scry is most honestly understood in relation to the other approaches to the agent context problem — not as the only answer, but as one shape among several. This page is that comparison.

The eight categories below are each classified as peer, partial peer, adjacent, or subordinate against scry’s marker contract, with named systems and the precise place where each one is genuinely strong or genuinely limited. The classification key sits at the top; a summary at the bottom names where the alternatives are strongest and where the marker contract has the clearest edge.

Classification key

Peer: Solves the same problem (curated, durable, queryable knowledge of what already exists in the project) with a different primary mechanism. Someone evaluating tools for this would compare it head-to-head with the marker contract.
Partial Peer: Solves a substantially overlapping problem with a different shape: close enough that the comparison is direct, distinct enough that the categories do not fully collapse.
Adjacent: Solves a related problem (raw retrieval, code navigation, larger context, ambient memory) that overlaps with agent context but is not the same thing. Often complementary.
Subordinate: A technique the marker contract can use as an implementation detail rather than compete with. Calling it a competitor confuses categories.

Scry’s answer is a marker contract with three marker types. @scry.entry declares an artifact and carries the curated fields (id, summary, tags, applies, seeded questions) an indexer pulls into a queryable cache. @scry.anchor turns any code comment or in-file point of interest into a named, queryable location — a primitive for making arbitrary in-file locations findable. @scry.bind is the typed edge — implements, depends_on, supersedes — that makes the graph bidirectional. The agent queries the cache by structured fields, not by similarity. Authoring is deliberate; every marker is part of the file under version control.
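As a rough illustration of the three marker types, here is a minimal in-memory sketch. The class names, field layout, and helper functions are assumptions made for this example; they follow the field names in the paragraph above but are not scry's actual schema or API.

```python
from dataclasses import dataclass

# Hypothetical shapes for the three marker types. Field names follow the
# prose above; the real scry schema may differ.
@dataclass(frozen=True)
class Entry:
    id: str
    summary: str
    tags: tuple = ()
    applies: tuple = ()
    questions: tuple = ()  # the "seeded questions" field

@dataclass(frozen=True)
class Anchor:
    id: str
    file: str
    line: int

@dataclass(frozen=True)
class Bind:
    source: str
    kind: str  # "implements", "depends_on", or "supersedes"
    target: str

def entries_with_tag(entries, tag):
    """Structured lookup: exact tag membership, no similarity scoring."""
    return [e for e in entries if tag in e.tags]

def edges_touching(binds, marker_id):
    """Return (outgoing, incoming) typed edges for one marker id."""
    outgoing = [(b.kind, b.target) for b in binds if b.source == marker_id]
    incoming = [(b.kind, b.source) for b in binds if b.target == marker_id]
    return outgoing, incoming

# Invented example data.
entries = [
    Entry(id="design.auth-flow", summary="Current auth design", tags=("auth", "design")),
    Entry(id="spec.auth.FR3", summary="Token expiry requirement", tags=("auth", "spec")),
]
binds = [
    Bind("src/auth/jwt.py", "implements", "spec.auth.FR3"),
    Bind("design.auth-flow", "depends_on", "spec.auth.FR3"),
]

design_ids = [e.id for e in entries_with_tag(entries, "design")]
outgoing, incoming = edges_touching(binds, "spec.auth.FR3")
```

The point of the sketch is that both lookups are exact matches on declared fields, and that a single stored edge can be read from either end.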

01 RAG and vector databases

Subordinate

Justification: RAG returns nearest-neighbour chunks by embedding similarity; the marker contract can use embeddings as one ranking signal among structured fields. RAG-as-implementation is fine; RAG-as-substitute confuses retrieval with curation.

What it is

Retrieval-Augmented Generation: chunk the corpus, embed each chunk with a model (OpenAI text-embedding-3, Voyage, BGE, etc.), store vectors in an index, and at query time embed the query and return the nearest-neighbour chunks as additional context to the LLM.

Named systems

FAISS (Meta, library), pgvector (Postgres extension), Pinecone, Weaviate, Chroma, Qdrant, Milvus, LanceDB, ElasticSearch’s dense_vector, OpenAI Assistants File Search, LlamaIndex and LangChain retriever abstractions sitting atop any of the above.

How it approaches the problem

Similarity in embedding space is treated as a proxy for relevance. The corpus is not curated; the index is the curator, by way of the embedding model’s learned geometry.

Limitations relative to the marker contract

Similarity is not relevance. Two chunks that sound alike on the same topic may be the canonical design and a deprecated draft, or a spec and an angry comment about the spec. The similarity ranking does not know which is authoritative.
The corpus has no spine. Nothing in the index represents “this is the current truth on X.” That assertion lives outside the system, in the authors’ heads, or not at all.
Embedding-model coupling. Switch models, re-embed everything; embeddings from different models are not comparable. A marker cache is just text + SQLite.
Top-k bias. Retrieval returns k chunks regardless of whether the right answer is the union of all of them or none of them. A marker query can return zero rows truthfully.
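The top-k point can be made concrete with a toy comparison. This is a sketch under stated assumptions: the two-dimensional "embeddings", the `entry` table layout, and the `top_k` helper are all invented for illustration and stand in for a real embedding model and a real marker cache.

```python
import math
import sqlite3

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query_vec, chunks, k):
    """Nearest-neighbour retrieval: returns k chunks whenever k exist."""
    ranked = sorted(chunks, key=lambda c: cosine(query_vec, c[1]), reverse=True)
    return [text for text, _ in ranked[:k]]

# Toy vectors, invented for illustration only.
chunks = [
    ("auth design v2 (current)", [0.9, 0.1]),
    ("auth design v1 (deprecated)", [0.88, 0.12]),
    ("release checklist", [0.1, 0.9]),
]

# Nothing in the corpus is about billing, yet top-k still returns two chunks.
billing_hits = top_k([0.5, 0.5], chunks, k=2)

# A structured query over a hypothetical marker table can return no rows.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE entry (id TEXT PRIMARY KEY, summary TEXT, tag TEXT)")
db.executemany("INSERT INTO entry VALUES (?, ?, ?)", [
    ("design.auth-flow", "Current auth design", "auth"),
    ("ops.release", "Release checklist", "release"),
])
billing_rows = db.execute(
    "SELECT id FROM entry WHERE tag = ?", ("billing",)
).fetchall()
```

Real retrievers often add a similarity threshold to soften this, but the threshold is a tuning knob on a score, not a statement that no authoritative answer exists.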

Why subordinate, not peer

Production RAG against a frozen index is deterministic — the same query returns the same chunks. The honest weakness is not raw non-determinism but the opacity of the ranking and the silent sensitivity to embedding-model upgrades: change the embedding model and the same query returns different chunks without anything in the system saying so. A marker contract can use embeddings as one ranking signal among structured fields. Scry’s own scry__file_fts is full-text; a future tier could embed marker summaries and add nearest-neighbour lookup. RAG is a mechanism the marker contract can absorb, not an alternative shape of the same answer.

02 Code intelligence

Adjacent

Justification: Code intel names what exists and where; the marker contract names what something is and why. Different layers. They compose.

What it is

Tooling that indexes a codebase by symbol — what functions exist, where they’re defined, where they’re called, what types they have — so an editor or search UI can jump to the right place.

Named systems

ctags / Universal Ctags. Text-file index of symbol → file:line. Decades old; still the floor.
LSP — Language Server Protocol. Per-language servers (rust-analyzer, gopls, pyright, typescript-language-server) expose workspaceSymbol, definition, references, hover.
Sourcegraph. Code search across many repos with structural queries, code navigation, and an LSIF/SCIP-based cross-repo index.
Glean (Meta, open source). Schema-driven code knowledge base; used internally at Meta for cross-language code Q&A.
OpenGrok / Hound / livegrep. Repository-scale grep-with-an-index for fast text search.
tree-sitter. Incremental parser producing concrete syntax trees; the substrate beneath modern code-intel.
Stack Graphs (GitHub). Scope-graph-based name resolution designed for cross-repo navigation.

How it approaches the problem

“What exists” is answered at the level of program structure — symbols, definitions, types, references — not the level of intent, decision, or lesson.

Limitations relative to the marker contract

Code only. Designs, lessons, and decisions sit in markdown, YAML, JSONL, or someone’s head. Code intel doesn’t index them.
Mechanical, not intentional. Code intel will happily index a deprecated function alongside the live one. There’s no field that says “this is the authoritative implementation of FR3.”
The ‘why’ is invisible. Knowing a function exists and where it’s called does not tell an agent why it exists, what tradeoff it embodies, or which lesson it implements.

Why adjacent, not peer

An agent often needs both. A marker can bind to a symbol that LSP resolves, and code intel can be the substrate a marker-aware agent uses to follow implements edges into the implementation.

03 Knowledge graphs

Partial Peer

Justification: The closest paradigm match to scry. Scry is, viewed sideways, a knowledge graph whose nodes are in-file YAML markers and whose edges are implements / depends_on / supersedes references. The difference is location: scry’s graph lives in the source, not in a sidecar store.

What it is

Represent the corpus as nodes (entities, files, concepts) and typed edges (depends-on, implements, supersedes) in a graph database; query with a graph language.

Named systems

Neo4j / TigerGraph / ArangoDB. General-purpose graph DBs.
Microsoft GraphRAG. LLM-extracted entity-and-relationship graph layered atop RAG, with community-summarization for global queries; explicitly addresses the “RAG cannot answer global questions” failure mode.
Cognee. Open-source memory engine that builds a knowledge graph and a vector index from ingested text.
Code Property Graphs (CPG). From Yamaguchi et al. and productized as Joern / ShiftLeft; unify AST + CFG + DDG in one graph for code analysis.
Semantic-web stack (RDF / SPARQL / OWL). The original knowledge-graph substrate. Heavyweight; rarely the right shape for an agent index but worth naming for completeness.

How it approaches the problem

Curated or extracted typed relationships between things. Querying becomes graph traversal (“which lessons are bound to specs that this PR’s files implement?”) rather than similarity.

Limitations relative to the marker contract

Authoring overhead. A separate graph store needs a separate authoring discipline. Markers live in the file they describe; graph nodes live in the database. Drift between source and graph is a constant maintenance tax.
Query-time vs write-time materialization. GraphRAG and Cognee infer the graph at query time from the corpus, so every read re-invokes the LLM’s hallucination surface and edges can drift run to run. Scry’s edges (implements / depends_on) are materialized at write time as committed in-file markers. An agent typically authors the marker — the hallucination surface is real — but it is paid once, on a reviewable, correctable, supersede-able artifact, and the graph is stable across reads thereafter.
No artifact-level mark. A node in Neo4j is not visible from the file. An agent reading the file does not know the node exists unless it queries.
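The traversal question quoted earlier ("which lessons are bound to specs that this PR’s files implement?") can be sketched as a two-hop join once edges are materialized as rows. The `bind` table, its columns, the `lesson.` id prefix, and the example ids are assumptions for this sketch, not scry's actual cache schema.

```python
import sqlite3

db = sqlite3.connect(":memory:")
# Hypothetical edge table: one row per committed bind marker.
db.execute("CREATE TABLE bind (source TEXT, kind TEXT, target TEXT)")
db.executemany("INSERT INTO bind VALUES (?, ?, ?)", [
    ("src/auth/jwt.py", "implements", "spec.auth.FR3"),
    ("src/billing/invoice.py", "implements", "spec.billing.FR1"),
    ("lesson.token-expiry", "depends_on", "spec.auth.FR3"),
    ("lesson.rounding", "depends_on", "spec.billing.FR1"),
])

def lessons_for_files(db, files):
    """Files -> specs they implement -> lessons bound to those specs."""
    files = list(files)
    if not files:
        return []
    placeholders = ",".join("?" for _ in files)
    sql = f"""
        SELECT DISTINCT lesson.source
        FROM bind AS impl
        JOIN bind AS lesson ON lesson.target = impl.target
        WHERE impl.kind = 'implements'
          AND impl.source IN ({placeholders})
          AND lesson.source LIKE 'lesson.%'
        ORDER BY lesson.source
    """
    return [row[0] for row in db.execute(sql, files)]

pr_lessons = lessons_for_files(db, ["src/auth/jwt.py"])
```

Because the rows come from committed markers, running the same query twice against the same commit yields the same edges; nothing is re-inferred at read time.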

Honest comparison point

Joern’s CPG and GraphRAG are genuinely doing similar work for different problems. GraphRAG is the most direct prior art for “structured graph above raw retrieval, queryable by an agent.” Scry’s bet against it is the location bet — in-source vs sidecar — not a claim that structured curation is novel.

04 Agent-memory frameworks

Peer

Justification: The projects most likely to be named in the same sentence as scry. The line worth drawing precisely: agent-memory frameworks are the right tool for memory of interaction; the marker contract is the right tool for memory of artifact.

What it is

Libraries and services that give an LLM agent durable, structured memory across sessions — facts, preferences, prior decisions — separate from the model’s context window.

Named systems

MemGPT → Letta. Originally MemGPT (Berkeley, 2023); now Letta. Models a memory hierarchy (core / archival / recall) and exposes tools the agent calls to read and write each tier. “Memory blocks” are first-class typed objects; the agent edits its own block contents as it learns.
mem0. Open-source memory layer. Extracts facts from conversation turns, classifies them, stores in a vector+graph hybrid, retrieves by relevance plus recency. Used by Cursor, Lindy, and others for cross-session user memory.
Zep. Commercial agent memory service. Builds a temporal knowledge graph from conversation history; supports point-in-time queries (“what did the user believe two weeks ago”). Strong on user-fact memory for chat agents.
LangGraph / LangChain memory. Framework-provided memory abstractions: ConversationBufferMemory, ConversationSummaryMemory, persisted via Redis / SQLite / Postgres. Lighter-weight than Letta; bring-your-own discipline.
CrewAI / AutoGen memory. Multi-agent frameworks with per-agent and shared memory primitives; mostly thin wrappers around the above.
Cognee (memory mode). Same project as the graph entry above; explicitly positioned as agent memory.
OpenAI Assistants Memory and ChatGPT memory. Closed, managed; the consumer-grade reference point.

How they approach the problem

Treat memory as the agent’s self-managed substrate. The agent reads and writes via tools. Storage is typically a vector store, sometimes a graph, sometimes a JSON document the framework hands back on each turn.

Limitations relative to the marker contract

Memory is about the conversation, not the artifact. MemGPT, mem0, and Zep are at their best storing “the user is on a Mac” or “we decided last week to use Postgres.” They are not where you store “this spec FR3 is implemented by this function.” That binding lives in the codebase.
Storage outside the source. A Zep graph or a Letta block is not in the repo. Cloning the project does not clone the memory. A new contributor reads the code without the memory; an agent on a fresh machine starts blind.
Inferred at consolidation vs committed at write. mem0 and Zep extract facts via an LLM during background consolidation passes; the same conversation can yield a different fact graph run to run. A scry marker is also typically agent-authored, so the LLM noise floor is real — but it is paid once into a committed in-file artifact, reviewable in the same diff as the code it describes, correctable by ordinary edits, and supersede-able by a later marker. The hallucination surface is materialized once and frozen, not re-inferred on every read.
Schema drift. Most agent-memory frameworks evolve schema as the product evolves. Scry markers are versioned with the spec and stable under the same git discipline as the source.

Where mem0 and scry overlap most directly

Project-scoped fact recall. mem0’s positioning increasingly extends past chat into “memory for coding agents.” That is the line scry sits on. The distinguishing question becomes: does the memory live in the source tree (scry), or in a sidecar service (mem0)? A mature agent could use both, for different things.

05 Long-context approaches

Adjacent

Justification: Long context is the substrate inside which retrieval and curation operate. Scry decides what to include; the context window decides what fits. They compose.

What it is

Skip retrieval; put everything in the prompt. Rely on model context windows large enough to hold the relevant slice of the codebase or document set.

Named systems

Gemini 1.5 / 2.0 / 2.5. 1M – 2M token windows; Google’s long-context flagship.
Claude 3.7 / 4 family. 200k standard, 1M tier on enterprise for Sonnet/Opus.
GPT-4.1 / o1 / o3. 128k – 1M depending on tier.
Architectural advances feeding this. FlashAttention, Ring Attention, sliding-window attention, attention sinks (StreamingLLM), state-space models (Mamba), grouped-query attention.

How it approaches the problem

Curation is the model’s job at inference time. Stuff the window; let attention sort it out.

Limitations relative to the marker contract

“Lost in the Middle” (Liu et al., 2023). Recall is uneven across long contexts; middle-of-window content is systematically underweighted. The effect is real and reproducible.
Cost scales with tokens. A 1M-token call is cents-to-dollars per invocation. A scry query is a local cache lookup that takes microseconds and carries no per-token charge.
Latency. Long-context calls are slow even when affordable.
It is not memory. Each call rebuilds context from scratch. Nothing learned in one call persists into the next.

06 Hand-maintained memory files

Partial Peer

Justification: The marker contract is this pattern pulled apart into one marker per artifact, with a schema. Many CLAUDE.md files are scry markers waiting to be extracted.

What it is

Markdown files at known paths that agents are instructed to read at session start. The simplest possible solution; massively deployed.

Named systems

CLAUDE.md (Anthropic Claude Code convention).
AGENTS.md (emerging cross-tool convention; Cursor, Cline, others).
.cursorrules / Cursor rules files.
Cline rules / .clinerules.
GitHub Copilot custom instructions.
Aider’s CONVENTIONS.md.
Any project’s README, ARCHITECTURE.md, DESIGN.md read by the agent on demand.

How it approaches the problem

Pre-load fixed context. Curation is manual; format is freeform prose. Newer conventions support selective imports and hierarchical loading. This is a genuinely viable selection function, albeit a manually curated one.

Limitations relative to the marker contract

Doesn’t scale past one file. Once the memory file grows past a few thousand tokens, agents stop reading it carefully — or they read it on every turn and burn context for nothing.
No retrieval surface beyond imports. Selective imports help, but inside any included file there is no query — only “the whole file or nothing.”
No graph. Cannot express “this lesson supersedes that lesson” except in prose, which the agent must re-parse each time.
Encourages drift. The file is everywhere edited and nowhere reviewed against the code it claims to describe.

07 Grep / ripgrep / full-text search

Adjacent

Justification: Grep is the universal floor; the marker contract is the curated ceiling. Scry’s own scry_grep is body FTS underneath, with marker FTS above. They compose.

What it is

Search the source tree by literal or regex pattern. Fast; ubiquitous; the floor every agent already has.

Named systems

GNU grep, ripgrep (rg), ag (the silver searcher), ack, OpenGrok, livegrep, Sourcegraph’s text search, SQLite FTS5, ElasticSearch BM25.

How it approaches the problem

No curation; the corpus is the index. Find by token, not by meaning.

Limitations relative to the marker contract

No ranking by authority. A grep for auth returns every file that contains the word, equally.
No structured fields. Cannot ask “designs with weight ≥ 0.7 in scope:auth.”
No relationships. Cannot follow implements from spec to code.
Misses synonyms. A search for JWT will not find a doc that only says “bearer token.”
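Two of these limitations, the synonym miss and the missing structured fields, can be shown side by side. This is a minimal sketch: the `grep` helper, the two documents, and the `entry` table with `kind`, `scope`, and `weight` columns are assumptions for illustration, not scry's real schema.

```python
import re
import sqlite3

# Invented corpus: the current design never uses the word "JWT".
docs = {
    "docs/auth-design.md": "Clients present a bearer token on every request.",
    "docs/old-auth.md": "Deprecated: JWT handling in the legacy gateway.",
}

def grep(pattern, docs):
    """Token-level search: every file whose body matches, unranked."""
    rx = re.compile(pattern)
    return sorted(path for path, body in docs.items() if rx.search(body))

jwt_hits = grep(r"JWT", docs)  # finds only the deprecated document

# Hypothetical marker cache with structured fields.
db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE entry (id TEXT, kind TEXT, scope TEXT, weight REAL, path TEXT)"
)
db.executemany("INSERT INTO entry VALUES (?, ?, ?, ?, ?)", [
    ("design.auth-flow", "design", "auth", 0.9, "docs/auth-design.md"),
    ("design.auth-legacy", "design", "auth", 0.2, "docs/old-auth.md"),
])

# "Designs with weight >= 0.7 in scope:auth" as a structured query.
strong_auth_designs = [row[0] for row in db.execute(
    "SELECT path FROM entry "
    "WHERE kind = 'design' AND scope = 'auth' AND weight >= 0.7 "
    "ORDER BY weight DESC"
)]
```

The grep result is correct for what was asked; it simply cannot know which document is authoritative, while the structured query relies entirely on someone having authored the weight and scope.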

08 Task graphs for coding agents

Adjacent

Justification: Same audience (long-horizon coding agents) and same marketing language (“persistent, structured memory for coding agents”), but a different layer. A task graph tracks what needs doing; the marker contract indexes what already exists. They compose; they do not substitute.

What it is

A dependency-aware issue tracker built for AI agents rather than humans. Tasks are nodes in a graph with typed edges (blocks, relates_to, duplicates, supersedes); ready-task detection falls out of the graph. Storage is a versioned SQL database so multiple agents on multiple branches can write concurrently without merge collisions.

Named systems

Beads (bd). Dolt-backed (the version-controlled SQL database) in embedded or server mode; hash-based task IDs (bd-a3f8.1.1) sidestep the merge-collision problem that hits sequential IDs in multi-writer workflows. Targets Claude Code, Copilot CLI, Factory.ai and similar agents.

How it approaches the problem

Treat agent context as the set of pending tasks and their dependencies. The agent queries the graph for the next ready task, marks progress, threads messages against tasks, and lets closed tasks compact away as “memory decay.” State the agent maintains about its own work is durable and queryable across sessions.

Limitations relative to the marker contract

Different object. Beads tracks tasks; scry indexes artifacts. A beads node is “add JWT auth, blocked on user-model refactor.” A scry node is “the JWT-auth design doc, summary X, tags Y, implemented by file Z.” Neither answers the other’s question.
Sidecar storage. Tasks live in .beads/embeddeddolt/ (or an external Dolt server), not in the source files. Cloning the project clones the directory; understanding why the code looks the way it does still requires the artifact layer.
No artifact-level mark. A file does not know it is referenced by a beads task. Discovery is via the task index, not the source.

Why adjacent, not peer

The honest read is that beads and scry are complementary tools that share an audience. A mature coding-agent setup could plausibly run both: beads as the work queue, scry as the artifact index. The categories overlap in marketing copy and diverge in mechanics.

One convergence worth noting: beads and scry independently arrived at the same answer for stable identity under concurrent multi-agent writes. Beads mints hash-based task IDs (bd-a3f8); scry mints collision-checked hash-suffixed marker IDs (design.auth-flow~abcd1234) via scry_mint. Two tools built independently in the agent-tooling space, converging on the same ID design — observation, not endorsement, but it is a signal the shape of the problem rewards the shape of the answer.
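A collision-checked, hash-suffixed ID of the shape shown above (design.auth-flow~abcd1234) could be minted along these lines. This is a sketch of the general technique only: the `mint_id` function, its parameters, and the choice of SHA-256 with an eight-character suffix are assumptions, not a description of how scry_mint or beads actually work.

```python
import hashlib
import uuid
from typing import Optional, Set

def mint_id(base: str, existing: Set[str], seed: Optional[str] = None) -> str:
    """Return base~<8 hex chars>, retrying until the ID is not in `existing`.

    `seed` makes the result reproducible for testing; by default a random
    value is used so independent writers are unlikely to pick the same suffix.
    """
    if seed is None:
        seed = uuid.uuid4().hex
    attempt = 0
    while True:
        material = f"{base}|{seed}|{attempt}".encode("utf-8")
        suffix = hashlib.sha256(material).hexdigest()[:8]
        candidate = f"{base}~{suffix}"
        if candidate not in existing:
            return candidate
        attempt += 1  # collision: derive a different suffix and try again

first = mint_id("design.auth-flow", set(), seed="agent-a")
second = mint_id("design.auth-flow", {first}, seed="agent-a")
```

Unlike a sequential counter, two writers on separate branches do not need to coordinate to avoid handing out the same ID; the collision check covers the rare case where the suffixes happen to match.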

Where each side wins

Where the alternatives are genuinely strong

Long context raises the floor for how much an agent can hold at once, and that is real progress. It does not remove the curation problem. A project that fits in 200k tokens is not a project with a 200k-token working budget — the conversation itself consumes tokens, every tool call returns tokens, and loading a 25% slice of the project (~50k) before the conversation even begins is routine. Context compaction then discards that slice and the agent must reload. Worse, an agent without a curated index falls back to grepping the tree, crawling directories, and reading entire files to find what it needs — spending tokens on prose it does not need to win the bytes it does. Scry’s advantage holds even with a 1M-token window: each marker carries summary / rationale / applies / weight, and the agent uses those fields to decide which files to read and which to skip — never reading whole files blind. Long context makes scry cheaper to use, not unnecessary.
Agent-memory frameworks (Letta, mem0, Zep) are doing excellent work on the interaction-memory problem and are legitimately winning that category. Their drift into artifact-memory is real and worth tracking.
GraphRAG is the most serious prior art for structured curation above raw retrieval. Scry’s bet against it is the location bet — in-source vs sidecar — not a claim that structured curation is novel.

Where the marker contract has the clearest edge

In-source authorship. No drift between code and metadata because they ship together. Because the markers live in the source files, they are version-controlled like the code they annotate — a marker change shows up in a pull request diff and stays in git history, so it can be reviewed, blamed, and reverted like any other change.
Per-artifact granularity. Not “one memory file per repo” and not “one node per extracted entity” — one marker per artifact, written by the artifact’s author.
Composability with the rest of the stack. Markers are not hostile to RAG, KG, LSP, or long context. They are the curation layer those mechanisms can operate against.

Scry’s own tradeoffs

Stated honestly, because a comparison page that hides them would not be the page it claims to be. The marker contract has three marker types — @scry.entry declares an artifact, @scry.anchor names a queryable point inside a file, and @scry.bind is the typed edge that makes the graph bidirectional — and each carries its own slice of the cost:

Marker discipline is the cost. A well-authored marker — summary, tags, weight — is what lets a file win the curated tier: the precise, ranked surface a future query lands on. An unmarked file, or a file whose marker fields have drifted out of sync with its contents, forfeits that. It loses the curated ranking signal; the marker no longer speaks for the file, so the file no longer competes where it should. This degradation is silent — it surfaces only to the agent that later queries the system and gets a worse answer than the corpus could have given.

What that doesn’t mean is that the file vanishes. Whether an unmarked file remains findable at all is an implementation choice, not a guarantee of scry-spec. An implementation that does full document indexing — indexing each file’s whole body, not only its marker — keeps the unmarked file present and full-text searchable; it simply competes on raw body text instead of on curated summary, tags, and weight. The reference scry implementation does exactly this: scry_grep runs full-text search over every indexed file body (scry__file_fts), so an unmarked file is still reachable, just uncurated. An implementation that indexed markers alone would not offer even that floor. The spec defines the curated tier; the fallback is something an implementation may choose to provide.
Upfront authoring work. The marker contract trades authoring effort at write time for cheap, deterministic recall at read time. Entries are the bulk of the work; anchors turn any code comment or in-file point of interest into a named, queryable location a scry query can reach — a primitive for making arbitrary in-file locations findable. Worth it when the same knowledge is recalled across many sessions; not worth it for one-off or fast-changing state.
Interaction memory is a kind convention, not a built-in. Scry is not limited to artifact memory. With a kind convention — lessons, journal, memory, notes, preferences, conversations, internal, or whatever taxonomy fits the project — marked files of those kinds are indexed alongside designs and specs, and the agent queries them the same way. Letta, mem0, and Zep ship a managed interaction-memory product out of the box; scry covers the same ground if the project adopts a convention for it. A mature agent can use both, for different reasons.
SQL is scry’s answer, not the spec’s mandate. A different implementation could index the same marker corpus into embeddings and serve recall by similarity; RAG, in that sense, is a possible implementation of the marker contract rather than a peer alternative to it.
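The curated tier and the full-text fallback described under "Marker discipline is the cost" can be sketched as a two-tier lookup. This is an illustration under assumptions: the table names, columns, and `search` function are invented, and plain `LIKE` matching stands in for the full-text index (scry__file_fts) that the reference implementation is said to use.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
    CREATE TABLE file_body (path TEXT PRIMARY KEY, body TEXT);
    CREATE TABLE entry (id TEXT PRIMARY KEY, path TEXT, summary TEXT, weight REAL);
""")
db.executemany("INSERT INTO file_body VALUES (?, ?)", [
    ("docs/auth-design.md", "Token rotation design for the gateway."),
    ("notes/scratch.md", "Random thoughts on token rotation."),  # unmarked file
])
db.execute(
    "INSERT INTO entry VALUES (?, ?, ?, ?)",
    ("design.auth-flow", "docs/auth-design.md", "Token rotation design", 0.9),
)

def search(db, term):
    """Curated hits first (ranked by weight), then uncurated body hits."""
    like = f"%{term}%"
    curated = db.execute(
        "SELECT path FROM entry WHERE summary LIKE ? ORDER BY weight DESC",
        (like,),
    ).fetchall()
    results = [(path, "curated") for (path,) in curated]
    seen = {path for path, _ in results}
    body = db.execute(
        "SELECT path FROM file_body WHERE body LIKE ? ORDER BY path", (like,)
    ).fetchall()
    # Unmarked files stay reachable, but only through the body tier.
    results += [(path, "body") for (path,) in body if path not in seen]
    return results

hits = search(db, "token rotation")
```

Dropping the second query gives the markers-only implementation the text mentions, in which the unmarked scratch file would not be returned at all.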
