Nexevo.aiNexevo.ai
Back to docs

RecallLong-term AI memory layer

Recall is Nevo's underlying memory layer — distill key decisions, thoughts, and answers from AI conversations into capsules, recallable from any surface (Chat / Agent / Quorum). This doc covers: architecture, 4 ways to save, search & pricing, full MCP / REST API reference, FAQ.

01

What is a Recall capsule

A capsule is a memory asset you intentionally keep — a decision, thought, or key answer pulled out of an AI conversation worth saving long-term. It's distinct from a raw conversation:

Raw conversation
Full message stream auto-logged from Playground / Quorum
Recall capsule
Curated excerpts you save via 💾 hover / MCP conductor_ask mode=save_memory
Retention
Free forever (privacy policy commitment)
Retention
Capacity tiered — Free 5MB / Pro 50MB / Max 300MB
Searchability
Only by conversation list
Searchability
Global literal + pgvector semantic search
Cross-surface recall
No (each conversation isolated)
Cross-surface recall
One-click pull from Chat / Agent / Quorum as context
Delete impact
Deleting raw conversation does not affect capsules (soft link)
Delete impact
Deleting capsule does not affect raw conversation

Design philosophy: raw conversation is the process; capsule is the crystal. Six months later you don't want to find "what we discussed on March 14"; you want "why we picked Clerk over rolling our own auth" — that's the capsule.

02

Four ways to save a capsule

Recall provides four entry points, covering manual / automatic / IDE flows:

A

Chat hover 💾

Hover any assistant message in Playground → 💾 icon appears in the toolbar → right sheet slides in → AI auto-generates title (Haiku 4.5), you tweak content + add tags → save. First save triggers a celebration modal; subsequent saves show a silent toast (with count + jump-to-Recall link).

B

Quorum hover 💾

Same hover → 💾 → sheet flow on either pane of a Quorum session. Quorum capsules get kind=decision and an auto-applied quorum tag.

C

MCP conductor_ask mode=save_memory (Claude Desktop / Cursor / Agent)

MCP-connected clients (Claude Desktop / Cursor / your own agent) can call the conductor_ask unified tool (mode=save_memory) directly, no web UI needed. Agents save "conclusion" capsules after task completion; Claude Desktop users just say "save this".

D

+ New capsule (/dashboard/recall)

Top-right button on the Recall page → empty sheet → write title + content + tags from scratch → save. Suits non-conversation notes (reading excerpts, transient ideas).

📸 Screenshot: Chat message hover showing 💾 + sliding save sheet
03

Search & organization

The Recall page /dashboard/recall currently supports:

  • Semantic search: Voyage-3 + pgvector cosine top-K — synonyms / cross-language / intent matching; auto-falls back to literal search during transient upstream outages (seamless to users)
  • Literal search: ts_query full-text on title + content_md, 300ms debounce
  • Tag filter: click any row tag chip to filter
  • Auto Curator: auto-evaluates conversation segments worth keeping (rule scoring ~ms; gray-zone refinement via Haiku 4.5 on click)
  • Chip recommendations: "Related memories" floating above chat input, semantically matched capsules
  • Timeline view (Activity-band): density heatbar + top topic chips, toggle at top of Recall page
  • Dedup hint: 7-day window cosine ≥ 0.85 → suggest merge before save
  • Month grouping: auto-grouped by created_at YYYY-MM, collapsible
  • Infinite scroll: 50 per page, IntersectionObserver auto-loads

Roadmap: Constellation (time × semantic 2D UMAP projection with auto-clustering + Haiku-generated cluster labels) — unlocks at ≥ 30 capsules. FIFO auto-overflow, weekly digest email, and cross-project project_id isolation arrive in later versions.

📸 Screenshot: Recall list view with month grouping + tag chips
04

Pricing tier (locked at M.2)

Recall is the only capacity-tiered subscription at Nexevo (Chat / Agent / Quorum themselves are token / task pay-as-you-go, no monthly fee). Three tiers Free / Pro / Max:

TierPriceCapacityApprox
Free$05 MB~600 capsules
Pro$10/mo50 MB~6,000 capsules
Max$40/mo300 MB~36,000 capsules

Capacity counts title + content_md + 8KB metadata overhead. Quota exceeded returns HTTP 429 quota_exceeded with upgrade link; per-capsule hard cap is 1MB (infrastructure protection), exceeded returns 413 capsule_too_large.

Admins can override per-user quota at /x_console/users/{user_id}(enterprise POC / support churn fix); set NULL to revert to tier default.

See /pricing.

05

MCP usage (Claude Desktop / Cursor / Agent)

Recall is exposed via the Nexevo MCP server to any MCP-aware client (Claude Desktop / Cursor / your agent). The server has one unified toolconductor_ask; Recall operations are selected via themode param:

  • mode=save_memory — store a capsule
  • mode=search_memory — semantic capsule recall

In Claude Desktop / Cursor, add the server config:

json
{ "mcpServers": { "nexevo": { "command": "npx", "args": [ "-y", "mcp-remote", "https://api.nexevo.ai/v1/mcp", "--header", "Authorization: Bearer YOUR_NEXEVO_API_KEY" ] } } }

Once enabled, you can tell Claude:

"Save what we just discussed about Clerk; tag: auth, decision"

Claude auto-invokes conductor_ask (mode=save_memory) → hits /v1/memory/capsules REST endpoint → stored. Recall is symmetric: mode=search_memory.

Full MCP setup + all modes + Cursor config + FAQ: see/docs/mcp →

06

REST API reference

All endpoints use Bearer JWT auth with per-user isolation (cross-user access returns 404 to prevent existence leakage).

MethodPathDescription
POST/v1/memory/capsulesCreate capsule (429 on quota / 413 on 1MB cap)
GET/v1/memory/capsules?q=&tag=&limit=&cursor=List + literal search + tag filter + cursor pagination
GET/v1/memory/capsules/{id}Single capsule detail
PATCH/v1/memory/capsules/{id}Update title / content_md / tags (429 if resize exceeds quota)
DELETE/v1/memory/capsules/{id}Soft delete, hard cleanup after 30 days
POST/v1/memory/capsules/delete-allBulk soft delete from settings, requires confirm="delete-all" in body
bash
# 创建胶囊 example curl https://api.nexevo.ai/v1/memory/capsules \ -H "Authorization: Bearer $NEXEVO_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "title": "Clerk 选型决策", "content_md": "决定用 Clerk 而非自建...", "kind": "decision", "tags": ["auth", "decision"] }'
07

FAQ

Is there a per-capsule size limit?

Yes: 1MB per capsule (~250k Chinese chars / ~1M ASCII), exceeded returns 413 capsule_too_large with friendly JSON. Pydantic limit on content_md is 800KB, leaving 200KB headroom for metadata / tags. Trim or split if exceeded.

What happens when I hit the tier quota?

POST/PATCH returns 429 quota_exceeded with upgrade link. Delete old capsules to free space, or upgrade tier. A later release adds FIFO cold storage (oldest unaccessed capsules move to cold tier when full; one-click recover within 30 days).

Can I recover a deleted capsule?

Soft-deleted capsules can be recovered within 30 days by emailing [email protected] (a self-service restore UI ships in a later release). After 30 days, cron hard-deletes; not recoverable.

Are capsules used to train models?

No. Recall capsules, like all user-generated content, follow zero-retention training defaults on paid tiers. See privacy policy for details.

How do I export my capsules?

Settings → Recall tab → Export data, downloads a full JSON archive (account info / capsules / API key metadata, GDPR/PIPL-compliant).

Get started with Recall

Go to Playground, chat with the AI, hover any message and click 💾 — that's it.

Open Playground
Recall — Technical Docs | Nexevo | Nexevo.ai