What is a Recall capsule
A capsule is a memory asset you intentionally keep — a decision, thought, or key answer pulled out of an AI conversation worth saving long-term. It's distinct from a raw conversation:
Raw conversation Full message stream auto-logged from Playground / Quorum | Recall capsule Curated excerpts you save via 💾 hover / MCP conductor_ask mode=save_memory |
Retention Free forever (privacy policy commitment) | Retention Capacity tiered — Free 5MB / Pro 50MB / Max 300MB |
Searchability Only by conversation list | Searchability Global literal + pgvector semantic search |
Cross-surface recall No (each conversation isolated) | Cross-surface recall One-click pull from Chat / Agent / Quorum as context |
Delete impact Deleting raw conversation does not affect capsules (soft link) | Delete impact Deleting capsule does not affect raw conversation |
Design philosophy: raw conversation is the process; capsule is the crystal. Six months later you don't want to find "what we discussed on March 14"; you want "why we picked Clerk over rolling our own auth" — that's the capsule.
Four ways to save a capsule
Recall provides four entry points, covering manual / automatic / IDE flows:
Chat hover 💾
Hover any assistant message in Playground → 💾 icon appears in the toolbar → right sheet slides in → AI auto-generates title (Haiku 4.5), you tweak content + add tags → save. First save triggers a celebration modal; subsequent saves show a silent toast (with count + jump-to-Recall link).
Quorum hover 💾
Same hover → 💾 → sheet flow on either pane of a Quorum session. Quorum capsules get kind=decision and an auto-applied quorum tag.
MCP conductor_ask mode=save_memory (Claude Desktop / Cursor / Agent)
MCP-connected clients (Claude Desktop / Cursor / your own agent) can call the conductor_ask unified tool (mode=save_memory) directly, no web UI needed. Agents save "conclusion" capsules after task completion; Claude Desktop users just say "save this".
+ New capsule (/dashboard/recall)
Top-right button on the Recall page → empty sheet → write title + content + tags from scratch → save. Suits non-conversation notes (reading excerpts, transient ideas).
Search & organization
The Recall page /dashboard/recall currently supports:
- Semantic search: Voyage-3 + pgvector cosine top-K — synonyms / cross-language / intent matching; auto-falls back to literal search during transient upstream outages (seamless to users)
- Literal search: ts_query full-text on title + content_md, 300ms debounce
- Tag filter: click any row tag chip to filter
- Auto Curator: auto-evaluates conversation segments worth keeping (rule scoring ~ms; gray-zone refinement via Haiku 4.5 on click)
- Chip recommendations: "Related memories" floating above chat input, semantically matched capsules
- Timeline view (Activity-band): density heatbar + top topic chips, toggle at top of Recall page
- Dedup hint: 7-day window cosine ≥ 0.85 → suggest merge before save
- Month grouping: auto-grouped by created_at YYYY-MM, collapsible
- Infinite scroll: 50 per page, IntersectionObserver auto-loads
Roadmap: Constellation (time × semantic 2D UMAP projection with auto-clustering + Haiku-generated cluster labels) — unlocks at ≥ 30 capsules. FIFO auto-overflow, weekly digest email, and cross-project project_id isolation arrive in later versions.
Pricing tier (locked at M.2)
Recall is the only capacity-tiered subscription at Nexevo (Chat / Agent / Quorum themselves are token / task pay-as-you-go, no monthly fee). Three tiers Free / Pro / Max:
| Tier | Price | Capacity | Approx |
|---|---|---|---|
| Free | $0 | 5 MB | ~600 capsules |
| Pro | $10/mo | 50 MB | ~6,000 capsules |
| Max | $40/mo | 300 MB | ~36,000 capsules |
Capacity counts title + content_md + 8KB metadata overhead. Quota exceeded returns HTTP 429 quota_exceeded with upgrade link; per-capsule hard cap is 1MB (infrastructure protection), exceeded returns 413 capsule_too_large.
Admins can override per-user quota at /x_console/users/{user_id}(enterprise POC / support churn fix); set NULL to revert to tier default.
See /pricing.
MCP usage (Claude Desktop / Cursor / Agent)
Recall is exposed via the Nexevo MCP server to any MCP-aware client (Claude Desktop / Cursor / your agent). The server has one unified toolconductor_ask; Recall operations are selected via themode param:
mode=save_memory— store a capsulemode=search_memory— semantic capsule recall
In Claude Desktop / Cursor, add the server config:
json{ "mcpServers": { "nexevo": { "command": "npx", "args": [ "-y", "mcp-remote", "https://api.nexevo.ai/v1/mcp", "--header", "Authorization: Bearer YOUR_NEXEVO_API_KEY" ] } } }
Once enabled, you can tell Claude:
"Save what we just discussed about Clerk; tag: auth, decision"Claude auto-invokes conductor_ask (mode=save_memory) → hits /v1/memory/capsules REST endpoint → stored. Recall is symmetric: mode=search_memory.
Full MCP setup + all modes + Cursor config + FAQ: see/docs/mcp →
REST API reference
All endpoints use Bearer JWT auth with per-user isolation (cross-user access returns 404 to prevent existence leakage).
| Method | Path | Description |
|---|---|---|
| POST | /v1/memory/capsules | Create capsule (429 on quota / 413 on 1MB cap) |
| GET | /v1/memory/capsules?q=&tag=&limit=&cursor= | List + literal search + tag filter + cursor pagination |
| GET | /v1/memory/capsules/{id} | Single capsule detail |
| PATCH | /v1/memory/capsules/{id} | Update title / content_md / tags (429 if resize exceeds quota) |
| DELETE | /v1/memory/capsules/{id} | Soft delete, hard cleanup after 30 days |
| POST | /v1/memory/capsules/delete-all | Bulk soft delete from settings, requires confirm="delete-all" in body |
bash# 创建胶囊 example curl https://api.nexevo.ai/v1/memory/capsules \ -H "Authorization: Bearer $NEXEVO_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "title": "Clerk 选型决策", "content_md": "决定用 Clerk 而非自建...", "kind": "decision", "tags": ["auth", "decision"] }'
FAQ
Is there a per-capsule size limit?
Yes: 1MB per capsule (~250k Chinese chars / ~1M ASCII), exceeded returns 413 capsule_too_large with friendly JSON. Pydantic limit on content_md is 800KB, leaving 200KB headroom for metadata / tags. Trim or split if exceeded.
What happens when I hit the tier quota?
POST/PATCH returns 429 quota_exceeded with upgrade link. Delete old capsules to free space, or upgrade tier. A later release adds FIFO cold storage (oldest unaccessed capsules move to cold tier when full; one-click recover within 30 days).
Can I recover a deleted capsule?
Soft-deleted capsules can be recovered within 30 days by emailing [email protected] (a self-service restore UI ships in a later release). After 30 days, cron hard-deletes; not recoverable.
Are capsules used to train models?
No. Recall capsules, like all user-generated content, follow zero-retention training defaults on paid tiers. See privacy policy for details.
How do I export my capsules?
Settings → Recall tab → Export data, downloads a full JSON archive (account info / capsules / API key metadata, GDPR/PIPL-compliant).
Get started with Recall
Go to Playground, chat with the AI, hover any message and click 💾 — that's it.
Open Playground