Nexevo.ai

What is a Recall capsule

A capsule is a memory asset you intentionally keep — a decision, thought, or key answer pulled out of an AI conversation worth saving long-term. It's distinct from a raw conversation:

Raw conversation Full message stream auto-logged from Playground / Quorum	Recall capsule Curated excerpts you save via 💾 hover / MCP conductor_ask mode=save_memory
Retention Free forever (privacy policy commitment)	Retention Capacity tiered — Free 5MB / Pro 50MB / Max 300MB
Searchability Only by conversation list	Searchability Global literal + pgvector semantic search
Cross-surface recall No (each conversation isolated)	Cross-surface recall One-click pull from Chat / Agent / Quorum as context
Delete impact Deleting raw conversation does not affect capsules (soft link)	Delete impact Deleting capsule does not affect raw conversation

Design philosophy: raw conversation is the process; capsule is the crystal. Six months later you don't want to find "what we discussed on March 14"; you want "why we picked Clerk over rolling our own auth" — that's the capsule.

Four ways to save a capsule

Recall provides four entry points, covering manual / automatic / IDE flows:

Chat hover 💾

Hover any assistant message in Playground → 💾 icon appears in the toolbar → right sheet slides in → AI auto-generates title (Haiku 4.5), you tweak content + add tags → save. First save triggers a celebration modal; subsequent saves show a silent toast (with count + jump-to-Recall link).

Quorum hover 💾

Same hover → 💾 → sheet flow on either pane of a Quorum session. Quorum capsules get kind=decision and an auto-applied quorum tag.

MCP conductor_ask mode=save_memory (Claude Desktop / Cursor / Agent)

MCP-connected clients (Claude Desktop / Cursor / your own agent) can call the conductor_ask unified tool (mode=save_memory) directly, no web UI needed. Agents save "conclusion" capsules after task completion; Claude Desktop users just say "save this".

+ New capsule (/dashboard/recall)

Top-right button on the Recall page → empty sheet → write title + content + tags from scratch → save. Suits non-conversation notes (reading excerpts, transient ideas).

📸 Screenshot: Chat message hover showing 💾 + sliding save sheet

Search & organization

The Recall page /dashboard/recall currently supports:

Semantic search: Voyage-3 + pgvector cosine top-K — synonyms / cross-language / intent matching; auto-falls back to literal search during transient upstream outages (seamless to users)
Literal search: ts_query full-text on title + content_md, 300ms debounce
Tag filter: click any row tag chip to filter
Auto Curator: auto-evaluates conversation segments worth keeping (rule scoring ~ms; gray-zone refinement via Haiku 4.5 on click)
Chip recommendations: "Related memories" floating above chat input, semantically matched capsules
Timeline view (Activity-band): density heatbar + top topic chips, toggle at top of Recall page
Dedup hint: 7-day window cosine ≥ 0.85 → suggest merge before save
Month grouping: auto-grouped by created_at YYYY-MM, collapsible
Infinite scroll: 50 per page, IntersectionObserver auto-loads

Roadmap: Constellation (time × semantic 2D UMAP projection with auto-clustering + Haiku-generated cluster labels) — unlocks at ≥ 30 capsules. FIFO auto-overflow, weekly digest email, and cross-project project_id isolation arrive in later versions.

📸 Screenshot: Recall list view with month grouping + tag chips

Pricing tier (locked at M.2)

Recall is the only capacity-tiered subscription at Nexevo (Chat / Agent / Quorum themselves are token / task pay-as-you-go, no monthly fee). Three tiers Free / Pro / Max:

Tier	Price	Capacity	Approx
Free	$0	5 MB	~600 capsules
Pro	$10/mo	50 MB	~6,000 capsules
Max	$40/mo	300 MB	~36,000 capsules

Capacity counts title + content_md + 8KB metadata overhead. Quota exceeded returns HTTP 429 quota_exceeded with upgrade link; per-capsule hard cap is 1MB (infrastructure protection), exceeded returns 413 capsule_too_large.

Admins can override per-user quota at /x_console/users/{user_id}(enterprise POC / support churn fix); set NULL to revert to tier default.

See /pricing.

MCP usage (Claude Desktop / Cursor / Agent)

Recall is exposed via the Nexevo MCP server to any MCP-aware client (Claude Desktop / Cursor / your agent). The server has one unified toolconductor_ask; Recall operations are selected via themode param:

mode=save_memory — store a capsule
mode=search_memory — semantic capsule recall

In Claude Desktop / Cursor, add the server config:

json
{
  "mcpServers": {
    "nexevo": {
      "command": "npx",
      "args": [
        "-y", "mcp-remote",
        "https://api.nexevo.ai/v1/mcp",
        "--header", "Authorization: Bearer YOUR_NEXEVO_API_KEY"
      ]
    }
  }
}

Once enabled, you can tell Claude:

"Save what we just discussed about Clerk; tag: auth, decision"

Claude auto-invokes conductor_ask (mode=save_memory) → hits /v1/memory/capsules REST endpoint → stored. Recall is symmetric: mode=search_memory.

Full MCP setup + all modes + Cursor config + FAQ: see/docs/mcp →

REST API reference

All endpoints use Bearer JWT auth with per-user isolation (cross-user access returns 404 to prevent existence leakage).

Method	Path	Description
POST	/v1/memory/capsules	Create capsule (429 on quota / 413 on 1MB cap)
GET	/v1/memory/capsules?q=&tag=&limit=&cursor=	List + literal search + tag filter + cursor pagination
GET	/v1/memory/capsules/{id}	Single capsule detail
PATCH	/v1/memory/capsules/{id}	Update title / content_md / tags (429 if resize exceeds quota)
DELETE	/v1/memory/capsules/{id}	Soft delete, hard cleanup after 30 days
POST	/v1/memory/capsules/delete-all	Bulk soft delete from settings, requires confirm="delete-all" in body

bash
# 创建胶囊 example
curl https://api.nexevo.ai/v1/memory/capsules \
  -H "Authorization: Bearer $NEXEVO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "title":       "Clerk 选型决策",
    "content_md":  "决定用 Clerk 而非自建...",
    "kind":        "decision",
    "tags":        ["auth", "decision"]
  }'

FAQ

Is there a per-capsule size limit?

Yes: 1MB per capsule (~250k Chinese chars / ~1M ASCII), exceeded returns 413 capsule_too_large with friendly JSON. Pydantic limit on content_md is 800KB, leaving 200KB headroom for metadata / tags. Trim or split if exceeded.

What happens when I hit the tier quota?

POST/PATCH returns 429 quota_exceeded with upgrade link. Delete old capsules to free space, or upgrade tier. A later release adds FIFO cold storage (oldest unaccessed capsules move to cold tier when full; one-click recover within 30 days).

Can I recover a deleted capsule?

Soft-deleted capsules can be recovered within 30 days by emailing [email protected] (a self-service restore UI ships in a later release). After 30 days, cron hard-deletes; not recoverable.

Are capsules used to train models?

No. Recall capsules, like all user-generated content, follow zero-retention training defaults on paid tiers. See privacy policy for details.

How do I export my capsules?

Settings → Recall tab → Export data, downloads a full JSON archive (account info / capsules / API key metadata, GDPR/PIPL-compliant).

Get started with Recall

Go to Playground, chat with the AI, hover any message and click 💾 — that's it.

Open Playground

RecallLong-term AI memory layer

Chat hover 💾

Quorum hover 💾

MCP conductor_ask mode=save_memory (Claude Desktop / Cursor / Agent)

+ New capsule (/dashboard/recall)

Get started with Recall