
Overview

Long-term memory in Synap is not static. Every memory has a lifecycle: it enters the system through ingestion, is processed and enriched through a multi-stage pipeline, is stored in both vector and graph datastores, serves retrieval queries over time, and eventually ages out based on retention policies. Understanding this lifecycle helps you design memory architectures that balance recall quality with storage efficiency. This page traces a memory from the moment it enters the system to its eventual eviction.

Lifecycle Stages

1. Ingestion — Raw Content Enters the System

Memories enter Synap through one of two paths:
  • SDK ingestion: Your application calls sdk.memories.create() or sdk.conversation.end() to submit content. This is the most common path for conversational data.
  • API ingestion: The REST API (POST /v1/memories or POST /v1/memories/batch) is used for bulk imports, bootstrap data, and organizational context.
At this stage, the content is raw — unprocessed text with metadata (user_id, customer_id, scope, priority, optional document_id).
```python
# Ingest a memory via the SDK
await sdk.memories.create(
    content="The customer prefers email communication over phone calls.",
    user_id="user_abc",
    customer_id="cust_xyz",
    metadata={"source": "support_conversation", "conversation_id": "conv_123"}
)
```
2. Processing — The Multi-Stage Pipeline

Once ingested, content enters the processing pipeline. This is where raw text is transformed into structured, queryable memories. The pipeline consists of several stages:
  1. Categorization — Content is classified into categories (factual, preference, procedural, episodic, emotional, temporal) to determine how it should be processed and weighted.
  2. Extraction — Key information is extracted based on category:
    • Facts: concrete statements about the user or world (“prefers email”)
    • Preferences: explicit or inferred preferences (“likes dark mode”)
    • Episodes: event descriptions with participants and outcomes
    • Emotions: sentiment and emotional context
    • Temporal events: time-bound information (“meeting next Tuesday”)
  3. Chunking — Long content is split into semantically coherent chunks for embedding. Each chunk maintains a reference to its source.
  4. Entity Resolution — Named entities (people, companies, products) are identified, resolved against the entity registry, and linked. This stage creates graph connections between memories.
  5. Organization — Processed memories are organized by scope, linked to relevant entities, and assigned confidence scores.
Each stage enriches the memory with metadata that improves retrieval quality downstream.
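The five stages above can be sketched as a chain of enrichment functions over a memory dict. The stage names come from this page; the bodies below are deliberately simplistic placeholders (the real categorizer and entity resolver are far more sophisticated).

```python
# Minimal sketch of the processing pipeline: each stage enriches the
# memory dict and passes it on. Implementations are placeholders.
def categorize(mem):
    # Real categorization classifies into factual, preference,
    # procedural, episodic, emotional, or temporal.
    mem["category"] = "preference" if "prefer" in mem["content"] else "factual"
    return mem

def extract(mem):
    # Extraction pulls out facts, preferences, episodes, emotions,
    # or temporal events depending on category.
    mem["extracted"] = mem["content"].strip()
    return mem

def chunk(mem, max_len=200):
    # Long content is split into chunks; each keeps a source reference.
    text = mem["content"]
    mem["chunks"] = [text[i:i + max_len] for i in range(0, len(text), max_len)]
    return mem

def resolve_entities(mem):
    mem["entities"] = []  # real systems match against an entity registry
    return mem

def organize(mem):
    mem.setdefault("confidence", 0.9)  # confidence is assigned here
    return mem

PIPELINE = [categorize, extract, chunk, resolve_entities, organize]

def process(mem):
    for stage in PIPELINE:
        mem = stage(mem)
    return mem

mem = process({"content": "The customer prefers email."})
```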
3. Storage — Dual-Store Persistence

Processed memories are persisted in two complementary stores:

Vector Store

Embeddings of memory chunks are stored for semantic similarity search. When a retrieval query arrives, the vector store finds memories that are semantically close to the query — even if they don’t share exact keywords.

Graph Store

Entity relationships and memory connections are stored in the graph. This enables relationship-based queries: “What do we know about this customer’s team?” traverses the graph to find connected memories.
Every memory is scoped to one of four levels:
| Scope | Description | Example |
| --- | --- | --- |
| USER | Specific to one end-user | “Alice prefers dark mode” |
| CUSTOMER | Shared across users in a customer org | “Acme Corp uses Slack for communication” |
| CLIENT | Organizational knowledge for your app | “Our refund policy requires manager approval” |
| WORLD | Universal knowledge | “Python 3.12 was released in October 2023” |
Memories are stored with timestamps (created_at, last_accessed), confidence scores, and full provenance metadata linking back to the source content.
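A stored memory, then, carries at minimum the fields named above: timestamps, a confidence score, a scope, and provenance. A rough sketch of that record, with any field beyond those mentioned in the text being an assumption:

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class MemoryRecord:
    """Illustrative shape of a persisted memory, per the fields this
    page describes. Not the actual storage schema."""
    content: str
    scope: str            # USER | CUSTOMER | CLIENT | WORLD
    confidence: float     # 0.0-1.0, assigned during extraction
    provenance: dict      # chain of custody back to the source content
    created_at: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc))
    last_accessed: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc))

rec = MemoryRecord(
    content="Alice prefers dark mode",
    scope="USER",
    confidence=0.95,
    provenance={"conversation_id": "conv_123", "submitted_by": "user_abc"},
)
```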
4. Active Retrieval — Serving Queries

When an agent needs context, it queries the retrieval system. The retrieval process:
  1. Embeds the query
  2. Searches the vector store for semantically similar memories
  3. Traverses the graph for relationship-connected memories
  4. Merges results across all applicable scopes (USER + CUSTOMER + CLIENT + WORLD)
  5. Ranks by relevance, recency, and confidence
  6. Returns the top results within the configured context budget
Every retrieval updates the last_accessed timestamp on the returned memories. This is critical for the aging mechanism — frequently accessed memories stay “fresh” while unused memories age.
```python
# Retrieve context — this updates last_accessed on returned memories
context = await sdk.conversation.context.fetch(
    conversation_id="conv_456",
    user_id="user_abc",
    customer_id="cust_xyz",
    messages=[{"role": "user", "content": "What's our refund policy?"}]
)
```
5. Aging — Memories Become Stale

Over time, memories that are not accessed naturally lose prominence in retrieval results. This happens through two mechanisms:
  • Recency ranking: The retrieval ranker includes a recency signal. Memories with a recent last_accessed timestamp receive a boost; those that haven’t been accessed in weeks or months receive a penalty.
  • Staleness detection: Memories whose last_accessed timestamp exceeds a configurable threshold are flagged as stale. Stale memories are still retrievable but are deprioritized.
Aging is a natural process — it ensures that your context stays relevant without requiring manual cleanup. A memory about a customer’s preferred product version from two years ago will naturally give way to more recent preferences.
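Staleness detection reduces to a simple threshold check on last_accessed. The 90-day default below is an illustrative placeholder; the page only says the threshold is configurable.

```python
from datetime import datetime, timedelta, timezone

def is_stale(last_accessed, threshold_days=90, now=None):
    """Flag a memory as stale once last_accessed exceeds the threshold.

    Stale memories stay retrievable but are deprioritized in ranking.
    The 90-day default is an assumption, not a documented value.
    """
    now = now if now is not None else datetime.now(timezone.utc)
    return (now - last_accessed) > timedelta(days=threshold_days)

now = datetime.now(timezone.utc)
recently_used = is_stale(now - timedelta(days=5), now=now)    # False
long_unused = is_stale(now - timedelta(days=120), now=now)    # True
```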
6. Retention — Policy-Based Lifecycle Management

MACA (Memory Architecture Configuration) defines retention policies that govern how long memories are kept:
  • max_memory_age_days: The maximum age (in days since last access) before a memory is considered for eviction. Default: varies by scope.
  • compaction_policy: What happens when memories exceed the retention threshold:
    • auto — The system decides: important memories are archived, others are deleted.
    • archive — All expired memories are moved to cold storage (retrievable on demand but not included in standard queries).
    • delete — Expired memories are permanently removed.
```yaml
storage:
  retention:
    max_memory_age_days: 365
    compaction_policy: auto
```
7. Eviction — End of the Memory Lifecycle

Memories that exceed the retention policy are either archived or deleted based on the configured compaction_policy.
  • Archived memories are moved to cold storage. They no longer appear in standard retrieval results but can be accessed through explicit archive queries. Archiving preserves the memory for compliance or audit purposes.
  • Deleted memories are permanently removed from both the vector store and graph store. Entity connections are cleaned up, and any orphaned entities are flagged for review.
Eviction runs as a background process on a configurable schedule (default: daily).
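The eviction pass can be sketched as a sweep that partitions expired memories according to compaction_policy. The confidence cutoff used to decide "important" under the auto policy is an assumption; the page only says the system decides.

```python
from datetime import datetime, timedelta, timezone

def evict(memories, max_memory_age_days, compaction_policy, now=None):
    """Partition memories past retention into (archived, deleted).

    Mirrors the three policies on this page: archive moves everything
    expired to cold storage, delete removes it, and auto archives
    "important" memories — modeled here, as an assumption, by a 0.8
    confidence cutoff.
    """
    now = now if now is not None else datetime.now(timezone.utc)
    cutoff = now - timedelta(days=max_memory_age_days)
    archived, deleted = [], []
    for mem in memories:
        if mem["last_accessed"] >= cutoff:
            continue  # still within retention; untouched
        if compaction_policy == "archive":
            archived.append(mem)
        elif compaction_policy == "delete":
            deleted.append(mem)
        else:  # auto
            (archived if mem["confidence"] >= 0.8 else deleted).append(mem)
    return archived, deleted

now = datetime.now(timezone.utc)
mems = [
    {"id": "m1", "last_accessed": now, "confidence": 0.9},
    {"id": "m2", "last_accessed": now - timedelta(days=400), "confidence": 0.9},
    {"id": "m3", "last_accessed": now - timedelta(days=400), "confidence": 0.4},
]
archived, deleted = evict(mems, max_memory_age_days=365,
                          compaction_policy="auto")
```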

What Happens at Each Stage

During extraction, each memory is assigned a confidence score (0.0 to 1.0) indicating how certain the system is about the extracted information. A direct statement like “I prefer email” gets a high confidence (0.9+), while an inferred preference based on behavior patterns may score lower (0.5-0.7). Confidence scores influence retrieval ranking.
Entity resolution identifies named entities in the content and links them to canonical entries in the entity registry. If the user mentions “John from the DevOps team,” the system resolves “John” to a specific entity record and creates graph edges connecting the memory to that entity. This enables powerful relationship queries later.
Every memory maintains provenance metadata: which conversation it came from, which user submitted it, what the original content was, and what transformations were applied. This chain of custody is essential for debugging, compliance, and trust.
Scope is determined by the presence of identity fields during ingestion:
  • user_id + customer_id present → USER scope
  • Only customer_id present → CUSTOMER scope
  • Neither present → CLIENT scope
  • Explicitly marked → WORLD scope
Scope cannot be changed after storage — it is immutable.
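The scope rules above reduce to a small decision function. The world_scope flag here stands in for however "explicitly marked" WORLD content is actually signaled at ingestion, which this page doesn't specify.

```python
def infer_scope(user_id=None, customer_id=None, world_scope=False):
    """Determine memory scope from the identity fields present at
    ingestion, per the rules on this page. `world_scope` is a
    stand-in for the explicit WORLD marker."""
    if world_scope:
        return "WORLD"
    if user_id and customer_id:
        return "USER"
    if customer_id:
        return "CUSTOMER"
    return "CLIENT"
```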

How Entity Resolution Enriches Memories

Entity resolution is a critical enrichment step that transforms isolated memories into a connected knowledge graph. When a memory mentions “Sarah from engineering,” the entity resolution system:
  1. Identifies the entity mention (“Sarah from engineering”)
  2. Searches the entity registry for matches (semantic + exact matching)
  3. Resolves to a canonical entity (e.g., entity_sarah_chen_engineering)
  4. Links the memory to the entity in the graph store
  5. Auto-registers new entities if no match is found (at CUSTOMER scope by default)
The result is that future queries about “Sarah” will surface all memories linked to her entity — even if they use different names (“Sarah Chen,” “S. Chen,” “the engineering lead”).
Entity resolution operates within scope boundaries. A USER-scope entity won’t match against a different user’s entities. The scope chain for resolution is: USER → CUSTOMER → CLIENT → WORLD (narrowest first).
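A toy version of that resolution walk, matching a mention against an in-memory registry scope by scope, narrowest first. Real resolution also matches semantically; this sketch only does alias lookup, and the registry layout is invented for illustration.

```python
# Scope chain from this page: narrowest first.
SCOPE_CHAIN = ["USER", "CUSTOMER", "CLIENT", "WORLD"]

def resolve(mention, registry, scopes=SCOPE_CHAIN):
    """Return the canonical entity id for a mention, or None.

    Walks the scope chain narrowest-first and does case-insensitive
    alias matching; the real system adds semantic matching and
    auto-registers unmatched entities at CUSTOMER scope.
    """
    key = mention.lower()
    for scope in scopes:
        for entity_id, info in registry.get(scope, {}).items():
            if key in (alias.lower() for alias in info["aliases"]):
                return entity_id
    return None

# Hypothetical registry: one canonical entity with several aliases.
registry = {
    "CUSTOMER": {
        "entity_sarah_chen_engineering": {
            "aliases": ["Sarah Chen", "S. Chen", "Sarah from engineering"],
        }
    }
}
```

With this registry, “Sarah from engineering”, “sarah chen”, and “S. Chen” all resolve to the same canonical entity, which is what lets later queries surface every memory linked to her regardless of which name was used.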

Storage Lifecycle States

Every memory transitions through these states:
Active → Stale → Archived → Evicted
  │         │         │          │
  │         │         │          └── Permanently removed (delete policy)
  │         │         └── Cold storage, not in standard retrieval
  │         └── Still retrievable, but deprioritized in ranking
  └── Fully active, included in retrieval with full ranking weight
| State | Retrievable | Ranking Weight | Storage |
| --- | --- | --- | --- |
| Active | Yes | Full | Hot (vector + graph) |
| Stale | Yes | Reduced (recency penalty) | Hot (vector + graph) |
| Archived | On-demand only | N/A | Cold storage |
| Evicted | No | N/A | Removed |
Eviction with the delete policy is irreversible. If you need to retain memories for compliance or audit purposes, use the archive policy instead.
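The diagram and table above imply a small set of legal state transitions. One way to encode them is a lookup table; note the stale-to-active edge is an inference from the aging description (a re-accessed memory refreshes its recency), not something this page states explicitly.

```python
# Legal lifecycle transitions, derived from the state diagram above.
# The "stale" -> "active" edge is an assumption based on how access
# refreshes last_accessed; the rest follow the diagram directly.
TRANSITIONS = {
    "active": {"stale"},
    "stale": {"active", "archived", "evicted"},
    "archived": {"evicted"},
    "evicted": set(),           # terminal: delete is irreversible
}

def can_transition(src, dst):
    return dst in TRANSITIONS.get(src, set())
```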

Next Steps

Conversational Context Lifecycle

How context works within a single conversation.

Org-context Lifecycle

How organizational knowledge enters and surfaces in retrieval.

Entity Resolution

Deep dive into how entities are identified, resolved, and linked.