Two Kinds of Memory
Cognitive science distinguishes between two kinds of memory:
Semantic Memory - Facts and concepts
- “The capital of France is Paris”
- “Water boils at 100°C”
- Timeless, context-free, declarative
Episodic Memory - Events and experiences
- “I remember the meeting where we discussed the budget”
- “That call where the client mentioned timeline concerns”
- Time-stamped, contextual, experiential
Why Episodic Matters
Consider these queries:

| Query | Memory Type | What’s Needed |
|---|---|---|
| “What is our pricing model?” | Semantic | Retrieved from docs |
| “What did the client say about pricing last Tuesday?” | Episodic | Retrieved from recordings |
| “How many people attended the meeting?” | Episodic | Visual memory of the event |
| “What was on screen when they mentioned the deadline?” | Episodic | Multimodal temporal context |
Video as Natural Episodic Memory
Video is inherently episodic:
- Time-indexed - Every frame has a timestamp
- Multi-sensory - Visual + audio together
- Contextual - Shows the environment, not just content
- Continuous - Captures the flow of events
The Memory Problem
Raw recordings aren’t queryable. You can’t ask an MP4 file “what happened?” Traditional approaches:
- Full transcription - Converts audio to text, loses visual context
- Frame extraction - Expensive, loses temporal flow
- Manual notes - Doesn’t scale, subjective
- Just store it - Recording exists but no one can find anything
Indexed Episodic Memory
The solution: indexes that understand what happened and when. Each index entry records:
- What happened (semantic content)
- When it happened (timestamps)
- Evidence (playable links)
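
As a rough illustration of the three fields above, here is a minimal Python sketch of one index entry and a lookup over a list of entries. Everything in it is an assumption made for illustration: the `IndexEntry` type, the `recording_id` field, the `example.com` URL scheme, and the sample data are hypothetical, and plain keyword matching stands in for whatever semantic search a real index would use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IndexEntry:
    """One indexed moment: what happened, when, and where to replay it."""
    description: str   # what happened (semantic content)
    start: float       # when it started, in seconds from the start of the recording
    end: float         # when it ended, in seconds
    recording_id: str  # hypothetical identifier of the source recording

    def evidence_url(self) -> str:
        # Hypothetical URL scheme: a player link that opens at the indexed span.
        return (
            f"https://example.com/play/{self.recording_id}"
            f"?start={self.start:g}&end={self.end:g}"
        )


def search(entries, keyword):
    """Return entries whose description mentions the keyword, in time order.

    Keyword matching is a stand-in for real semantic search.
    """
    needle = keyword.lower()
    hits = [e for e in entries if needle in e.description.lower()]
    return sorted(hits, key=lambda e: e.start)


# Made-up sample data for the sketch.
entries = [
    IndexEntry("Client raises timeline concerns", 872.0, 905.5, "call-001"),
    IndexEntry("Team reviews the budget slide", 120.0, 180.0, "call-001"),
]
hits = search(entries, "timeline")
```

Each hit carries the description, the time span, and a link back to the source, so a caller can show the evidence rather than only report it.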
Ephemeral vs Persistent
Not all perception needs permanent memory.
Ephemeral - Process but don’t store
- Real-time event detection
- Privacy-sensitive contexts
- Temporary sessions
Persistent - Store and index for later recall
- Meeting recordings
- Training content
- Compliance archives
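
One way to picture the split is as a per-session retention setting. The Python sketch below is hypothetical: the `Retention` enum, the `PerceptionSession` class, and the "deadline" trigger are invented for illustration and do not describe any particular product's API. Detection runs in both modes, and only persistent sessions keep what they observed.

```python
from enum import Enum


class Retention(Enum):
    EPHEMERAL = "ephemeral"    # process events, keep nothing
    PERSISTENT = "persistent"  # process events and keep them for later recall


class PerceptionSession:
    """Hypothetical session that always detects events but stores them selectively."""

    def __init__(self, retention):
        self.retention = retention
        self.alerts = []  # real-time detections, produced in both modes
        self.stored = []  # observations kept only when retention is PERSISTENT

    def observe(self, timestamp, description):
        # Real-time detection runs regardless of the retention setting.
        if "deadline" in description.lower():
            self.alerts.append((timestamp, description))
        # Storage happens only for persistent sessions.
        if self.retention is Retention.PERSISTENT:
            self.stored.append((timestamp, description))


live = PerceptionSession(Retention.EPHEMERAL)
live.observe(12.0, "Speaker mentions the deadline")

archive = PerceptionSession(Retention.PERSISTENT)
archive.observe(12.0, "Speaker mentions the deadline")
```

Both sessions raise the alert, but only `archive` has anything to recall afterwards.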
Desktop as Continuous Input
Desktop capture creates continuous episodic input.
Multi-Session Memory
Episodic memory spans sessions, so an agent can recall events from earlier sessions as well as the current one.
Grounded Answers
Episodic memory enables grounded responses.
Without episodic memory:
“I believe the pricing discussion happened last week…”
With episodic memory:
“At 14:32 in yesterday’s meeting, Sarah said ‘We need to revisit the enterprise tier pricing.’ Here’s the clip: [play]”
The difference is trust. Episodic memory provides verifiable evidence.
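
To make the contrast concrete, here is a minimal Python sketch of an answer built from stored moments that span more than one session. It is an illustration under stated assumptions: the `Moment` type, the `clip://` reference format, the session labels, and the "Alex" entry are made up, the timestamp is treated as an offset into the recording, and keyword matching again stands in for real retrieval. When nothing matches, the function returns `None` instead of guessing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Moment:
    session: str  # hypothetical session label
    seconds: int  # offset from the start of that session's recording
    speaker: str
    quote: str


def format_timestamp(seconds):
    """Render an offset in seconds as MM:SS."""
    minutes, secs = divmod(seconds, 60)
    return f"{minutes:02d}:{secs:02d}"


def grounded_answer(memory, keyword):
    """Answer from the first matching moment, or return None if there is no evidence."""
    needle = keyword.lower()
    for m in memory:
        if needle in m.quote.lower():
            clip = f"clip://{m.session}?t={m.seconds}"  # hypothetical clip reference
            return (
                f"At {format_timestamp(m.seconds)} in {m.session}, "
                f"{m.speaker} said '{m.quote}' Clip: {clip}"
            )
    return None


# Made-up memory spanning two sessions.
memory = [
    Moment("monday-standup", 95, "Alex", "The demo is ready."),
    Moment("tuesday-client-call", 872, "Sarah",
           "We need to revisit the enterprise tier pricing."),
]
answer = grounded_answer(memory, "pricing")
```

The answer names the session, the time, the speaker, and a clip reference, which are the pieces a reader needs to verify the claim.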
The Future
The agents we’re building will:
- Perceive continuously (screens, mics, cameras)
- Index what they perceive (spoken, visual, events)
- Remember across sessions (episodic recall)
- Answer with evidence (playable proof)