Give your AI agent persistent memory. One command. No account. Works offline.
Awareness Local is a local-first memory system for AI coding agents. It runs a lightweight daemon on your machine that stores memories as Markdown files, searches with hybrid FTS5 + embedding, and connects to any IDE via the MCP protocol.
npx @awareness-sdk/setupThat's it. Your AI agent now remembers everything across sessions.
Evaluated on LongMemEval — the industry standard benchmark for long-term conversational memory. 500 human-curated questions across 5 core capabilities.
╔══════════════════════════════════════════════════════════════╗
║ ║
║ Awareness Memory — LongMemEval Benchmark Results ║
║ ───────────────────────────────────────────────── ║
║ ║
║ Benchmark: LongMemEval (ICLR 2025) ║
║ Dataset: 500 human-curated questions ║
║ Variant: LongMemEval_S (~115k tokens per question) ║
║ ║
║ ┌─────────────────────────────────────────────────┐ ║
║ │ │ ║
║ │ Recall@1 77.6% (388 / 500) │ ║
║ │ Recall@3 91.8% (459 / 500) │ ║
║ │ Recall@5 95.6% (478 / 500) ◀ PRIMARY │ ║
║ │ Recall@10 97.4% (487 / 500) │ ║
║ │ │ ║
║ └─────────────────────────────────────────────────┘ ║
║ ║
║ Method: Hybrid RRF (BM25 + Semantic Vector Search) ║
║ Embedding: all-MiniLM-L6-v2 (384d) ║
║ LLM Calls: 0 (pure retrieval, no generation cost) ║
║ Hardware: Apple M1, 8GB RAM — 14 min total ║
║ ║
╚══════════════════════════════════════════════════════════════╝
┌─────────────────────────────────────────────────────────────┐
│ Long-Term Memory Retrieval — R@5 Leaderboard │
│ LongMemEval (ICLR 2025, 500 questions) │
├─────────────────────────────────┬───────────┬───────────────┤
│ System │ R@5 │ Note │
├─────────────────────────────────┼───────────┼───────────────┤
│ MemPalace (ChromaDB raw) │ 96.6% │ R@5 only * │
│ ★ Awareness Memory (Hybrid) │ 95.6% │ Hybrid RRF │
│ OMEGA │ 95.4% │ QA Accuracy │
│ Mastra (GPT-5-mini) │ 94.9% │ QA Accuracy │
│ Mastra (GPT-4o) │ 84.2% │ QA Accuracy │
│ Supermemory │ 81.6% │ QA Accuracy │
│ Zep / Graphiti │ 71.2% │ QA Accuracy │
│ GPT-4o (full context) │ 60.6% │ QA Accuracy │
├─────────────────────────────────┴───────────┴───────────────┤
│ * MemPalace 96.6% is Recall@5 only, not QA Accuracy. │
│ Palace hierarchy was NOT used in the evaluation. │
└─────────────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────────────┐
│ Awareness Memory — R@5 by Question Type │
│ │
│ knowledge-update ████████████████████████████ 100% │
│ multi-session ███████████████████████████▋ 98.5%│
│ single-session-asst ███████████████████████████▌ 98.2%│
│ temporal-reasoning █████████████████████████▊ 94.7%│
│ single-session-user ████████████████████████▎ 88.6%│
│ single-session-pref ███████████████████████▏ 86.7%│
│ │
│ Overall █████████████████████████▉ 95.6%│
│ │
│ ┌───────────────────────────────────────────────┐ │
│ │ Ablation Study │ │
│ │ ───────────────────────────────────────── │ │
│ │ Vector-only: 92.6% ▓▓▓▓▓▓▓▓▓▓▓▓▓░░░ │ │
│ │ BM25-only: 91.4% ▓▓▓▓▓▓▓▓▓▓▓▓▓░░░ │ │
│ │ Hybrid RRF: 95.6% ▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓░ ★ │ │
│ │ Hybrid = +3% over any │ │
│ │ single method alone │ │
│ └───────────────────────────────────────────────┘ │
│ │
│ arxiv.org/abs/2410.10813 awareness.market │
└─────────────────────────────────────────────────────────────┘
Zero LLM calls. Reproducible benchmark scripts →
Before: Every session starts from scratch. You re-explain the codebase, re-justify decisions, watch the agent redo work.
After: Your agent says "I remember you were migrating from MySQL to PostgreSQL. Last session you completed the schema changes and had 2 TODOs remaining..."
Session 1 Session 2
┌─────────────────────────┐ ┌─────────────────────────┐
│ Agent: "What database?" │ │ Agent: "I remember we │
│ You: "PostgreSQL..." │ │ chose PostgreSQL for │
│ Agent: "What framework?"│ → │ JSON support. You had │
│ You: "FastAPI..." │ │ 2 TODOs left. Let me │
│ (repeat every session) │ │ continue from there." │
└─────────────────────────┘ └─────────────────────────┘
| IDE | Auto-detected | Plugin |
|---|---|---|
| Claude Code | ✅ | awareness-memory |
| Cursor | ✅ | via MCP |
| Windsurf | ✅ | via MCP |
| OpenClaw | ✅ | @awareness-sdk/openclaw-memory |
| Cline | ✅ | via MCP |
| GitHub Copilot | ✅ | via MCP |
| Codex CLI | ✅ | via MCP |
| Kiro | ✅ | via MCP |
| Trae | ✅ | via MCP |
| Zed | ✅ | via MCP |
| JetBrains (Junie) | ✅ | via MCP |
| Augment | ✅ | via MCP |
| AntiGravity (Jules) | ✅ | via MCP |
Your IDE / AI Agent
│
│ MCP Protocol (localhost:37800)
▼
┌────────────────────────────────────┐
│ Awareness Local Daemon │
│ │
│ Markdown files → Human-readable, git-friendly
│ SQLite FTS5 → Fast keyword search
│ Local embedding → Semantic search (optional: npm i @huggingface/transformers)
│ Knowledge cards → Auto-extracted decisions, solutions, risks
│ Web Dashboard → http://localhost:37800/
│ │
│ Cloud sync (optional) │
│ → One-click device-auth │
│ → Bidirectional sync │
│ → Semantic vector search │
│ → Team collaboration │
└────────────────────────────────────┘
All memories stored as Markdown files in .awareness/ — human-readable, editable, git-friendly:
.awareness/
├── memories/
│ ├── 2026-03-22_decided-to-use-postgresql.md
│ ├── 2026-03-22_fixed-auth-bug.md
│ └── ...
├── knowledge/
│ ├── decisions/postgresql-over-mysql.md
│ └── solutions/auth-token-refresh.md
├── tasks/
│ └── open/implement-rate-limiting.md
└── index.db (search index, auto-rebuilt)
| Tool | What it does |
|---|---|
awareness_init |
Load session context — recent knowledge, tasks, rules |
awareness_recall |
Search memories — progressive disclosure (summary → full) |
awareness_record |
Save decisions, code changes, insights — with knowledge extraction |
awareness_lookup |
Fast lookup — tasks, knowledge cards, session history, risks |
awareness_get_agent_prompt |
Get agent-specific prompts for multi-agent setups |
Instead of dumping everything into context, Awareness uses a two-phase recall:
Phase 1: awareness_recall(query, detail="summary")
→ Lightweight index (~80 tokens each): title + summary + score
→ Agent reviews and picks what's relevant
Phase 2: awareness_recall(detail="full", ids=[...])
→ Complete content for selected items only
→ No truncation, no wasted tokens
Visit http://localhost:37800/ to browse memories, knowledge cards, tasks, and manage cloud sync.
Connect to Awareness Cloud for:
- Semantic vector search (100+ languages)
- Cross-device real-time sync
- Team collaboration
- Memory marketplace
npx @awareness-sdk/setup --cloud
# Or click "Connect to Cloud" in the dashboardAwareness Local is part of the Awareness ecosystem:
| Package | For | Install |
|---|---|---|
| Awareness Local | Local daemon + MCP server | npx @awareness-sdk/setup |
| Python SDK | wrap_openai() / wrap_anthropic() interceptors |
pip install awareness-memory-cloud |
| TypeScript SDK | wrapOpenAI() / wrapAnthropic() interceptors |
npm i @awareness-sdk/memory-cloud |
| OpenClaw Plugin | Auto-recall + auto-capture | openclaw plugins install @awareness-sdk/openclaw-memory |
| Claude Code Plugin | Skills + hooks | /plugin marketplace add edwin-hao-ai/Awareness-SDK → /plugin install awareness-memory@awareness |
| Setup CLI | One-command setup for 13+ IDEs | npx @awareness-sdk/setup |
Full SDK docs: awareness.market/docs
- Node.js 18+
- Any MCP-compatible IDE
No Python, no Docker, no cloud account needed.
Apache 2.0