AIVectorMemory

AIVectorMemory

Give your AI coding assistant a memory — Cross-session persistent memory MCP Server

Still using CLAUDE.md / MEMORY.md as memory? This Markdown-file memory approach has fatal flaws: the file keeps growing, injecting everything into every session and burning massive tokens; content only supports keyword matching — search "database timeout" and you won't find "MySQL connection pool pitfall"; sharing one file across projects causes cross-contamination; there's no task tracking, so dev progress lives entirely in your head; not to mention the 200-line truncation, manual maintenance, and inability to deduplicate or merge.

AIVectorMemory is a fundamentally different approach. Local vector database storage with semantic search for precise recall (matches even when wording differs), on-demand retrieval that loads only relevant memories (token usage drops 50%+), automatic multi-project isolation with zero interference, and built-in issue tracking + task management that lets AI fully automate your dev workflow. All data is permanently stored on your machine — zero cloud dependency, never lost when switching sessions or IDEs.

✨ Core Features

Feature	Description
🧠 Cross-Session Memory	Your AI finally remembers your project — pitfalls, decisions, conventions all persist across sessions
🔍 Semantic Search	No need to recall exact wording — search "database timeout" and find "MySQL connection pool issue"
💰 Save 50%+ Tokens	Stop copy-pasting project context every conversation. Semantic retrieval on demand, no more bulk injection
🔗 Task-Driven Dev	Issue tracking → task breakdown → status sync → linked archival. AI manages the full dev workflow
📊 Desktop App + Web Dashboard	Native desktop app (macOS/Windows/Linux) + Web dashboard, visual management for memories and tasks, 3D vector network reveals knowledge connections at a glance
🏠 Fully Local	Zero cloud dependency. ONNX local inference, no API Key, data never leaves your machine
🔌 All IDEs	Cursor / Kiro / Claude Code / Windsurf / VSCode / OpenCode / Trae / Codex — one-click install, works out of the box
📁 Multi-Project Isolation	One DB for all projects, auto-isolated with zero interference, seamless project switching
🔄 Smart Dedup	Similarity > 0.95 auto-merges updates, keeping your memory store clean — never gets messy over time
🌐 7 Languages	简体中文 / 繁體中文 / English / Español / Deutsch / Français / 日本語, full-stack i18n for dashboard + Steering rules

QQ群：1085682431 | 微信：changhuibiz
共同参与项目开发加QQ群或微信交流

Login

Project Selection

Overview & Vector Network

🏗️ Architecture

┌─────────────────────────────────────────────────┐
│                   AI IDE                         │
│  OpenCode / Codex / Claude Code / Cursor / ...  │
└──────────────────────┬──────────────────────────┘
                       │ MCP Protocol (stdio)
┌──────────────────────▼──────────────────────────┐
│              AIVectorMemory Server               │
│                                                  │
│  ┌──────────┐ ┌──────────┐ ┌──────────────────┐ │
│  │ remember │ │  recall   │ │   auto_save      │ │
│  │ forget   │ │  task     │ │   status/track   │ │
│  └────┬─────┘ └────┬─────┘ └───────┬──────────┘ │
│       │            │               │             │
│  ┌────▼────────────▼───────────────▼──────────┐  │
│  │         Embedding Engine (ONNX)            │  │
│  │      intfloat/multilingual-e5-small        │  │
│  └────────────────────┬───────────────────────┘  │
│                       │                          │
│  ┌────────────────────▼───────────────────────┐  │
│  │     SQLite + sqlite-vec (Vector Index)     │  │
│  │     ~/.aivectormemory/memory.db            │  │
│  └────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────┘

🚀 Quick Start

Option 1: pip install (Recommended)

# Install
pip install aivectormemory

# Upgrade to latest version
pip install --upgrade aivectormemory

# Navigate to your project directory, one-click IDE setup
cd /path/to/your/project
run install

run install interactively guides you to select your IDE, auto-generating MCP config, Steering rules, and Hooks — no manual setup needed.

macOS users note:
If you get externally-managed-environment error, add --break-system-packages
If you get enable_load_extension error, your Python doesn't support SQLite extension loading (macOS built-in Python and python.org installers don't support it). Use Homebrew Python instead:
brew install python
/opt/homebrew/bin/python3 -m pip install aivectormemory

Option 2: uvx (zero install)

No pip install needed, run directly:

cd /path/to/your/project
uvx aivectormemory install

Requires uv to be installed. uvx auto-downloads and runs the package — no manual installation needed.

Option 3: Manual configuration

{
  "mcpServers": {
    "aivectormemory": {
      "command": "run",
      "args": ["--project-dir", "/path/to/your/project"]
    }
  }
}

📍 IDE Configuration File Locations

IDE	Config Path
Kiro	`.kiro/settings/mcp.json`
Cursor	`.cursor/mcp.json`
Claude Code	`.mcp.json`
Windsurf	`.windsurf/mcp.json`
VSCode	`.vscode/mcp.json`
Trae	`.trae/mcp.json`
OpenCode	`opencode.json`
Codex	`.codex/config.toml`

For Codex, use project-scoped TOML instead of JSON:

[mcp_servers.aivectormemory]
command = "run"
args = ["--project-dir", "/path/to/your/project"]

Codex only loads project-scoped .codex/config.toml after the repository is marked as a trusted project.

🛠️ 8 MCP Tools

`remember` — Store a memory

content (string, required)   Memory content in Markdown format
tags    (string[], required)  Tags, e.g. ["pitfall", "python"]
scope   (string)              "project" (default) / "user" (cross-project)

Similarity > 0.95 auto-updates existing memory, no duplicates.

`recall` — Semantic search

query   (string)     Semantic search keywords
tags    (string[])   Exact tag filter
scope   (string)     "project" / "user" / "all"
top_k   (integer)    Number of results, default 5

Vector similarity matching — finds related memories even with different wording.

`forget` — Delete memories

memory_id  (string)     Single ID
memory_ids (string[])   Batch IDs

`status` — Session state

state (object, optional)   Omit to read, pass to update
  is_blocked, block_reason, current_task,
  next_step, progress[], recent_changes[], pending[]

Maintains work progress across sessions, auto-restores context in new sessions.

`track` — Issue tracking

action   (string)   "create" / "update" / "archive" / "list"
title    (string)   Issue title
issue_id (integer)  Issue ID
status   (string)   "pending" / "in_progress" / "completed"
content  (string)   Investigation content

`task` — Task management

action     (string, required)  "batch_create" / "update" / "list" / "delete" / "archive"
feature_id (string)            Linked feature identifier (required for list)
tasks      (array)             Task list (batch_create, supports subtasks)
task_id    (integer)           Task ID (update)
status     (string)            "pending" / "in_progress" / "completed" / "skipped"

Links to spec docs via feature_id. Update auto-syncs tasks.md checkboxes and linked issue status.

`readme` — README generation

action   (string)    "generate" (default) / "diff" (compare differences)
lang     (string)    Language: en / zh-TW / ja / de / fr / es
sections (string[])  Specify sections: header / tools / deps

Auto-generates README content from TOOL_DEFINITIONS / pyproject.toml, multi-language support.

`auto_save` — Auto save preferences

preferences  (string[])  User-expressed technical preferences (fixed scope=user, cross-project)
extra_tags   (string[])  Additional tags

Auto-extracts and stores user preferences at end of each conversation, smart dedup.

📊 Web Dashboard

run web --port 9080
run web --port 9080 --quiet          # Suppress request logs
run web --port 9080 --quiet --daemon  # Run in background (macOS/Linux)

Visit http://localhost:9080 in your browser. Default username admin, password admin123 (can be changed in settings after first login).

Multi-project switching, memory browse/search/edit/delete/export/import
Semantic search (vector similarity matching)
One-click project data deletion
Session status, issue tracking
Tag management (rename, merge, batch delete)
Token authentication protection
3D vector memory network visualization
🌐 Multi-language support (简体中文 / 繁體中文 / English / Español / Deutsch / Français / 日本語)

Scan to join WeChat group | Scan to join QQ group

⚡ Pairing with Steering Rules

AIVectorMemory is the storage layer. Use Steering rules to tell AI when and how to call these tools.

Running run install auto-generates Steering rules and Hooks config — no manual setup needed.

IDE	Steering Location	Hooks
Kiro	`.kiro/steering/aivectormemory.md`	`.kiro/hooks/*.hook`
Cursor	`.cursor/rules/aivectormemory.md`	`.cursor/hooks.json`
Claude Code	`CLAUDE.md` (appended)	`.claude/settings.json`
Windsurf	`.windsurf/rules/aivectormemory.md`	`.windsurf/hooks.json`
VSCode	`.github/copilot-instructions.md` (appended)	`.claude/settings.json`
Trae	`.trae/rules/aivectormemory.md`	—
OpenCode	`AGENTS.md` (appended)	`.opencode/plugins/*.js`
Codex	`AGENTS.md` (appended)	—

📋 Steering Rules Example (auto-generated)

# AIVectorMemory - Workflow Rules

## 1. New Session Startup (execute in order)

1. `recall` (tags: ["project-knowledge"], scope: "project", top_k: 100) load project knowledge
2. `recall` (tags: ["preference"], scope: "user", top_k: 20) load user preferences
3. `status` (no state param) read session state
4. Blocked → report and wait; Not blocked → enter processing flow

## 2. Message Processing Flow

- Step A: `status` read state, wait if blocked
- Step B: Classify message type (chat/correction/preference/code issue)
- Step C: `track create` record issue
- Step D: Investigate (`recall` pitfalls + read code + find root cause)
- Step E: Present plan to user, set blocked awaiting confirmation
- Step F: Modify code (`recall` pitfalls before changes)
- Step G: Run tests to verify
- Step H: Set blocked awaiting user verification
- Step I: User confirms → `track archive` + clear block

## 3. Blocking Rules

Must `status({ is_blocked: true })` when proposing plans or awaiting verification.
Only clear after explicit user confirmation. Never self-clear.

## 4-9. Issue Tracking / Code Checks / Spec Task Mgmt / Memory Quality / Tool Reference / Dev Standards

(Full rules auto-generated by `run install`)

🔗 Hooks Config Example (Kiro only, auto-generated)

Auto-save on session end removed. Dev workflow check (.kiro/hooks/dev-workflow-check.kiro.hook):

{
  "enabled": true,
  "name": "Dev Workflow Check",
  "version": "1",
  "when": { "type": "promptSubmit" },
  "then": {
    "type": "askAgent",
    "prompt": "Core principles: verify before acting, no blind testing, only mark done after tests pass"
  }
}

🇨🇳 Users in China

The embedding model (~200MB) is auto-downloaded on first run. If slow:

export HF_ENDPOINT=https://hf-mirror.com

Or add env to MCP config:

{
  "env": { "HF_ENDPOINT": "https://hf-mirror.com" }
}

📦 Tech Stack

Component	Technology
Runtime	Python >= 3.10
Vector DB	SQLite + sqlite-vec
Embedding	ONNX Runtime + intfloat/multilingual-e5-small
Tokenizer	HuggingFace Tokenizers
Protocol	Model Context Protocol (MCP)
Web	Native HTTPServer + Vanilla JS

📋 Changelog

v2.1.7

Fix: Playwright MCP Config — No Longer Force-Injected

🔧 Playwright MCP config is now opt-in during install (prompted only when npx is available, default: No)
🩹 install auto-cleans legacy Playwright configs written by older versions — fixes OpenCode "mcp.playwright: Invalid input" crash
🗑️ Removed auto_repair_playwright_config from server startup (unreachable when config validation fails)
➕ Added avmrun as a short CLI alias (avmrun install, avmrun web, etc.)

v2.1.6

Fix: CLI Entry Point Renamed

🔧 Renamed CLI entry point from run to aivectormemory — uvx aivectormemory now works directly without --from workaround
♻️ Updated argparse prog name and install runner config to match

v2.1.5

Fix: Playwright MCP Config Compatibility

🔧 Fixed mcp.playwright: Invalid input error on OpenCode after upgrade — _build_playwright_config was missing OpenCode format handling (missing type: local + array command)
♻️ Refactored _build_playwright_config to reuse _build_config format logic — eliminates duplicate if-else branches, automatically adapts to all IDE formats
🩹 Added auto_repair_playwright_config: MCP server auto-detects and fixes incorrect Playwright config on startup — seamless upgrade, no manual reinstall needed

v2.1.4

Fix: Superseded Memory Visibility

🔓 Removed hard filter that completely hid superseded memories from recall results — previously exclude_superseded=true (default) blocked memories before scoring, making them permanently invisible
📊 Superseded memories now ranked naturally via importance reduction (×0.3) + sqrt(importance) scoring — they appear lower in results instead of disappearing entirely
🧹 Removed _load_superseded_ids function and related dead code

v2.1.3

Fix: Scoring Engine Overhaul

🧮 Fixed critical bug: composite score now uses original vector similarity instead of RRF rank score — previously a ~0.8 similarity was replaced by ~0.015 RRF score, destroying semantic relevance signal
√ importance changed from direct multiplier to sqrt(importance) — reduces extreme penalty (0.15 → 0.387 instead of 0.15) while preserving supersede suppression
🛡️ Similarity floor: memories with similarity ≥ 0.85 get a guaranteed minimum score, preventing high-relevance memories from being buried by low importance
⚖️ Rebalanced weights: similarity 0.55 (was 0.5), recency 0.30, frequency 0.15 (was 0.2) — semantic relevance now dominates ranking
📉 FTS-only fallback reduced from 0.5 to 0.3 — pure keyword matches no longer get inflated similarity scores

v2.1.2

Fix: Memory Recall Accuracy

🔍 Fixed tiered search greedy cutoff: long_term results previously blocked short_term memories from being searched, causing highly relevant memories to be invisible
🔧 Both tiers now searched simultaneously, ranked by composite score (similarity × recency × frequency × importance)
🛡️ Fixed filters dict mutation bug in _search_tier — original filters no longer modified by reference

v2.1.1

Enhancement: AI Rule System Upgrade

📋 CLAUDE.md completion: added Identity & Tone (§1), 7 Core Principles (§3), message type judgment examples, expanded IDE safety and self-test sections
⚠️ Hook added Common Violations Reminder: ❌ negative examples reinforcing the 4 most frequently missed rules (self-test, recall, track create, IDE safety)
🌐 All 7 language rule files updated in sync (zh-CN/zh-TW/en/ja/es/de/fr)
🔢 CLAUDE.md sections renumbered to §1–§11, cross-references updated accordingly

v2.1.0

New: Smart Memory Engine + Uninstall

🧠 FTS5 full-text search with Chinese tokenization (jieba) — keyword search now actually works for CJK content
🔀 Hybrid retrieval: vector + FTS5 dual-path with RRF (Reciprocal Rank Fusion) merging
📊 Composite scoring: results ranked by similarity × 0.5 + recency × 0.3 + frequency × 0.2, weighted by importance
⚡ Conflict detection: similar memories (0.85–0.95) auto-superseded, old facts fade automatically
📦 Memory tiers: frequently accessed memories auto-promote to long_term and get searched first
🗑️ Auto-archive: stale short_term memories (90 days inactive + low importance) cleaned up automatically
🔗 Relation expansion: tag overlap ≥ 2 builds related links, 1-hop expansion surfaces connected memories
📝 Auto-summary: long memories (>500 chars) get summaries, brief mode returns summaries to save tokens
🧹 Code cleanup: removed 15 dead code items, refactored 7 duplicate patterns into shared utilities
❌ run uninstall — cleanly removes all IDE configurations (MCP, steering, hooks, permissions) while preserving memory data

v2.0.9

Enhancement: Security & Rule Optimization

🔒 Fixed SQL injection, command injection, and path traversal vulnerabilities
🛡️ Added transaction protection for data integrity (archive, insert, update operations)
🧠 Unified similarity formula across all search paths
📏 Compressed AI workflow rules by 38% (219→136 lines) with zero process removal
🧹 v12 migration cleans up legacy garbage memories automatically
🌐 All 7 languages synchronized

v2.0.8

New: Playwright Browser Testing Built-in

🎭 run install now automatically configures Playwright browser testing — AI can open a real browser to verify frontend changes instead of guessing
🎭 Uses a dedicated test browser (Chrome for Testing) that won't interfere with your personal browser tabs
🔑 Simplified permission setup — no more manual permission popups for common tools
📏 Updated AI rules across all 7 languages to enforce proper browser testing behavior

v2.0.7

Enhancement: More IDE Support

🖥️ Added support for Antigravity and GitHub Copilot IDEs
🔑 run install now auto-configures tool permissions, reducing manual setup
📏 Streamlined AI self-testing rules

v2.0.6

Enhancement: Faster Startup

⚡ Optimized memory loading on session start — loads faster with less context usage
🔑 Auto-configures Claude Code permissions during installation
🌐 All 7 languages synchronized

v2.0.5

Enhancement: Simpler Rules

📏 AI workflow rules restructured for clarity and reduced token usage
💾 AI now automatically saves your preferences at the end of each session
🌐 All 7 languages synchronized

v2.0.4

Fix: Tool Reliability

🔧 Comprehensive audit and fix of all MCP tool parameters — improved reliability across all IDEs

v2.0.3

Enhancement: Better Search & Safety

🔍 Memory search now combines semantic and keyword matching for more accurate recall
🛡️ Added cross-project protection — AI won't accidentally modify files in other projects

v2.0.2

Enhancement: Rule Generalization & Desktop Version Fix

📏 Added "recall before asking user" rule — AI must query memory system before asking user for project information (server address, passwords, deploy config, etc.)
📏 Generalized pre-operation check rule — removed specific examples to apply to all operation scenarios
🖥️ Fixed desktop app settings page showing hardcoded version "1.0.0" instead of actual app version
🌐 All 7 language i18n steering rules and workflow prompts synchronized

v2.0.1

Fix: Hook Cross-Project Compatibility

🔧 check_track.sh now derives project path from script location instead of $(pwd), fixing track detection failure when Claude Code runs hooks from non-root working directory
🔧 compact-recovery.sh now uses relative path derivation instead of hardcoded absolute paths, ensuring correct behavior when installed to any project
🔧 Removed redundant CLAUDE.md re-injection from compact-recovery (already auto-loaded by Claude Code)
🔧 install.py template synchronized with all hook fixes
🌐 All 7 language i18n compact-recovery hints updated

v2.0

Performance: ONNX INT8 Quantization

⚡ Embedding model auto-quantized from FP32 to INT8 on first load, model file from 448MB down to 113MB
⚡ MCP Server memory usage reduced from ~1.6GB to ~768MB (50%+ reduction)
⚡ Quantization is transparent to users — automatic on first use, cached for subsequent loads, falls back to FP32 on failure

New: Remember Password

🔐 Login page on both desktop and web dashboard now has a "Remember password" checkbox
🔐 When checked, credentials are saved to localStorage and auto-filled on next login; when unchecked, saved credentials are cleared
🔐 Checkbox is hidden in registration mode

Enhancement: Steering Rules

📝 IDENTITY & TONE section strengthened with more specific constraints (no pleasantries, no translating user messages, etc.)
📝 Self-testing requirements now distinguish between backend-only, MCP Server, and frontend-visible changes (Playwright required for frontend)
📝 Development rules now mandate self-testing after completing development
📝 All 7 language versions synchronized

v1.0.11

🐛 Desktop app version comparison switched to semantic versioning, fixing false upgrade prompts when local version is higher
🐛 Health check page field names aligned with backend, fixing consistency status always showing Mismatch
🔧 check_track.sh hook adds Python fallback, resolving silent hook failure when system sqlite3 is unavailable (#4)

v1.0.10

🖥️ Desktop app one-click install + upgrade detection
🖥️ Auto-detect Python and aivectormemory installation status on startup
🖥️ Show one-click install button when not installed, check PyPI and desktop new versions when installed
🐛 Installation detection switched to importlib.metadata.version() for accurate package version

v1.0.8

🔧 Fix PyPI package size anomaly (sdist from 32MB down to 230KB), excluded accidentally packaged dev files

v1.0.6

New: Native Desktop App

🖥️ Native desktop client supporting macOS (ARM64), Windows (x64), Linux (x64)
🖥️ Desktop app shares the same database as Web dashboard, fully feature-equivalent
🖥️ Dark/light theme switching, Glass frosted visual style
🖥️ Login auth, project selection, stats overview, memory management, issue tracking, task management, tag management, settings, data maintenance — full feature coverage
📦 Auto-published installers via GitHub Releases, download and use

New: CI/CD Auto Build

🔄 GitHub Actions auto-builds desktop installers for all 3 platforms
🔄 Push a tag to trigger the full compile, package, and release pipeline

Fixes

🐛 Windows platform compatibility fixes
🐛 sqlite-vec extension download URL fix

v1.0.5

Optimization: Token Usage Reduction

⚡ Steering rules changed from per-message dynamic injection to static loading, reducing repeated token consumption
⚡ Greatest impact for Claude Code users — ~2K fewer tokens per message

v1.0.4

New: Full-Stack i18n (7 Languages)

🌐 Web dashboard + desktop UI fully supports 7 languages: 简体中文 / 繁體中文 / English / Español / Deutsch / Français / 日本語
🌐 One-click language switch in settings page, takes effect immediately
🌐 MCP tool responses follow language setting, AI replies automatically use the corresponding language
🌐 Switching language auto-regenerates steering rules for all installed projects

New: Web Dashboard Settings Page

⚙️ Language switch, theme settings, system info display
⚙️ Database health check, repair, backup and other maintenance tools

v1.0.3

Optimization: Memory Search

🔍 recall search supports OR/AND tag matching modes, fixing missed results with multi-tag searches
🔍 Semantic search + tag filter defaults to OR matching (broader), tags-only browsing keeps AND matching (more precise)

📋 v0.2.x and earlier changelog

See CHANGELOG-archive.md

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.github		.github
aivectormemory		aivectormemory
desktop		desktop
docs		docs
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
light-memories-check.png		light-memories-check.png
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

AIVectorMemory

✨ Core Features

🏗️ Architecture

🚀 Quick Start

Option 1: pip install (Recommended)

Option 2: uvx (zero install)

Option 3: Manual configuration

🛠️ 8 MCP Tools

remember — Store a memory

recall — Semantic search

forget — Delete memories

status — Session state

track — Issue tracking

task — Task management

readme — README generation

auto_save — Auto save preferences

📊 Web Dashboard

⚡ Pairing with Steering Rules

🇨🇳 Users in China

📦 Tech Stack

📋 Changelog

v2.1.7

v2.1.6

v2.1.5

v2.1.4

v2.1.3

v2.1.2

v2.1.1

v2.1.0

v2.0.9

v2.0.8

v2.0.7

v2.0.6

v2.0.5

v2.0.4

v2.0.3

v2.0.2

v2.0.1

v2.0

v1.0.11

v1.0.10

v1.0.8

v1.0.6

v1.0.5

v1.0.4

v1.0.3

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 21

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`remember` — Store a memory

`recall` — Semantic search

`forget` — Delete memories

`status` — Session state

`track` — Issue tracking

`task` — Task management

`readme` — README generation

`auto_save` — Auto save preferences

Packages