Quickstart
What runs today: clone, install, capture an X tweet (and a few other sources), then recall them by meaning.
1. Install
git clone https://github.com/KhiipAI/khiip.git ~/projects/khiipcd ~/projects/khiippip install -e ".[dev]"2. Start the daemon
# First run downloads the bundled MiniLM-L6 ONNX model (~80MB, one time)khiipd serve &The daemon listens on 127.0.0.1:8478 by default and reads/writes
~/.config/khiip/, ~/.local/share/khiip/, and ~/khiip-vault/.
3. Capture across multiple sources
Different extractors, same vault:
khiipd capture https://x.com/karpathy/status/2040470801506541998 # X: Karpathy's "LLM knowledge base" idea-filekhiipd capture https://simonwillison.net/2025/Dec/31/the-year-in-llms/ # web: Simon Willison — the year in LLMskhiipd capture https://en.wikipedia.org/wiki/Quipu # wiki: the Inca knot-record system Khiip is named forkhiipd capture https://www.youtube.com/watch?v=-TAowrw97-4 # YouTube: Tiago Forte — the AI Second Brain4. Recall by meaning
khiipd recall "Andrej Karpathy's LLM knowledge wiki"khiipd recall "building a second brain with AI"khiipd recall "Inca knot record-keeping system"Recall ranks by cosine similarity over the bundled MiniLM-L6 embeddings — zero LLM cost, works offline after the first model fetch.
5. Inspect and maintain
# Check vault <-> SQLite consistency (read-only)khiipd validate
# Refetch a capture along independent dimensionskhiipd refetch 01JX9... # re-extract (creates a new superseding capture)khiipd refetch 01JX9... --media # re-walk the media-fetcher registry in placekhiipd refetch 01JX9... --wayback # re-submit to the Wayback Machine in placeWhat you end up with
- A SQLite index at
~/.local/share/khiip/index.db - Markdown captures with YAML frontmatter under
~/khiip-vault/captures/<source>/ - Raw Source-tier bytes preserved under your configured
data_root - Semantic recall over everything you’ve captured
Next steps
- Reddit captures work credential-free out of the box (old.reddit HTML); an optional Reddit app upgrades comment depth + rate limit — see Installation → per-source credentials.
- Connect an agent over the MCP server or the REST API.