Karpathy LLM Wiki

approved

by Greener-Dalii

This plugin has not been manually reviewed by Obsidian staff. Karpathy's LLM Wiki implementation - multi-page knowledge generation with entity/concept pages and conversational query.

★ 13 stars↓ 944 downloadsUpdated 8d agoMIT

Install in Obsidian View on GitHub

llm_wiki_banner

🧠 Karpathy LLM Wiki Plugin for Obsidian

AI-powered structured knowledge base that ingests your notes and generates a connected Wiki — based on Andrej Karpathy's LLM Wiki concept.

Obsidian official score 94/100 | Native support for 8 languages | Actively maintained, continuously evolving

Version Build Status GitHub Stars

Official Site | Feedback & Discussion | 🤖 Explore Repo with DeepWiki

💡 What is LLM-Wiki?

You write. AI organizes. You ask. That's it.

🎯 The problem. Your notes are a goldmine — people, concepts, ideas, connections. But right now they're just files in folders. Finding what relates to what means searching, tagging, and hoping you remember the thread.

✨ The fix. Andrej Karpathy suggested something elegant: treat your notes as raw material, and let an LLM do the architect work. It reads what you write, pulls out entities and concepts, and weaves them into a structured Wiki — complete with [[bidirectional links]], an auto-generated index, and a chat interface that answers questions from your knowledge.

📚 So you don't have to be the librarian. No deciding what deserves a page. No maintaining cross-links. No wondering if something is out of date. Drop notes into sources/ and the LLM reads, extracts, writes, links, and even flags contradictions — while you stay in flow.

🤖 And it's not another chatbot. ChatGPT knows the internet. LLM-Wiki knows you — or rather, what you've taught it. Every answer carries [[wiki-links]] back into your knowledge graph. Every response is a trailhead, not a dead end.

⚡ Why Obsidian + LLM-Wiki?

Obsidian is brilliant at linked thinking. But there's a catch: you're the one doing all the linking.

LLM-Wiki flips that. Instead of you building the graph by hand, the AI grows it with you. Add a note about a new concept — it finds the connections you'd miss. Ask a question — it walks your own knowledge graph and brings back answers with citations.

🔗 Your Graph View comes alive. New notes don't just sit there — they sprout links to entities, concepts, and sources. The graph grows organically, and the plugin maintains it: detecting duplicates, fixing dead links, bridging languages with aliases.
💬 Your notes learn to talk back. Search becomes conversation. "What did I write about X?" becomes a dialogue, with streaming responses and [[wiki-links]] as breadcrumbs. Every answer is a path deeper into your own knowledge.
🧠 Obsidian becomes a thinking partner. It stops being a cabinet for notes and starts being something that helps you think — surfacing hidden connections, flagging contradictions, remembering what you forgot you knew.

🚀 Quick Start

📦 Installation

🌟 Recommended — Obsidian Community Plugin Market:

In Obsidian, go to Settings → Community plugins
Click Browse and search for "Karpathy LLM Wiki"
Click Install, then Enable

🌐 Or from the Community Plugin website — visit community.obsidian.md/plugins/karpathywiki and click Add to Obsidian to install directly.

⚙️ Manual (alternative):

Download main.js, manifest.json, styles.css from Releases
In Obsidian, go to Settings → Community plugins. On the Installed plugins tab, click the folder icon to open your plugins directory
Create a folder named karpathywiki, drop the three files inside
Back in Obsidian, click the refresh icon — Karpathy LLM Wiki will appear under Installed plugins
Toggle it on to enable

🔨 Development: git clone, pnpm install, pnpm build.

🔄 Updating

This project evolves rapidly — new features, bug fixes, and improvements are shipped frequently. We recommend keeping up to date:

Option A — Manual update (recommended):

Go to Settings → Community plugins
Click Check for updates
Find Karpathy LLM Wiki in the list and click Update

Option B — Enable auto-update:

Go to Settings → Community plugins
Toggle on Automatically check for plugins updates
New versions will be detected automatically; update manually at your convenience

💡 Why stay updated? Each release may include new features, performance improvements, and important bug fixes. We actively maintain this plugin — missing updates means missing out on a better experience.

🔑 Configure an LLM Provider

Open Settings → Karpathy LLM Wiki
Pick a provider from the dropdown (Anthropic, Anthropic Compatible, Google Gemini, OpenAI, DeepSeek, Kimi, GLM, Ollama, OpenRouter, or custom)
Enter your API key (not needed for Ollama)
Click Fetch Models to populate the model dropdown, or type a model name manually
Click Test Connection, then Save Settings

🦙 Ollama (local, no API key): Install Ollama, pull a model (ollama pull gemma4), select "Ollama (Local)" in the provider dropdown.

See README_CN.md for provider-specific instructions in Chinese.

🎮 Usage

Method	How
📥 Ingest single source	`Cmd+P` → "Ingest single source" — select a note to extract entities and concepts into Wiki pages
📂 Ingest from folder	`Cmd+P` → "Ingest from folder" — pick a folder, batch generate Wiki from all notes inside
🔍 Query wiki	`Cmd+P` → "Query wiki" — ask questions, get streaming answers with `[[wiki-links]]`
🛠️ Lint wiki	`Cmd+P` → "Lint wiki" — health scan: duplicates, dead links, empty pages, orphans, missing aliases
📋 Regenerate index	`Cmd+P` → "Regenerate index" — rebuild `wiki/index.md` with current pages and aliases
💡 Suggest schema updates	`Cmd+P` → "Suggest schema updates" — LLM analyzes Wiki and proposes schema improvements

Re-ingesting the same source does incremental updates on entity/concept pages (new info merged in). Summary pages are regenerated.

💡 Smart Batch Skip: When ingesting a folder, the plugin automatically detects already-processed files and skips them to save time and API costs. The batch report shows skipped count.

⚠️ Upgrading from an Older Version?

If you're upgrading from a version before v1.7.11 (or much earlier), your existing Wiki pages were generated without several capabilities added over many releases. Follow these steps after upgrading to bring your Wiki up to date:

1️⃣ Rebuild your index Cmd+P → "Regenerate index" — This rebuilds wiki/index.md with alias entries for every page, enabling alias-aware search (e.g., searching "DSA" finds "DeepSeek-Sparse-Attention"). The old index format only listed page titles.

2️⃣ Run Lint wiki Cmd+P → "Lint wiki" — This scans your entire Wiki and shows:

🏷️ Missing aliases: Pages without aliases (all pre-v1.7.11 pages). Click "Complete Aliases" — the LLM generates translations, acronyms, and alternate names in bulk. This is critical for duplicate detection.
🔄 Duplicate pages: Pages with overlapping content (e.g., "CoT" vs "思维链" created by older versions that didn't have alias-aware dedup). Click "Merge Duplicates" to fuse them and preserve all aliases.
💀 Dead links / Empty pages / Orphans: Standard wiki maintenance issues.

3️⃣ Use Smart Fix All Click "Smart Fix All" in the Lint report for a one-click, causality-ordered repair: aliases completed → duplicates merged → dead links fixed → orphans linked → empty pages expanded. This is the fastest way to clean up a wiki built across many versions.

4️⃣ Enable parallel page generation Settings → Ingestion Acceleration:

⚡ Page Generation Concurrency: Set to 3 for most providers (was 1/serial by default before v1.7.3). Speeds up ingestion 2–3× on sources with 10+ entities.
⏱️ Batch Delay: Start at 300ms. Increase to 500–800ms if you hit rate limits.

5️⃣ Review new settings (added since v1.4.0–v1.7.x):

🌐 Wiki Output Language (v1.6.5): Independent from UI language — your Wiki can be in Chinese while the plugin UI stays in English, or vice versa.
📊 Extraction Granularity (v1.6.2, expanded in v1.10.0): Five options control how deeply the LLM extracts entities from sources:
- Fine (~100 items) — Deep analysis, edge-case mentions included. High token cost, best for key sources.
- Standard (~50 items) — Balanced extraction. Good default for daily notes.
- Coarse (~10 items) — Quick overview, core entities only. Low cost, fast ingestion.
- Minimal (~5 items) — Essential items only. Ideal for batch processing 100+ files or testing new sources.
- Custom (1–300 items) — User-defined entity/concept limits for specialized workflows.
💡 Recommendation: Use Minimal or Coarse for large folders to save time and API costs. Use Fine selectively on key documents that warrant deep analysis.
🔄 Auto-Maintenance (v1.4.0): Optional file watcher, periodic Lint, and startup health check. All default OFF — enable only if you want automatic background processing.

🛡️ Safety: Parallel generation uses Promise.allSettled — if one page fails, others continue. Failed pages are retried individually with exponential backoff. Smart Batch Skip (v1.7.7) automatically detects already-ingested files to save time and API costs.

✨ Features

📊 Knowledge Quality

🔍 Entity/Concept Extraction — LLM extracts entities (people, orgs, products, events) and concepts (theories, methods, terms) from your notes with flexible extraction granularity (Minimal~~5 items, Coarse~~10, Standard~~50, Fine~~100, Custom 1–300) to balance analysis depth vs. API cost
🏷️ Mandatory Page Aliases — Every generated page includes at least 1 alias (translation, acronym, alternate name), enabling cross-language duplicate detection
🔄 Duplicate Detection & Merge — Semantic tiering catches true duplicates (cross-language translations, abbreviations, spelling variants); intelligent LLM merge fuses content and preserves aliases
🧩 Smart Knowledge Fusion — Multi-source updates merge new info without redundancy, contradictions preserved with attribution, reviewed: true pages protected from overwrite
📏 Content Truncation Protection — 8000 max_tokens with automatic stop_reason detection and retry at 2× tokens across all providers
📝 Verbatim Source Mentions — Original language quotes preserved with optional translation for traceability

🛠️ Maintenance

🔍 Lint Health Scan — Detects duplicates, dead links, empty pages, orphans, missing aliases, and contradictions in one comprehensive report
🎯 Semantic-Tier Duplicate Detection — Tier 1 (direct name matches: cross-language, abbreviations, high-similarity titles) always verified; Tier 2 (indirect signals: shared links, moderate similarity) fills token budget
⚡ Smart Fix All — Causality-ordered batch fix: duplicates merged → dead links resolved → orphans linked → empty pages expanded
🏷️ Alias Completion — One-click parallel batch generation of missing aliases, improving future duplicate detection
🔄 Auto-Maintenance — Multi-folder file watcher, periodic lint, startup health check (all optional)
⚠️ Contradiction State Machine — detected → review_ok → resolved (AI fix) or detected → pending_fix (manual)

💬 Query & Feedback

🤖 Conversational Query — ChatGPT-style dialog with streaming Markdown and [[wiki-links]], multi-turn history
📤 Query-to-Wiki Feedback — Save valuable conversations to Wiki with entity/concept extraction, semantic dedup before save
🔒 Duplicate Save Prevention — Hash tracking prevents re-evaluation of unchanged conversations

🌐 LLM & Language

🔌 Multi-Provider — Anthropic, Anthropic Compatible (Coding Plan), Gemini, OpenAI, DeepSeek, Kimi, GLM, OpenRouter, Ollama, custom endpoints
🔄 5xx Retry — Automatic exponential backoff retry (max 2) on HTTP 5xx/429 errors across all clients
📋 Dynamic Model List — Real-time fetching from provider APIs
🌐 Wiki Output Language — 8 languages independent of UI (EN/ZH/JA/KO/DE/FR/ES/PT), with custom input
🌍 Full UI Internationalization — Plugin UI supports 8 languages (EN/ZH/JA/KO/DE/FR/ES/PT), 269+ UI fields fully translated with natural local expressions
⚡ Rate Limit Guardian — When parallel generation triggers rate limits, auto-detects and suggests: lower concurrency, increase batch delay, switch provider
🦙 Web Clipper Compatible — One-click add Obsidian Web Clipper's Clippings/ folder to watch list, auto-ingest web clips into Wiki

🏗️ Architecture & Performance

⚡ Parallel Page Generation — Configurable 1–5 concurrent pages, default 3 (parallel), 2–3× faster for large sources, error isolation per page
📚 Iterative Batch Extraction — Adaptive batch sizing eliminates max_tokens bottleneck for long documents
🏛️ Three-Layer Architecture — sources/ (read-only) → wiki/ (LLM-generated) → schema/ (co-evolved config)
🧩 Modular Codebase — 13 focused modules in src/

⌨️ Commands

Command	Description
📥 Ingest single source	Select a note → generate Wiki pages with entities, concepts, and summary
📂 Ingest from folder	Select a folder → batch generate Wiki from existing notes
🔍 Query wiki	Conversational Q&A over your Wiki, streaming responses with `[[wiki-links]]`
🛠️ Lint wiki	Full health scan: duplicates, dead links, empty pages, orphans, missing aliases, contradictions
📋 Regenerate index	Manually rebuild `wiki/index.md`
💡 Suggest schema updates	LLM analyzes Wiki and proposes schema improvements

📖 Example

Input: sources/machine-learning.md

# Machine Learning
Machine learning uses algorithms to learn from data.

## Types
- Supervised learning
- Unsupervised learning
- Reinforcement learning

Output — Entity page: wiki/entities/supervised-learning.md

---
type: entity
created: 2026-05-15
updated: 2026-05-15
sources: ["[[sources/machine-learning]]"]
tags: [method]
aliases: ["监督学习", "Supervised Learning"]
---

# Supervised Learning

## Basic Information
- Type: method
- Source: [[sources/machine-learning]]

## Description
Supervised learning is a machine learning paradigm where models learn
from labeled training data to make predictions on unseen data...

## Related Concepts
- [[concepts/Machine-Learning|Machine Learning]]
- [[concepts/Unsupervised-Learning|Unsupervised Learning]]

## Related Entities
- [[entities/Arthur-Samuel|Arthur Samuel]]

## Mentions in Source
- "Supervised learning uses labeled data to train predictive models..."

🤖 Model Selection Guide

This plugin follows Karpathy's philosophy: feed the LLM full Wiki context, not chunked RAG retrieval. Long-context models are strongly recommended — the larger your Wiki grows, the more context the LLM needs.

💡 Why not RAG? Karpathy's original critique argues that RAG fragments knowledge and breaks the LLM's ability to reason across the full knowledge graph.

💰 Value-First Strategy: You don't need flagship models. The following cost-effective alternatives deliver excellent results at lower prices:

Tier	Model	Context	Why
🌟 Value Pick	DeepSeek V4-Flash	1M tokens	Lowest cost ($0.14/M), 284B MoE, ideal for batch ingestion
🌟 Value Pick	Gemini-3.5-Flash	1M tokens	4× faster output than GPT-5.5, great for agent tasks
🌟 Value Pick	Qwen3.6-Plus	1M tokens	Strong coding & agentic capabilities, competitive pricing
🌟 Value Pick	Grok-4	2M tokens	2M context window, excellent for very large wikis
Balanced	Claude Sonnet 4.6	1M tokens	Great quality/cost balance, $3/$15 per million tokens
Lightweight	Claude Haiku 4.5	200K tokens	Fast and affordable for smaller wikis
Budget	MiMo-V2.5-Flash	1M tokens	Xiaomi's cost-effective option, 309B MoE architecture
Flagship	Claude Opus 4.7	1M tokens	Ultimate quality, higher cost — use selectively
Flagship	GPT-5.5	1M tokens	Top reasoning, higher cost — use selectively

For local models (Ollama): context windows are typically smaller (8K–128K). Consider using a cloud provider for ingestion + local model for query.

🔌 Anthropic Compatible (Coding Plan): If your provider offers an Anthropic-compatible API endpoint, select "Anthropic Compatible" and enter your provider's Base URL and API Key.

💡 Subscription plans: Coding Plan, OpenAI Pro, or Anthropic Pro plans are excellent options for cost control with frequent use. This plugin supports these services.

🏗️ Architecture

Karpathy's three-layer separation design:

sources/     # 📄 Your source documents (read-only)
  ↓ ingest
wiki/        # 🧠 LLM-generated Wiki pages
  ↓ query / maintain
schema/      # 📋 Wiki structure configuration (naming, templates, categories)

Codebase (src/):

wiki/               # Wiki engine modules
  wiki-engine.ts    # 🎯 Orchestrator
  query-engine.ts   # 💬 Conversational query
  source-analyzer.ts # 📊 Iterative batch extraction
  page-factory.ts   # 🏗️ Entity/concept CRUD + merge
  lint-controller.ts # 🔍 Lint orchestration
  lint-fixes.ts     # 🛠️ Fix logic for dead links, empty pages, orphans
  lint/             # Lint sub-modules
    duplicate-detection.ts  # 🔄 Programmatic candidate generation
    fix-runners.ts          # ⚡ Batch fix execution helpers
  contradictions.ts # ⚠️ Contradiction detection
  system-prompts.ts # 🗣️ Language directive + section labels
schema/             # Schema co-evolution
  schema-manager.ts # 📋 Schema CRUD + suggestions
  auto-maintain.ts  # 🔄 File watcher + periodic lint
ui/                 # User interface
  settings.ts       # ⚙️ Settings panel
  modals.ts         # 📦 Lint/Ingest/Query modals
+ shared modules: llm-client.ts, prompts.ts, texts.ts, utils.ts, types.ts

Generated pages:

wiki/sources/filename.md — 📄 Source summary
wiki/entities/entity-name.md — 👤 Entity pages (people, orgs, projects, etc.)
wiki/concepts/concept-name.md — 💡 Concept pages (theories, methods, terms, etc.)
wiki/index.md — 📑 Auto-generated index
wiki/log.md — 📝 Operation log

❓ FAQ

Keep your plugin updated. This project ships frequently — new features and fixes land every few days. Run Settings → Community Plugins → Check for updates regularly.

For more, see the FAQ Discussion on GitHub.

💡 General

What does the plugin actually do? You drop notes in, it extracts people, concepts, and theories, then generates an interlinked wiki with [[bidirectional links]]. Ask questions and get answers grounded in your notes — not internet hallucinations.

Minimum requirements? Obsidian v1.6.6+, desktop (Windows/macOS/Linux), an LLM provider API key. Ollama works locally with no API key. See Configure an LLM Provider above.

Which model should I use? See Model Recommendations above. Long-context models work best — the larger your wiki, the more context the LLM needs.

🏷️ Aliases & Duplicates

Why does Lint show "missing aliases" on almost all my pages? Pages generated before v1.7.11 didn't include aliases. This is harmless — aliases are an enhancement, not a bug. Click Complete Aliases in the Lint report to batch-generate translations, acronyms, and alternate names. Once aliases exist, duplicate detection and alias-aware search become much more effective.

Why do I see duplicate pages like "CoT" and "思维链"? Pre-v1.7.10 versions lacked alias-aware duplicate detection. Run Lint Wiki → Merge Duplicates to fuse them. The merged page preserves aliases from both, preventing future duplicates.

How does duplicate detection work? (v1.7.10+) Two-tier semantic detection: Tier 1 (always LLM-verified) catches cross-language matches, abbreviations, high-similarity titles. Tier 2 fills remaining token budget with moderate-similarity candidates. Aliases are critical for Tier 1 — run Complete Aliases if your pages are pre-v1.7.11.

What are "polluted pages"? (v1.9.0) Pages with folder prefixes accidentally baked into filenames — e.g. concepts/concepts布局优化.md. Run Lint Wiki → 🧹 Fix Polluted Pages to rename and update all incoming links.

⚡ Performance & Cost

How do I speed up ingestion? In Settings → Ingestion Acceleration: increase Page Generation Concurrency to 3–5 (parallel page creation), lower Batch Delay to 100–300ms (watch for rate limits). Choose "Minimal", "Coarse", or "Standard" Extraction Granularity to reduce page count and save API costs.

Why am I getting HTTP 429 errors? The plugin auto-detects rate-limiting and suggests: lower concurrency to 1–2, increase Batch Delay to 500–800ms, or switch to a higher-limit provider.

How do I control API costs?

Auto-Maintenance is OFF by default (enable only if you need background processing)
Smart Batch Skip automatically skips already-ingested files
"Standard" or "Coarse" granularity = fewer LLM calls
Batch Delay > 500ms spaces calls without increasing token usage
Lint report shows counts before you run fixes — decide what's worth it

🧹 Maintenance

What does Smart Fix All do? Runs fixes in causality order (v1.9.0+):

🧹 Fix polluted pages → 2. 🏷️ Complete aliases → 3. 🔄 Merge duplicates → 4. 🔗 Fix dead links → 5. 🔗 Link orphans → 6. 📝 Expand empty pages

Lint freezes on a large Wiki? Upgrade to v1.7.17+ — Lint now yields to Obsidian's UI thread every 50 pages, preventing multi-second freezes even on 1200+ page wikis.

🔍 Troubleshooting

Query can't find pages I know exist? Three common causes: (1) Index is stale → Regenerate index. (2) Missing aliases → Complete Aliases. (3) Try different phrasing — LLM does semantic matching, not keyword search.

Can I manually edit Wiki pages? Yes. Set reviewed: true in frontmatter to protect from overwrite. Manual aliases, tags, and sources are preserved during merges.

Safe upgrade? The plugin never modifies your source files. Backup wiki/ → update plugin → Regenerate index → Lint Wiki → fix selectively.

How do I get help?

GitHub Issues — bug reports
GitHub Discussions — questions & feedback

📜 License

MIT License — see LICENSE.

🙏 Acknowledgments

💡 Concept: Andrej Karpathy's LLM Wiki — the original vision that inspired this plugin
🛠️ Platform: Obsidian Plugin API
🔌 LLM SDKs: Anthropic SDK, OpenAI SDK

For plugin developers

Search results and similarity scores are powered by semantic analysis of your plugin's README. If your plugin isn't appearing for searches you'd expect, try updating your README to clearly describe your plugin's purpose, features, and use cases.