pi-tree

AI makes you productive where you already understand. It confuses you where you don't.

Ask an expert a smart question and AI will give them a brilliant answer. Ask a beginner the same question and they'll get a confident-sounding paragraph they can't evaluate. The gap between "I get this" and "I'm lost" doesn't shrink with better models — it widens. Pi-tree works on the boundary: load your books, news feeds, or research papers, and an AI reads them with you — not as flat Q&A, but as branching conversations that help you cross from confusion into comprehension. Go deep on a concept, branch into a tangent, zoom back out. Your reading path is a navigable tree, not a disposable chat log.

Local-first, bring your own key. Runs entirely on your machine. No cloud account, no subscription. Works with cloud APIs (DeepSeek, Gemini, Claude) or fully offline with Ollama / local models.

_{📸 See all features · 📖 Documentation · Vision · Contributing}

Why Pi-tree?

Most AI tools help you skip past material — paste the text, get the summary, move on. That works when you already understand the domain. When you don't, skipping is exactly the problem. Pi-tree treats reading as a process worth having — one that expands what you're capable of understanding.

	Pi-tree	ChatGPT / Claude	NotebookLM	Obsidian + AI
Focus	Comprehension & exploration	General-purpose Q&A	Document Q&A	Note-taking
Conversations	🌳 Tree — branch, explore, return	Linear chat	Linear chat	Linear chat
AI approach	Agentic — tools & skills over local data	Prompt + context window	RAG over uploads	Plugins over local vault
Sources	Books, papers, news feeds, YouTube	File uploads, web	Multi-doc notebooks	Markdown vault
Extensibility	Skills, plugins, MCP bridge	GPTs (cloud-hosted)	None	Community plugins
Model choice	BYOK — any provider or local	Vendor-locked	Google only	Plugin-dependent
Data	Local-first, self-hosted	Cloud	Cloud	Local

What a session looks like

📖 Reading: Thinking, Fast and Slow (Kahneman)

Root
├── What is System 1 vs System 2?
│   ├── How does this relate to cognitive biases?
│   │   └── Anchoring bias deep-dive
│   └── Real-world examples in decision making
├── Chapter 3: The Lazy Controller
│   └── Why do we avoid effortful thinking?
└── Comparison with Nassim Taleb's ideas
    ├── Black Swan connection
    └── Antifragility and heuristics

Each node is a conversation branch with full context. Go deep on any concept, then navigate back to explore something else — no context lost.

Why trees work better for LLMs

The tree structure isn't just a UX choice — it makes the AI better.

In a linear chat, every message you've ever sent is packed into the context window. After 30 turns spanning three different topics, the model is trying to track everything at once — and starts hallucinating, losing the thread, or ignoring your latest question in favor of something from 20 messages ago.

Trees fix this at the architecture level:

Focused context — Each branch carries only its path from root to current node. When you're exploring cognitive biases, the model doesn't see your earlier tangent about Nassim Taleb. Less noise → more accurate responses.
Token savings — A 50-message linear chat sends all 50 messages every turn. A tree with 5 branches of 10 messages sends only ~10. Fewer tokens per request → lower cost, faster responses.
Less hallucination — Context pollution is a primary cause of hallucination in long conversations. Isolated branches mean the model stays grounded in the relevant thread.
Longer effective conversations — Linear chats degrade in quality well before hitting the context window limit. Trees keep each branch short and focused, so you can explore a source across hundreds of messages without quality loss.

What You Can Read

Pi-tree supports four source types, each handled by a dedicated plugin:

📚 Books — Upload EPUB, MOBI, PDF, or Markdown. The AI guides you chapter by chapter with reading skills, structural analysis, and branching discussions. Multiple session modes: guided reading, freeform Q&A, or deep analysis.

📰 News Feeds — Add RSS/Atom feeds. Pi-tree crawls and deduplicates articles, then lets you scan trends, deep-dive into stories, and discuss the news with AI. Comes with its own dashboard and feed management.

📄 Research Papers — Search arXiv directly from the chat. Fetch papers, read them with AI-provided context, and branch into methodology questions or related work.

🎥 YouTube Videos — Paste a link. Pi-tree extracts the transcript and video metadata, then lets you discuss the content — quote specific segments, ask follow-ups, compare with other sources. Includes an embedded video player.

Important

Users are responsible for ensuring they have the right to use any content loaded into pi-tree. This project does not distribute, host, or provide access to any copyrighted material.

Who Is This For?

📚 Nonfiction readers — you're reading a dense chapter and AI summaries skip the part you actually don't understand. Pi-tree stays in that gap with you until you do.
🎓 Researchers & students — you're outside your subfield and every paper assumes background you lack. Branch into what you don't know, then return to the argument.
📰 News followers — you read the headline but can't evaluate the claim. Turn feeds into conversations where you build context over time, not scroll past it.
🔧 Developers — you're in an unfamiliar codebase or domain. Build custom plugins to explore anything conversationally.

Getting Started

Docker (recommended)

cp .env.example .env   # edit with your API key

docker run -d --name pi-tree \
  --env-file .env \
  -p 3847:3847 \
  -v ~/.local/share/pi-tree:/data \
  ghcr.io/shuowu/pi-tree:latest

Open http://localhost:3847 (serves both frontend and API).

Tip

Full setup options → Self-hosting guide

From Source

cp .env.example .env   # edit with your API key and provider
npm install
npm run dev

Dev server runs on :3947, client on :5947. Open http://localhost:5947.

Desktop App ⚠️ Experimental

Download from the Releases page — available for macOS, Linux, and Windows. No Node.js, no Docker, no terminal needed.

Open the app, enter an API key (or point to a local Ollama server), and start reading.

Models

Pi-tree doesn't need frontier-class models — reading and comprehension are more about context and conversation than raw reasoning. Smaller, faster models work well and keep costs low (or free with local inference).

Provider	Model	Notes
DeepSeek	`deepseek-v4-flash`	Very cheap, strong reading comprehension
Google	`gemini-2.5-flash`	Fast, large context window
Anthropic	`claude-haiku-4-20250514`	Fast, great quality-to-cost ratio
Zhipu	`glm-5-turbo`	Good Chinese + English bilingual support

Local models — completely offline, no API costs. Use Ollama or LM Studio. Gemma 4 (12B) and Qwen 3.6 are good starting points.

Tip

Multi-provider setup, runtime switching, compatibility flags → Models & Providers

How It Works

Built on the Pi SDK — a minimalist AI agent framework with tree-structured conversations.

Plugin architecture

Each source type ships as a self-contained plugin — an independent package with its own tools, skills, session profiles, and (optionally) HTTP routes and UI components. The server discovers plugins at startup and wires their capabilities into the right sessions.

Plugin	Source Type	What it provides
`plugin-book`	Books	EPUB/MOBI/PDF parsing, guided reading, analysis, outline generation
`plugin-news`	News	RSS crawling, feed management, news discussion (own database)
`plugin-paper`	Papers	arXiv search, paper fetching, research reading
`plugin-youtube`	YouTube	Transcript extraction, video info, embedded player
`plugin-mcp`	—	Bridges external MCP servers (web search, etc.)

Plugins depend only on @pi-tree/plugin-sdk — they can't access server internals. Each plugin declares a manifest in package.json that tells the server what source type it handles, which tools and skills it provides, and what routes to mount.

Three levels of customization

You don't need to build a plugin to extend pi-tree. There are three tiers, from zero-code to full package:

Custom skills (no code) — Drop a SKILL.md into $DATA_PATH/skills/ to change how the AI reads, discusses, or analyzes any source type. Override built-in skills by name, or add new ones.
Custom profiles + extensions (no plugin) — Add a YAML profile and a tool extension to $DATA_PATH/ to create an entirely new source type. The self-hosting guide walks through a complete example.
Full plugin package — For source types that need parsers, databases, HTTP routes, or UI panels. See the plugin guide.

Other key choices

Session profiles — declarative YAML files map each (sourceType, mode) to specific skills and tools. Audit them, override them, create your own
Data separation — Pi SDK owns conversation content (JSONL files); pi-tree owns metadata (SQLite: users, sessions, config, glossary)
Multi-user — each user gets isolated sessions, config, and glossary per source

Tip

Architecture deep dive, custom skills, plugin development → Documentation

Security & Privacy

Pi-tree is local-first — no cloud accounts, no telemetry, no phone-home. API keys are stored on your filesystem and sent only to your chosen provider.

🛡️ Session-scoped permissions — Each session type declares exactly which tools the agent can use. A book reading session gets 5-8 purpose-built tools. No shell. No file editing. No database writes.
📝 Declarative profiles — Capabilities are configured in YAML. exclude_tools: [bash, edit] is the default for all user-facing sessions. Audit them, override them, create your own.
📡 Fully offline — Pair with Ollama for air-gapped operation. No internet required.
📖 Open source — AGPL-3.0. Audit the code, fork it, self-host it.

Pi-tree's agent is a reading companion, not a general-purpose agent. The permission model reflects that — minimal surface area, scoped by purpose, auditable by design.

Design Philosophy

License

This project is licensed under the GNU Affero General Public License v3.0.

Name		Name	Last commit message	Last commit date
Latest commit History 306 Commits
.agents/skills		.agents/skills
.github		.github
.husky		.husky
config		config
docs		docs
e2e		e2e
examples/github-explorer		examples/github-explorer
packages		packages
scripts		scripts
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.nvmrc		.nvmrc
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
package-lock.json		package-lock.json
package.json		package.json
playwright.config.ts		playwright.config.ts
tsconfig.base.json		tsconfig.base.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pi-tree

Why Pi-tree?

What a session looks like

Why trees work better for LLMs

What You Can Read

Who Is This For?

Getting Started

Docker (recommended)

From Source

Desktop App ⚠️ Experimental

Models

How It Works

Plugin architecture

Three levels of customization

Other key choices

Security & Privacy

Design Philosophy

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

pi-tree

Why Pi-tree?

What a session looks like

Why trees work better for LLMs

What You Can Read

Who Is This For?

Getting Started

Docker (recommended)

From Source

Desktop App ⚠️ Experimental

Models

How It Works

Plugin architecture

Three levels of customization

Other key choices

Security & Privacy

Design Philosophy

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages