Skip to content

docs: add FAQ covering algorithms, integrations, and deployment#36

Merged
Siddhant-K-code merged 7 commits intomainfrom
docs/add-faq
Feb 22, 2026
Merged

docs: add FAQ covering algorithms, integrations, and deployment#36
Siddhant-K-code merged 7 commits intomainfrom
docs/add-faq

Conversation

@Siddhant-K-code
Copy link
Copy Markdown
Owner

@Siddhant-K-code Siddhant-K-code commented Feb 22, 2026

Adds FAQ.md with answers to common questions, organized into five sections:

General — What Distill does, semantic vs exact dedup, why not use an LLM for compression

Algorithms — Why agglomerative clustering over K-Means, threshold meaning, MMR formula, time complexity, how rule-based compression works

Integration — Three LangChain integration paths (MCP, HTTP API, planned Python SDK), comparison with LangChain's built-in MMR, support for LlamaIndex/CrewAI/AutoGen, local model compatibility

Performance & Cost — Latency breakdown (~12ms), cost with/without pre-computed embeddings, scaling characteristics

Deployment — Self-hosting options, API key protection, observability stack (Prometheus, OTEL, Grafana)

Also adds:

  • FAQ section in README with 8 key questions as collapsible <details> toggles, linking to FAQ.md for the full list
  • FAQ link in the README Links section

Siddhant-K-code and others added 7 commits February 22, 2026 12:39
Covers common questions about:
- How the dedup pipeline works (clustering, MMR, compression)
- LangChain/LlamaIndex integration paths (MCP, HTTP API, planned SDK)
- Performance, cost, and scaling characteristics
- Deployment and observability options

Also links FAQ from README.

Co-authored-by: Ona <no-reply@ona.com>
8 key questions inline with <details> toggles, linking to
FAQ.md for the full list.

Co-authored-by: Ona <no-reply@ona.com>
GitHub renders all details content as underlined when strong is
nested inside summary.

Co-authored-by: Ona <no-reply@ona.com>
Co-authored-by: Ona <no-reply@ona.com>
Remove cost/latency and embeddings questions. Add cosine vs
Euclidean, time complexity, and context window questions.
Switch from HTML details/summary to plain markdown.

Co-authored-by: Ona <no-reply@ona.com>
Use <p> tags for answer content instead of markdown inside HTML
blocks. This prevents GitHub from rendering content as underlined.

Co-authored-by: Ona <no-reply@ona.com>
@Siddhant-K-code Siddhant-K-code merged commit e6f58f3 into main Feb 22, 2026
2 checks passed
@Siddhant-K-code Siddhant-K-code deleted the docs/add-faq branch February 22, 2026 13:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant