New Features:
- Observability with Langfuse
• Integrated Langfuse tracing to track LLM calls, tool usage, and graph execution across the full pipeline.
• Added a dedicated observability.ipynb notebook with a step-by-step guide to setting up and using Langfuse with the system.
- Streaming Response & Agent Transparency
• Responses now stream token by token directly in the Gradio interface.
• Agent reasoning is now visible in real time: tool calls (search_child_chunks, retrieve_parent_chunks) and their results are shown as collapsible messages during execution.
• System nodes (query rewriting, history summarization) surface their output progressively, including clarification requests when the query is unclear.
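
For readers unfamiliar with how token-by-token streaming usually reaches a Gradio UI: Gradio treats a generator function as a streaming handler and redraws the output with each yielded value. The sketch below shows only that accumulate-and-yield pattern. The function name stream_response and the token source are hypothetical; the project's actual handler is not shown in these notes.

```python
from typing import Iterable, Iterator


def stream_response(tokens: Iterable[str]) -> Iterator[str]:
    """Yield the growing partial response after each token arrives.

    Hypothetical helper: `tokens` stands in for whatever the LLM client
    streams back. A UI that redraws on every yield shows the text
    appearing incrementally.
    """
    partial = ""
    for token in tokens:
        partial += token
        # Yield the full text so far, not just the new token, so the
        # consumer can simply replace what it is displaying.
        yield partial


if __name__ == "__main__":
    for text in stream_response(["Hel", "lo", "!"]):
        print(text)
```

Yielding the accumulated text rather than the individual token keeps the consumer stateless: it only has to display the latest value.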
Bug Fixes:
- Docker Volume Permission Fix
• Fixed a PermissionError when clicking "Clear All" with Docker-mounted volumes.
• Directories are now cleared by removing their contents instead of deleting the root folder, avoiding permission issues with mount points.
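
The approach described in this fix can be sketched as follows. This is a minimal illustration, not the project's actual code: the function name clear_directory_contents is an assumption. The idea is that a Docker mount point generally cannot be removed from inside the container, so the root is left in place and only its children are deleted.

```python
import shutil
from pathlib import Path


def clear_directory_contents(root: Path) -> None:
    """Delete everything inside `root` while keeping `root` itself.

    Hypothetical helper: calling shutil.rmtree(root) directly would also
    try to remove the directory itself, which can fail when `root` is a
    Docker volume mount point.
    """
    # Make sure the directory exists so the call is safe on a fresh setup.
    root.mkdir(parents=True, exist_ok=True)
    for entry in root.iterdir():
        if entry.is_dir() and not entry.is_symlink():
            # Real subdirectory: remove it and everything beneath it.
            shutil.rmtree(entry)
        else:
            # Regular file or symlink: remove just the entry.
            entry.unlink()
```

After the call, the directory still exists and is empty, so later writes to the mounted volume continue to work without recreating it.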