refactor(rag): update rag params #765

ch-liuzhide · 2025-03-04T10:41:34Z

No description provided.

vercel · 2025-03-04T10:41:38Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
petercat	🔄 Building (Inspect)	Visit Preview	💬 Add feedback	Mar 4, 2025 10:41am

petercat-assistant · 2025-03-04T10:41:52Z

Walkthrough

This pull request refactors the RAG (Retrieval-Augmented Generation) parameters by adjusting the similarity threshold and modifying the chunk size and overlap in the configuration. These changes aim to optimize the retrieval and processing of data.

Changes

File	Summary
server/agent/tools/knowledge.py	Reduced the similarity threshold from 0.65 to 0.6.
server/rag/router.py	Increased chunk size from 500 to 1000 and chunk overlap from 100 to 200.

petercat-assistant · 2025-03-04T10:41:54Z

server/agent/tools/knowledge.py

@@ -32,7 +32,7 @@ async def search_knowledge(
                    space_id_list=[bot_id, repo_name],
                    question=query,
                    embedding_model_name=EmbeddingModelEnum.OPENAI,
-                    similarity_threshold=0.65,
+                    similarity_threshold=0.6,


Lowering the similarity threshold from 0.65 to 0.6 may increase the number of results returned, potentially including less relevant matches. Ensure this change aligns with the intended retrieval quality.

petercat-assistant · 2025-03-04T10:41:57Z

server/rag/router.py

@@ -60,8 +60,8 @@ async def reload_repo(
                        repo_name=request.repo_name, auth_token=user.access_token
                    ),
                    split_config=KnowledgeSplitConfig(
-                        chunk_size=500,
-                        chunk_overlap=100,
+                        chunk_size=1000,


Increasing the chunk size and overlap may affect memory usage and processing time. Ensure that the system can handle these changes without performance degradation.

codecov · 2025-03-04T10:43:18Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Files with missing lines	Coverage Δ
server/agent/tools/knowledge.py	`92.00% <ø> (ø)`
server/rag/router.py	`43.85% <ø> (ø)`

refactor(rag): update rag params

33d7806

ch-liuzhide temporarily deployed to Preview March 4, 2025 10:41 — with GitHub Actions Inactive

ch-liuzhide deployed to Preview March 4, 2025 10:41 — with GitHub Actions View deployment

petercat-assistant bot reviewed Mar 4, 2025

View reviewed changes

ch-liuzhide merged commit dd2d8cb into main Mar 4, 2025
3 of 4 checks passed

petercat-assistant bot reviewed Mar 4, 2025

View reviewed changes

ch-liuzhide deleted the whisker branch March 4, 2025 10:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(rag): update rag params #765

refactor(rag): update rag params #765

ch-liuzhide commented Mar 4, 2025

vercel bot commented Mar 4, 2025

petercat-assistant bot commented Mar 4, 2025

petercat-assistant bot Mar 4, 2025

petercat-assistant bot Mar 4, 2025

codecov bot commented Mar 4, 2025 •

edited

Loading

refactor(rag): update rag params #765

refactor(rag): update rag params #765

Conversation

ch-liuzhide commented Mar 4, 2025

vercel bot commented Mar 4, 2025

petercat-assistant bot commented Mar 4, 2025

Walkthrough

Changes

petercat-assistant bot Mar 4, 2025

Choose a reason for hiding this comment

petercat-assistant bot Mar 4, 2025

Choose a reason for hiding this comment

codecov bot commented Mar 4, 2025 • edited Loading

Codecov Report

codecov bot commented Mar 4, 2025 •

edited

Loading