Skip to content
Discussion options

You must be logged in to vote

If there are no changes you should not process a specific file with data loaders. It's an application problem.

I recommend to look at the RAG implementation with the following approach:

  1. Execute the first data loading pipeline to process all available files, articles, etc, and use $rag->addDocuments(...) for the initial population of the vector store.

  2. When new files are uploaded, you should call $rag->reindexBySource(...) if the file is an update of a previously processed file it will update the documents into the vecotr store. If it is a new file the documents will be added as new.

  3. If a file that was previously processed into the vector store is deleted, you can call $rag->resolveVe…

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@bilogic
Comment options

@ilvalerione
Comment options

@bilogic
Comment options

@ilvalerione
Comment options

Answer selected by bilogic
@bilogic
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants