Simple RAG with DSPy & ChromaDB

This project demonstrates a simple Retrieval-Augmented Generation (RAG) pipeline using DSPy and ChromaDB. The pipeline processes a PDF document, stores the text in a ChromaDB collection, and uses a language model to answer questions based on the retrieved context.

Project Structure

pipeline.ipynb: Jupyter notebook containing the code for the RAG pipeline.
data/tesla10K.pdf: Sample PDF file used for text extraction and processing.

Setup

Clone the repository:

git clone https://github.com/marioyordanoff/dspy

Create a virtual environment and activate it:

python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`

Install the required packages:
```
pip install -r requirements.txt
```
Create a .env file:
```
touch .env
```
Add your environment variables to the .env file. For example:
```
OPENAI_API_KEY=your_openai_api_key
```

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
teslasec		teslasec
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pipeline.ipynb		pipeline.ipynb
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simple RAG with DSPy & ChromaDB

Project Structure

Setup

License

Acknowledgements

About

Releases

Packages

Languages

License

obre10off/dspy

Folders and files

Latest commit

History

Repository files navigation

Simple RAG with DSPy & ChromaDB

Project Structure

Setup

License

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages