Agentic RAG

A small-scale PDF-based Retrieval-Augmented Generation (RAG) system designed for correctness, clear separation of concerns, and practical efficiency.

Features

Persistent Vector Storage: Uses Pinecone for scalable vector storage
Incremental Indexing: Re-indexes only the PDFs that have changed
Grounded Answer Generation: Pluggable LLM layer whose answers are grounded in the retrieved chunks
Local Caching: SQLite-backed caching to avoid unnecessary LLM calls
CLI Interface: Command-line tools for indexing and querying
FastAPI API: HTTP endpoints for programmatic access
Analytics: Logs queries for internal analysis
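
To illustrate the local caching idea, here is a minimal sketch of a SQLite-backed answer cache. It is not the project's actual cache: the class name `AnswerCache`, the table layout, the question normalization, and the `generate` callback are all assumptions made for the example.

```python
import hashlib
import sqlite3
from typing import Callable, Optional


class AnswerCache:
    """Hypothetical SQLite cache keyed by a hash of the normalized question."""

    def __init__(self, path: str = ":memory:") -> None:
        # ":memory:" keeps the example self-contained; a real cache would use a file.
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS answers "
            "(key TEXT PRIMARY KEY, answer TEXT NOT NULL)"
        )

    @staticmethod
    def _key(question: str) -> str:
        # Assumption: questions that differ only in case or spacing share an entry.
        normalized = " ".join(question.lower().split())
        return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

    def get(self, question: str) -> Optional[str]:
        row = self.conn.execute(
            "SELECT answer FROM answers WHERE key = ?", (self._key(question),)
        ).fetchone()
        return row[0] if row else None

    def put(self, question: str, answer: str) -> None:
        with self.conn:  # commits on success, rolls back on error
            self.conn.execute(
                "INSERT OR REPLACE INTO answers (key, answer) VALUES (?, ?)",
                (self._key(question), answer),
            )


def answer_with_cache(
    cache: AnswerCache, question: str, generate: Callable[[str], str]
) -> str:
    """Return a cached answer if present; otherwise call the LLM and store the result."""
    cached = cache.get(question)
    if cached is not None:
        return cached
    answer = generate(question)
    cache.put(question, answer)
    return answer
```

A cache like this would probably also need to be invalidated whenever the index changes, since a re-indexed document can make a stored answer stale.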

Architecture

The system follows a clean pipeline: PDF -> chunk -> embed -> Pinecone -> retrieve -> LLM

Components

Ingestion: Loads PDFs from data/, cleans text, splits into chunks
Embedding: Uses sentence-transformers/all-MiniLM-L6-v2 for embeddings
Vector Storage: Pinecone for persistent vector storage
Retrieval: Semantic search over embedded chunks
Generation: LLM-powered answer generation with grounding
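
The chunk, embed, and retrieve stages can be pictured with the toy sketch below. It uses only the standard library: a bag-of-words counter stands in for the sentence-transformers model, and an in-memory list stands in for Pinecone, so it shows the shape of the pipeline rather than the project's implementation. All function names and default values here are hypothetical.

```python
import math
import re
from collections import Counter


def chunk_text(text: str, chunk_size: int = 8, overlap: int = 2) -> list[str]:
    """Split text into windows of `chunk_size` words that share `overlap` words."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def embed(text: str) -> Counter:
    """Stand-in embedding: token counts instead of a dense vector from a model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question: str, chunks: list[str], top_k: int = 2) -> list[str]:
    """Rank chunks by similarity to the question and keep the best `top_k`."""
    query = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(query, embed(c)), reverse=True)
    return ranked[:top_k]
```

In the real system, `embed` would call the embedding model and `retrieve` would query the vector store; the overlap between neighboring chunks helps keep sentences that straddle a chunk boundary retrievable.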

Installation

Clone the repository:

git clone https://github.com/yourusername/agentic-rag.git
cd agentic-rag

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Set up environment variables:

cp .env.example .env
# Edit .env with your API keys (Pinecone, Google AI, etc.)

Usage

CLI

Index documents:

python -m app.main index
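
One common way to re-index only changed PDFs is to compare content hashes against a manifest saved by the previous run. The sketch below assumes that approach; the manifest file, its JSON layout, and the function names are hypothetical and may differ from what the index command actually does.

```python
import hashlib
import json
from pathlib import Path


def file_digest(path: Path) -> str:
    """SHA-256 of the file contents, used as a change detector."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def find_changed_pdfs(
    data_dir: Path, manifest_path: Path
) -> tuple[list[Path], dict[str, str]]:
    """Return PDFs that are new or modified, plus the current name-to-hash map."""
    previous = json.loads(manifest_path.read_text()) if manifest_path.exists() else {}
    current = {p.name: file_digest(p) for p in sorted(data_dir.glob("*.pdf"))}
    changed = [
        data_dir / name
        for name, digest in current.items()
        if previous.get(name) != digest
    ]
    return changed, current


def save_manifest(manifest_path: Path, current: dict[str, str]) -> None:
    """Persist the hashes; call this only after indexing has succeeded."""
    manifest_path.write_text(json.dumps(current, indent=2))
```

Saving the manifest as a separate step means a failed indexing run is retried next time. This sketch does not handle PDFs that were deleted from data/; their vectors would need to be removed from the store separately.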

Ask questions:

python -m app.main ask "What is the main topic of the documents?"
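
Answer grounding usually comes down to how the prompt is assembled: the LLM sees only the retrieved chunks and is told to stay within them. The helper below is a hypothetical sketch of that step; the prompt wording and the function name are assumptions, not the project's actual prompt.

```python
def build_grounded_prompt(question: str, chunks: list[str]) -> str:
    """Assemble a prompt that restricts the model to the numbered passages."""
    context = "\n\n".join(
        f"[{i}] {chunk}" for i, chunk in enumerate(chunks, start=1)
    )
    return (
        "Answer the question using only the numbered context passages below. "
        "Cite the passage numbers you relied on, and reply \"I don't know\" "
        "if the context is insufficient.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )
```

Numbering the passages lets the answer cite its sources, which makes it easier to check a response against the underlying PDFs.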

API

Start the FastAPI server:

uvicorn app.api:app --reload

By default, uvicorn serves the API at http://localhost:8000.

Configuration

The system uses pydantic-settings for configuration. Key settings include:

Pinecone API key and environment
Google AI API key for LLM
Embedding model configuration
Chunk size and overlap settings

See app/config.py for all available options.
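
As a rough picture of how these settings fit together, the sketch below loads them from environment variables. The project itself uses pydantic-settings; this example deliberately uses only the standard library, so it is a stand-in and not the contents of app/config.py. The variable names (`PINECONE_API_KEY`, `GOOGLE_API_KEY`, `EMBEDDING_MODEL`, `CHUNK_SIZE`, `CHUNK_OVERLAP`) and the chunking defaults are assumptions; check .env.example and app/config.py for the real ones.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    """Hypothetical settings object; field names are illustrative only."""

    pinecone_api_key: str
    google_api_key: str
    embedding_model: str = "sentence-transformers/all-MiniLM-L6-v2"
    chunk_size: int = 500    # assumed default, not taken from the project
    chunk_overlap: int = 50  # assumed default, not taken from the project

    @classmethod
    def from_env(cls, env: Mapping[str, str] = os.environ) -> "Settings":
        """Build settings from a mapping of environment variables."""
        settings = cls(
            pinecone_api_key=env["PINECONE_API_KEY"],  # KeyError if missing
            google_api_key=env["GOOGLE_API_KEY"],
            embedding_model=env.get("EMBEDDING_MODEL", cls.embedding_model),
            chunk_size=int(env.get("CHUNK_SIZE", cls.chunk_size)),
            chunk_overlap=int(env.get("CHUNK_OVERLAP", cls.chunk_overlap)),
        )
        if not 0 <= settings.chunk_overlap < settings.chunk_size:
            raise ValueError("CHUNK_OVERLAP must be >= 0 and smaller than CHUNK_SIZE")
        return settings
```

Failing early on a missing key or an overlap that is not smaller than the chunk size surfaces configuration mistakes before any indexing work starts.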

Testing

Run tests:

pytest

Project Structure

app/
├── core/          # Core RAG pipeline components
├── infra/         # Infrastructure (embedding, storage, etc.)
├── utils/         # Utilities (logging, caching, etc.)
├── api.py         # FastAPI application
├── config.py      # Configuration management
├── graph.py       # LangGraph orchestration
└── main.py        # CLI entrypoint

tests/             # Test suite
data/              # PDF documents directory

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

MIT License. See the LICENSE file for details.
