
Try it live on the HuggingFace Space: click "Start Quacking" on the landing page.


QueryQuack

Quack the query, crack the PDF!


QueryQuack transforms how you interact with PDF documents by letting you have conversations with them instead of manually searching through them. It is a document query engine that allows users to upload PDFs and query them in natural language; the system uses vector embeddings and retrieval-based question answering to return responses grounded in the document content.

Setup Instructions

Follow these steps to set up QueryQuack locally:

1. Clone the repository

```
git clone https://github.com/negativenagesh/QueryQuack.git
cd QueryQuack
```

2. Create a virtual environment

```
python3 -m venv .venv
source .venv/bin/activate
```

3. Install dependencies

```
pip install -r pkgs.txt
```

4. Set up the Gemini API

```
# Create a .env file in the project directory
touch .env

# Add your Gemini API key to the .env file
echo "api_key=your_api_key_here" >> .env
```

5. Set up the Pinecone API

Create a free Pinecone account at https://www.pinecone.io/, then create a new project and index in the Pinecone console.

```
# Add your Pinecone API key and environment to the .env file
echo "PINECONE_API_KEY=your_pinecone_api_key_here" >> .env
echo "PINECONE_ENVIRONMENT=your_environment_here" >> .env
echo "PINECONE_INDEX=your_index_name_here" >> .env
```
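To confirm the `.env` format, here is a minimal sketch of how those `KEY=VALUE` lines could be parsed into a config dict. In practice a library such as python-dotenv typically handles this; the key names below are the ones from the setup steps, and `parse_env` is an illustrative helper, not part of QueryQuack.

```python
# Minimal sketch: parse .env-style "KEY=VALUE" lines into a dict.
# The key names match the setup steps above; the parser itself is
# illustrative (a real app would likely use python-dotenv).
def parse_env(text: str) -> dict:
    env = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, comments, and malformed lines.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip()
    return env

sample = """api_key=your_api_key_here
PINECONE_API_KEY=your_pinecone_api_key_here
PINECONE_ENVIRONMENT=your_environment_here
PINECONE_INDEX=your_index_name_here"""

config = parse_env(sample)
```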

Landing page

Main page

Data Flow

  1. Document Ingestion:
PDF Upload → Text Extraction → Chunking → Embedding Generation → Pinecone Storage
  2. Query Processing:
User Query → Query Processing → Embedding Generation → Vector Search → Chunk Retrieval → Response Generation → Display

How does QueryQuack work?

Document Processing Pipeline

  1. Text Extraction: PDFs are processed page by page to extract raw text. Document structure (headings, paragraphs) is preserved where possible. Images and non-textual elements are noted but not processed.

  2. Chunking: Extracted text is divided into smaller, semantically meaningful segments. Each chunk maintains metadata about its source document and location. Chunks overlap slightly to preserve context across boundaries.

  3. Vector Embedding: Each text chunk is transformed into a high-dimensional vector representation. These embeddings capture the semantic meaning of the text, so similar concepts have similar vectors even when they use different words.

  4. Storage in Pinecone: All vector embeddings are stored in a Pinecone vector database. Documents are organized by user session to maintain privacy, and the database enables fast similarity searching.
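The chunking step above can be sketched as a small pure function. The chunk size, overlap, and metadata field names here are illustrative assumptions, not values taken from the QueryQuack code; the real pipeline would pass the resulting chunks to the embedding model and then to Pinecone.

```python
# Hypothetical sketch of the chunking step: split extracted text into
# overlapping fixed-size chunks, each carrying source metadata.
# chunk_size/overlap and the field names are illustrative only.
def chunk_text(text, source, chunk_size=200, overlap=50):
    chunks = []
    step = chunk_size - overlap  # advance less than chunk_size to overlap
    for start in range(0, max(len(text) - overlap, 1), step):
        chunks.append({
            "text": text[start:start + chunk_size],
            "source": source,   # originating PDF
            "offset": start,    # position within the extracted text
        })
    return chunks

doc = "".join(str(i % 10) for i in range(500))  # stand-in for extracted text
chunks = chunk_text(doc, "report.pdf")
```

Because each step advances by `chunk_size - overlap`, the tail of one chunk repeats as the head of the next, preserving context across chunk boundaries.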

Query Processing Pipeline

  1. Query Understanding: Your natural language question is analyzed for intent and key concepts. The system considers conversation context from previous questions. The query is transformed into a vector embedding using the same process as the documents.

  2. Semantic Search: The query vector is compared against all document chunk vectors. Pinecone performs this similarity search in milliseconds. The most relevant chunks are retrieved based on semantic similarity, not just keyword matching.

  3. Context Assembly: The top matching chunks are compiled into a comprehensive context that represents the most relevant parts of your documents for the query. Source metadata is preserved for attribution.

  4. Response Generation: The Gemini API uses the assembled context to generate a coherent answer that directly addresses your question. Citations link back to the specific parts of your original documents.
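The semantic-search step can be illustrated with plain cosine similarity over toy vectors. In QueryQuack this comparison happens inside Pinecone over 384-dimensional all-MiniLM-L6-v2 embeddings; the 3-dimensional vectors and chunk IDs below are stand-ins for illustration.

```python
# Toy illustration of semantic search: rank stored chunk vectors by
# cosine similarity to the query vector. Pinecone performs this step
# in the real system; vectors here are 3-D stand-ins.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query_vec, chunk_vecs, k=2):
    # Sort chunk IDs by similarity to the query, highest first.
    scored = sorted(chunk_vecs.items(),
                    key=lambda kv: cosine(query_vec, kv[1]),
                    reverse=True)
    return [chunk_id for chunk_id, _ in scored[:k]]

index = {
    "chunk-1": [0.9, 0.1, 0.0],
    "chunk-2": [0.0, 1.0, 0.2],
    "chunk-3": [0.8, 0.2, 0.1],
}
matches = top_k([1.0, 0.0, 0.0], index)
```

The retrieved chunk texts would then be concatenated into the context passed to Gemini for response generation.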

Multi-Document Queries

  • Ask questions that span multiple uploaded documents
  • The system automatically finds connections between different sources
  • Compare and contrast information across documents

Conversation Memory

  • References to previous questions are understood (e.g., "Tell me more about that")
  • The system maintains the conversation context throughout your session
  • No need to repeat context in follow-up questions
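One common way to implement this kind of session memory is to keep prior turns and prepend them to the prompt sent to the LLM, so a follow-up like "Tell me more about that" resolves against earlier answers. The sketch below shows that pattern; the prompt format and `build_prompt` helper are assumptions, not QueryQuack's actual implementation.

```python
# Hedged sketch of conversation memory: store (question, answer) turns
# and prepend them to each new prompt. The prompt layout is illustrative.
history = []

def build_prompt(question, context):
    turns = "\n".join(f"Q: {q}\nA: {a}" for q, a in history)
    return f"{turns}\nContext:\n{context}\nQ: {question}\nA:"

history.append(("What is chunking?", "Splitting text into segments."))
prompt = build_prompt("Tell me more about that", "retrieved chunks go here")
```

Because the earlier turn is included verbatim, the model can resolve "that" to chunking without the user restating it.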

Source Attribution

  • Every answer shows exactly which documents contributed to the response
  • Navigate directly to specific sections in source documents
  • Verify information against the original content

Models and Tech Stack Used in QueryQuack

  1. Embedding Model: all-MiniLM-L6-v2. Used for generating vector embeddings from text chunks:
  • Efficient 384-dimensional embeddings that capture semantic meaning
  • Lightweight enough to run on CPU without requiring GPU resources
  • Good balance between performance and computational requirements
  • Well supported through HuggingFace's ecosystem
  • Strong semantic understanding for accurate retrieval
  2. LLM for Response Generation: Gemini 1.5 Flash. Used for generating coherent responses based on the retrieved context:
  • Free
  • Maintains good performance while being more cost-effective than comparable models
  3. Vector Database: Pinecone. Used for storing and retrieving vector embeddings:
  • Purpose-built for vector similarity search at scale
  • Millisecond query times even with large vector collections
  • Supports namespaces for organizing data by user session
  • Cloud-based with simple API integration
  • Specialized indexing for high-dimensional vectors
  • Optimized for semantic search rather than keyword matching
  • Supports metadata filtering for more targeted retrieval
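Two of the Pinecone features listed above, per-session namespaces and metadata filtering, can be illustrated with a toy in-memory index. Real queries go through the Pinecone client; this sketch only mimics the semantics, and all IDs, vectors, and metadata are invented.

```python
# Toy emulation of Pinecone namespaces and metadata filtering.
# Each namespace holds one session's records; a query only searches
# its own namespace, keeping sessions isolated.
index = {
    "session-a": [
        {"id": "c1", "values": [0.9, 0.1], "metadata": {"source": "report.pdf"}},
        {"id": "c2", "values": [0.1, 0.9], "metadata": {"source": "notes.pdf"}},
    ],
    "session-b": [
        {"id": "c3", "values": [0.5, 0.5], "metadata": {"source": "other.pdf"}},
    ],
}

def query(namespace, flt):
    # Return IDs in the namespace whose metadata matches every filter key.
    return [rec["id"] for rec in index.get(namespace, [])
            if all(rec["metadata"].get(k) == v for k, v in flt.items())]

ids = query("session-a", {"source": "report.pdf"})
```

In the real system the filter would be combined with vector similarity ranking, so only matching chunks from the caller's own session are scored and returned.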

The system architecture uses specialized components for each part of the pipeline, creating an efficient, scalable solution for document question-answering without requiring specialized hardware.

License

QueryQuack is released under the Apache License.
