A RAG (Retrieval Augmented Generation) system that enables intelligent conversations with documents. Upload PDF and DOCX files, ask questions, and get accurate answers based solely on your document content.
- Document Upload & Processing: Automatic extraction and processing of PDF and DOCX files
- Semantic Search: Vector-based search using embeddings to find relevant information by meaning, not just keywords
- RAG Implementation: Retrieval Augmented Generation ensures answers are based only on uploaded documents
- Real-time Status Updates: Live file processing status via Socket.IO
- Multi-file Support: Select and search across multiple documents simultaneously
- Transparent Responses: System explicitly states when information isn't available in documents
- AI/ML:
  - Google Gemini API (`@google/genai`) for content generation
  - Gemini Embeddings (`gemini-embedding-001`) for vector embeddings
- File Processing:
  - `pdf-parse` for PDF text extraction
  - `mammoth` for DOCX text extraction
- Storage: Local file storage (with abstraction for future cloud migration)
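To illustrate the embeddings dependency, here is a minimal sketch of calling `gemini-embedding-001` through `@google/genai`. The `GEMINI_API_KEY` environment variable is this example's assumption, not necessarily the project's convention:

```ts
import { GoogleGenAI } from "@google/genai";

// Assumes the API key is provided via GEMINI_API_KEY (this example's
// convention, not necessarily the project's).
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

async function main() {
  // Embed one piece of text with the same model the app uses.
  const res = await ai.models.embedContent({
    model: "gemini-embedding-001",
    contents: "What is retrieval augmented generation?",
  });
  // One embedding per input; `values` is the raw vector.
  console.log(res.embeddings?.[0]?.values?.length);
}

main();
```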
- File Upload: User uploads a PDF or DOCX file via drag-and-drop
- Raw Storage: File saved to local storage with UUID-based naming
- Text Extraction:
  - PDF: Extracted using `pdf-parse`
  - DOCX: Extracted using `mammoth`
- Chunking: Text split into semantic chunks (~2000 chars) by paragraphs
- Embedding Generation: Each chunk converted to a vector embedding using the Gemini Embeddings API (batched, 15 chunks at a time; sketched below)
- Storage:
  - Raw files stored locally
  - Chunks stored in the MongoDB `chunks` collection with embeddings
  - File metadata stored in the MongoDB `files` collection
- Status Updates: Real-time status updates (`processing` → `ready`/`error`) via Socket.IO
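A minimal sketch of the per-file pipeline under these assumptions; helper names and the `file:status` Socket.IO event are illustrative, not the project's actual identifiers:

```ts
import { readFile } from "node:fs/promises";
import pdfParse from "pdf-parse";
import mammoth from "mammoth";
import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

// Extract plain text from the stored raw file based on its type.
async function extractText(path: string, type: "pdf" | "docx"): Promise<string> {
  const buffer = await readFile(path);
  if (type === "pdf") return (await pdfParse(buffer)).text;
  return (await mammoth.extractRawText({ buffer })).value;
}

// Split text into ~2000-character chunks along paragraph boundaries.
function chunkText(text: string, maxLen = 2000): string[] {
  const chunks: string[] = [];
  let current = "";
  for (const para of text.split(/\n\s*\n/)) {
    if (current && current.length + para.length > maxLen) {
      chunks.push(current);
      current = "";
    }
    current = current ? current + "\n\n" + para : para;
  }
  if (current) chunks.push(current);
  return chunks;
}

// Embed chunks in batches of 15, as described in the pipeline above.
async function embedChunks(chunks: string[]): Promise<number[][]> {
  const vectors: number[][] = [];
  for (let i = 0; i < chunks.length; i += 15) {
    const res = await ai.models.embedContent({
      model: "gemini-embedding-001",
      contents: chunks.slice(i, i + 15),
    });
    vectors.push(...(res.embeddings ?? []).map((e) => e.values ?? []));
  }
  return vectors;
}

// After chunks and embeddings are persisted, a status event could be
// emitted, e.g.: io.emit("file:status", { fileId, status: "ready" });
```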
- File Selection: User selects files via checkboxes
- Query Embedding: User's question converted to embedding vector
- Vector Search: MongoDB `$vectorSearch` finds the top 3-10 most relevant chunks from the selected files
- Context Injection: Relevant chunks injected into the prompt as context
- Response Generation: Gemini model generates an answer using the provided context (see the query-time sketch after this list)
- Transparency: If the information isn't available, the system explicitly says so
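A query-time sketch under the same assumptions; `searchChunks` stands in for the `$vectorSearch` aggregation detailed in the next section, and the model name is an illustrative choice:

```ts
import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

// Stand-in for the MongoDB $vectorSearch query (see the next sketch).
declare function searchChunks(
  queryVector: number[],
  fileIds: string[],
): Promise<{ text: string }[]>;

async function answerQuestion(question: string, fileIds: string[]): Promise<string> {
  // 1. Convert the user's question into an embedding vector.
  const embedRes = await ai.models.embedContent({
    model: "gemini-embedding-001",
    contents: question,
  });
  const queryVector = embedRes.embeddings?.[0]?.values ?? [];

  // 2. Retrieve the most relevant chunks from the selected files.
  const chunks = await searchChunks(queryVector, fileIds);

  // 3. Inject the chunks as context and generate a grounded answer.
  const context = chunks.map((c) => c.text).join("\n---\n");
  const res = await ai.models.generateContent({
    model: "gemini-2.5-flash", // illustrative model choice
    contents:
      "Answer using ONLY the context below. If the answer is not in the " +
      "context, say the information is not available in the documents.\n\n" +
      `Context:\n${context}\n\nQuestion: ${question}`,
  });
  return res.text ?? "";
}
```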
- Index: MongoDB vector search index on the `embedding` field
- Search Method: Semantic similarity using cosine distance
- Filtering: Results filtered by the selected `fileIds`
- Scoring: Results include `vectorSearchScore` for relevance ranking (an example aggregation follows this list)
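Concretely, the retrieval step can be expressed as a MongoDB Atlas aggregation along these lines. The index name, database name, and `fileId` filter field are assumptions of this sketch:

```ts
import { MongoClient } from "mongodb";

const client = new MongoClient(process.env.MONGODB_URI ?? "");
const chunks = client.db("rag").collection("chunks"); // db name is illustrative

// Find the most relevant chunks among the selected files.
async function searchChunks(queryVector: number[], fileIds: string[]) {
  return chunks
    .aggregate([
      {
        $vectorSearch: {
          index: "vector_index", // assumed index name
          path: "embedding",     // field holding the chunk embedding
          queryVector,
          numCandidates: 100,    // candidates scored before ranking
          limit: 10,             // top-k chunks returned
          filter: { fileId: { $in: fileIds } }, // restrict to selected files
        },
      },
      // Surface the similarity score used for relevance ranking.
      { $project: { text: 1, fileId: 1, score: { $meta: "vectorSearchScore" } } },
    ])
    .toArray();
}
```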
This project requires specific versions of Node.js and pnpm. Please check the `engines` and `packageManager` fields in `package.json` for the required versions.
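For orientation, those fields look something like the following; the version numbers here are placeholders, so use the values actually in `package.json`:

```json
{
  "engines": {
    "node": ">=20"
  },
  "packageManager": "pnpm@9.0.0"
}
```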
If you're using nvm, you can automatically switch to the correct Node.js version by running `nvm use`, which reads the version from the `.nvmrc` file and switches to it automatically.
If you have Corepack enabled (included with Node.js 16.10+), pnpm will automatically use the version specified in the `packageManager` field of `package.json`. You can enable Corepack by running `corepack enable`.

To run the infrastructure and all services, just run `pnpm start`. Under the hood this will:

- Start base infrastructure services in Docker containers: `pnpm run infra`
- Run the services with Turborepo: `pnpm run turbo-start`
Alternatively, to run the infrastructure and all services, execute `pnpm run docker`, or run the steps manually:

- Start base infrastructure services in Docker containers: `pnpm run infra`
- Run the services you need: `./bin/start.sh api web`
You can also run infrastructure services separately using the `./bin/start.sh` bash script.