InsightX is an AI-powered document processing and analysis system designed for Kochi Metro Rail Limited (KMRL) to handle document overload through automated processing, summarization, and intelligent search capabilities.
Problem: Document overload at KMRL requiring automated solution to process and summarize documents Goal: Build an MVP that ingests documents (English & Malayalam), extracts insights, summarizes content, and provides a searchable dashboard with alerts
InsightX/
├── backend/ # RestAPI Python backend
├── frontend/ # React.js TypeScript frontend
├── infrastructure/ # GCP configs
└── docs/ # Documentation
- API: FastAPI (Python)
- Database: PostgreSQL (structured data)
- Vector DB: Pinecone (embeddings)
- Storage: Google Cloud Storage
- Queue: Redis (background jobs)
- OCR: Google Cloud Document AI
- LLM: Vertex AI (Gemini models)
- Embeddings: BERT via transformers
- Voice: Google Speech-to-Text
- Framework: React.js with TypeScript
- Styling: Tailwind CSS
- Auth: JWT-based authentication
- Cloud: Google Cloud Platform (GCP)
- Containers: Docker + Kubernetes (GKE)
- CI/CD: GitHub Actions
- Front-line Managers - Operational summaries
- Compliance Officers - Regulatory alerts & deadlines
- Cross-Department Teams - Shared updates
- Engineers/Retiring Employees - Knowledge preservation
- Upload PDFs, Word, scanned images
- OCR support (English + Malayalam)
- Bulk processing via Redis workers
- Role-based summaries (compliance, operations, technical)
- Extract deadlines and key actions
- Cross-link related documents
- Web dashboard with document viewer
- Text and voice search capabilities
- Role-based access control
- Automated compliance alerts
- Email notifications for regulatory updates
- JWT-based authentication
- Role-based access control
- AES-256 encryption for sensitive data
- OCR accuracy ≥ 80% (English), ≥ 70% (Malayalam)
- Summary relevance rating ≥ 0.8
- Handle 500 documents/week
- Support 3 user roles with separate views
- Python 3.11+
- Node.js 18+
- Docker & Docker Compose
- Google Cloud Platform account
cd backend
pip install -r requirements.txt
uvicorn app.main:app --reloadcd frontend
npm install
npm startdocker-compose up -dOnce running, visit:
- Backend API: http://localhost:8000/docs
- Frontend: http://localhost:3000
- SharePoint/Maximo integration
- Mobile application
- Advanced analytics & reporting
- Multi-language support expansion
MIT License - see LICENSE file for details
InsightX - Transforming document chaos into actionable insights