Local backup validation & file integrity dashboard built with Python and Streamlit.
Professional sysadmin-style tooling for backup verification, integrity checks, duplicate detection and reporting.
Backup Guardian is a lightweight local dashboard designed to help analyze backup folders, validate backup freshness, verify file integrity, detect duplicate files, and generate reports.
The project was created as part of my learning journey during my Ausbildung as a Fachinformatiker fรผr Systemintegration.
The application runs completely locally and does not require:
- cloud services
- paid APIs
- databases
- Docker
- external infrastructure
All file analysis is performed directly on the local machine.
- Modern overview page
- Quick backup validation overview
- Clean sysadmin-style UI
- Simple workflow explanation
- Recursive folder scanning
- File statistics
- Largest file overview
- File extension analysis
- Modified date tracking
-
Detect outdated backups
-
Configurable backup age threshold
-
Status indicators:
- OK
- WARNING
- CRITICAL
-
Backup freshness validation
- SHA256 checksum generation
- File integrity validation
- Single file or folder hashing
- CSV checksum export
- Duplicate file detection using SHA256 hashes
- Duplicate grouping
- Wasted storage estimation
- CSV duplicate reports
- Generate local backup reports
- Plain text export
- Folder summaries
- Backup health overview
| Technology | Purpose |
|---|---|
| Python | Core language |
| Streamlit | Dashboard UI |
| pathlib / os | File handling |
| hashlib | SHA256 hashing |
| pandas | Tables & reports |
| datetime | Backup age calculations |
The interface is inspired by:
- modern internal IT dashboards
- backup operation panels
- sysadmin workflows
- cybersecurity-inspired tooling
- Dark mode only
- Professional dashboard layout
- Clean sidebar navigation
- Modern metric cards
- Soft borders and spacing
- Status badges
- Terminal-inspired report sections
- Responsive layout
- Python 3.10 or newer recommended
- Local laptop or desktop
- Internet access only for installing Python packages
git clone https://github.com/mrachcore/backup-guardian.gitcd backup-guardianpython -m venv .venv.venv\Scripts\activatesource .venv/bin/activatepip install -r requirements.txtstreamlit run app.pyStreamlit will print a local URL, usually:
http://localhost:8501
Open that URL in your browser.
backup-guardian/
โ
โโโ app.py
โโโ requirements.txt
โโโ README.md
โ
โโโ assets/
โ โโโ logo.png
โ
โโโ screenshots/
โ โโโ dashboard.png
โ โโโ folder-scanner.png
โ โโโ backup-age-checker.png
โ โโโ integrity-checker.png
โ โโโ duplicate-finder.png
โ โโโ reports.png
โ
โโโ utils/
โโโ file_scanner.py
โโโ checksum_tools.py
โโโ report_generator.py
This project was built to:
- improve Python skills
- learn practical file handling
- understand backup validation workflows
- practice SHA256 integrity checks
- create realistic sysadmin tooling
- build a professional portfolio project
- Recursive file scanning
- SHA256 checksum generation
- File integrity validation
- Duplicate detection logic
- Report generation
- Streamlit dashboard development
- Error handling
- Beginner-friendly project organization
Possible future additions:
- ZIP backup inspection
- JSON report export
- Backup schedule tracking
- Backup comparison system
- File change history
- Multi-folder monitoring
- Cleanup assistant for duplicate files
- All checks are performed locally.
- No files are uploaded anywhere.
- Large folders may take longer to scan.
- Some files may be skipped because of permission restrictions.
- Duplicate detection uses SHA256 hashes for accuracy.
Code. Connect. Control.
Repository:
https://github.com/mrachcore/backup-guardian
This tool was created for educational and portfolio purposes.
Use responsibly and only on files and backup folders you are authorized to analyze.






