Skip to content

walterkwong/vit_receipt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ViT Receipt Recorder

A Vision Transformer specialised in reading and recording receipts using MiniCPM-V-4_5-int4. Automatically processes all images in a folder, analyzes them, and generates a CSV file for easy auditing tasks.

GPU Highly Recommended.

Installation

git clone https://github.com/walterkwong/vit_receipt
cd vit_receipt
pip install .

Model Access

openbmb/MiniCPM-V-4_5-int4

  1. Create an access token on Hugging Face.

  2. Authenticate with your token:

    hf auth login

Note:
The first run will download the model (~6.54 GB). Ensure you have sufficient disk space and a stable internet connection. Also, this is a very immature project solely for personal use, but do let me know if you want me to work on this project.

Model used: openbmb/MiniCPM-V-4_5-int4

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages