Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

help with fine tune layoutxlm #466

Open
piarosebelledelapaz opened this issue Oct 3, 2024 · 0 comments
Open

help with fine tune layoutxlm #466

piarosebelledelapaz opened this issue Oct 3, 2024 · 0 comments

Comments

@piarosebelledelapaz
Copy link

hello,

i want to fine tune layout xlm with 13 categories i have annotations from PaddleOCR KIE, with a dataset structure of:

image.jpeg \t annotations
image.jpeg \t annotations
image.jpeg \t annotations

where the annotations include transcription, points (bboxes), key_cls (labels), and is stored in a txt file.

i was using the approach of paddleocr but the KIE models based on layoutlm has restrictions with max_seq_len=512, so i would like to finetune the model myself and have the max_seq_len=1024. any assistance on how to do it?

also is it possible to use my weights from the trained model of paddleocr for the detection adn recognition?

thank you for the help in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant