-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
data donation #5
Comments
Hey @breunigs, that looks amazing! I'd be very, very happy if you could send me some of your data - especially since you have a lot of faces/heads in there. They're incredibly underrepresented in my dataset, and the (comparatively) few faces I have aren't always perfectly labeled either. I don't drive all that much, and when I do it's pretty much just highways. I might take a page out of your book and start taking a camera when I ride my bike though! I don't fully know how to best share data with me privately - aside from github, you can reach me via email at my github username (at) me.com. Or any other means you prefer really. Thank you so much! |
I invited you to a private github repo, they should have sent you an email. |
They did. Thank you once more!! I'll start the download with your command and a rate limit of 10MiB/s, do tell me (here or wherever) in case I'm straining the server too much. |
You don't need to constrain yourself that much, 100 MBit/s should be no problem. |
oh wait, nevermind, I misread that you wrote MiB/s, so yeah, that sounds reasonable :) |
I have ~29h of video (5x speed up, so technically ~145h), including understand.ai's labels. It is definitely a different camera than yours. I can donate that for retraining.
Since their model has some unfortunate misses or inaccurate bounding boxes, I also went ahead and started manually labeling. That dataset is currently only ~10k images with ~18k head¹ and ~9k plate bounding boxes that are "large enough" to avoid training the model on bboxes with only a few pixels. I can also donate the model with these bboxes at 1280px, which proved sufficient for my use case, as the small bboxes are not discernible anyway. The model is definitely not perfect, here's an example inference that was not used for training/validation: https://vimeo.com/660127406 . The model is trained from yolov5x6 and 1.1GB, since "deployability to someone else's computer" was not a concern, though.
¹: I use a "head + neckline" bounding box, rather than just "face" as anonymizer does. I.e. the two are incompatible.
The text was updated successfully, but these errors were encountered: