[enhancement]: Spam filter OCR and QR-code reading #1034

Open
1 task done
TaaviE opened this issue Dec 30, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

TaaviE commented Dec 30, 2024

Which feature or improvement would you like to request?

I'd love to see really simple/barebones OCR and QR link/text extraction built into Stalwart. This would make it somewhat harder to bypass the spam filter by just using images.
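To illustrate the link/text extraction half of this request, here is a minimal sketch in Python of what could happen after an image has been decoded. It assumes some OCR or QR decoder (not shown, and not chosen here) has already produced plain text; `extract_links` and `link_hosts` are hypothetical helper names, not Stalwart APIs. The idea is only that URLs recovered from images could be handed to the same checks the spam filter already applies to links in the message body.

```python
import re
from urllib.parse import urlsplit

# Hypothetical helpers for illustration; not part of Stalwart.
# Input is text that an OCR or QR decoder has already produced.
_URL_RE = re.compile(r"""https?://[^\s<>"']+""", re.IGNORECASE)


def extract_links(decoded_text):
    """Return unique http(s) URLs found in decoded image text, in order of appearance."""
    seen = set()
    links = []
    for match in _URL_RE.finditer(decoded_text):
        # Drop trailing punctuation that usually belongs to the sentence, not the URL.
        url = match.group(0).rstrip(".,;:!?)")
        try:
            if not urlsplit(url).netloc:
                continue
        except ValueError:
            # Malformed URL (for example a broken IPv6 literal); skip it.
            continue
        if url not in seen:
            seen.add(url)
            links.append(url)
    return links


def link_hosts(links):
    """Return the sorted set of lowercase hostnames for the given URLs."""
    hosts = set()
    for url in links:
        host = urlsplit(url).hostname
        if host:
            hosts.add(host)
    return sorted(hosts)
```

The hostnames from `link_hosts` are the part most likely to be useful for reputation or blocklist lookups, since the paths in phishing links tend to be unique per message.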

Is your feature request related to a problem?

Recent phishing waves have started using images to bypass the spam filter. While some other approaches might be able to mark messages like that as suspicious, being able to actually see the linked content would be a great improvement.

Code of Conduct

  • I agree to follow this project's Code of Conduct
@TaaviE TaaviE added the enhancement New feature or request label Dec 30, 2024
Member

mdecimus commented Jan 3, 2025

Although implementing these enhancements shouldn't be too much work, my only concern is the latency these checks will add to message intake. But I'll check if there is a lightweight Rust library for OCR.

Regarding the QR spam, can you attach a few examples to this issue? I haven't seen this in the real world. Also, what is usually encoded in the QR, is it a link or something else?

Author

TaaviE commented Jan 3, 2025

> Although implementing these enhancements shouldn't be too much work, my only concern is the latency these checks will add to message intake. But I'll check if there is a lightweight Rust library for OCR.

That's a fair point, though the trade-off is likely acceptable for some operators, especially smaller ones.

I suspect some kind of lookup table and sampling system for the more compute-intensive scans might be a bit complex to implement?
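For reference, one possible reading of the lookup table and sampling idea, as a rough Python sketch rather than a proposal for how Stalwart should do it. It assumes a verdict can be cached per image by content hash, and that unseen images are only scanned with some probability to cap the added latency. `ScanCache`, `scan_fn`, and `sample_rate` are hypothetical names introduced for illustration.

```python
import hashlib
import random


class ScanCache:
    """Hypothetical verdict cache keyed by image content hash.

    Images seen before are answered from the lookup table; unseen images
    are passed to the expensive scanner only for a sampled fraction.
    """

    def __init__(self, scan_fn, sample_rate=0.25, rng=None):
        self.scan_fn = scan_fn          # expensive OCR/QR scan, supplied by the caller
        self.sample_rate = sample_rate  # fraction of unseen images to scan
        self.rng = rng or random.Random()
        self.verdicts = {}              # sha256 hex digest -> cached verdict

    def check(self, image_bytes):
        key = hashlib.sha256(image_bytes).hexdigest()
        if key in self.verdicts:
            # Cheap path: this exact image was scanned before.
            return self.verdicts[key]
        if self.rng.random() >= self.sample_rate:
            # Not sampled: skip the expensive scan, no verdict available.
            return None
        verdict = self.scan_fn(image_bytes)
        self.verdicts[key] = verdict
        return verdict
```

Two caveats with this sketch: an exact hash only matches byte-identical images, so a campaign that alters each image slightly would miss the table every time, and the dictionary grows without bound, so a real deployment would need some eviction policy.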

> Regarding the QR spam, can you attach a few examples to this issue? I haven't seen this in the real world. Also, what is usually encoded in the QR, is it a link or something else?

Here's a recent example (fake tax return notice):
[Image: scam letter example]
