Replies: 3 comments 5 replies
-
Hey Matt
I read somewhere that the MD5 checksum is sadly not "safe to use":
But this should not be the case with what you described... Do you have an example of a file you could share here which does not generate the same MD5 Checksum? With the checksum computed by FSCrawler and the one computed within Powershell? A workaround could be to use another algorithm... |
Beta Was this translation helpful? Give feedback.
-
Hi, A colleague of mine has said they're getting mismatch when OCR is disabled, but when enabled they match.. The mismatches I have had are apart of indexes that have had OCR disabled. OCR has always been enabled in the settings, but FSCrawler says it's disabled as it begins its crawl (only for some crawls). I pointed my _settings at Tesseract-ocr and OCR now looks to be running as intended however I'm still getting a mismatch unlike my colleague. I'll keep playing and post any updates. |
Beta Was this translation helpful? Give feedback.
-
So I haven't been able to figure this one out. Using this gif, Powershell get-filehash returns 43B14C45B4F719D7DB4B61E69772943D I get the same results using FSCrawler 2.7 & 2.10 and both crawlers are pointed to different ES servers. |
Beta Was this translation helpful? Give feedback.
-
Hi,
I've came across a few files that have been crawled with checksum MD5 enabled, how ever when I run the files through Powershell with get-filehash -Algorithm MD5 they don't match. I have checked other items in the index and they do match, so it only appears to be some.
Any idea what maybe causing this?
Thank you. Matt
Beta Was this translation helpful? Give feedback.
All reactions