Best config for OCR-ready PNGs #4365

hymair · 2025-04-07T19:09:18Z

Question about an existing feature

What are you trying to achieve?

We are trying to get the most accurate OCR results possible. The images will mostly be photos of invoices and receipts that users take with their phones.

We want to downscale unnecessarily large images and reduce AI token usage by sending fewer pixels.

Please provide a minimal, standalone code sample, without other dependencies, that demonstrates this question

Current config:

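import sharp from 'sharp'

export async function optimizeImage(buffer: Buffer): Promise<Buffer> {
    const processedBuffer = await sharp(buffer)
        // Auto-orient using the EXIF Orientation tag (phone photos are often stored rotated)
        .rotate()
        // Fit within 2000x2000, preserving aspect ratio and never upscaling
        .resize({
            width: 2000,
            height: 2000,
            withoutEnlargement: true,
            fit: 'inside',
        })
        // Drop colour information
        .grayscale()
        // Stretch luminance to cover the full dynamic range
        .normalise()
        // Sharpen text edges after the downscale
        .sharpen({
            sigma: 1.2,
            m1: 0.5,
            m2: 0.5,
        })
        // Lossless output
        .png()
        .toBuffer()

    return processedBuffer
}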
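
To make the "fewer pixels" goal concrete, here is a minimal sketch that estimates the output size the resize step above should produce and the fraction of pixels that remain. `fitInside` and `pixelRatio` are hypothetical helpers written for this illustration, not part of sharp, and sharp's own rounding may differ by a pixel. The 4032x3024 input is an assumed typical phone photo size.

```typescript
interface Size {
    width: number
    height: number
}

// Hypothetical helper (not a sharp API): approximates the dimensions produced by
// resize({ fit: 'inside', withoutEnlargement: true }) for a given bounding box.
function fitInside(src: Size, box: Size): Size {
    // Largest scale at which the image fits in both dimensions,
    // capped at 1 so that small images are never enlarged.
    const scale = Math.min(box.width / src.width, box.height / src.height, 1)
    return {
        width: Math.max(1, Math.round(src.width * scale)),
        height: Math.max(1, Math.round(src.height * scale)),
    }
}

// Fraction of the original pixel count that remains after resizing.
function pixelRatio(src: Size, out: Size): number {
    return (out.width * out.height) / (src.width * src.height)
}

// Assumed example input: a 12-megapixel phone photo.
const photo: Size = { width: 4032, height: 3024 }
const resized = fitInside(photo, { width: 2000, height: 2000 })
console.log(resized, pixelRatio(photo, resized).toFixed(2))
```

If the OCR model bills by image area, this ratio gives a rough idea of the saving for a given bounding box before any images are actually processed.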

hymair added the question label Apr 7, 2025
