
TypeError: Got unsupported ScalarType BFloat16 with OCRErrorPredictor #290

Open

kevinhu opened this issue Jan 25, 2025 · 2 comments
kevinhu (Contributor) commented Jan 25, 2025

I'm running Surya via Marker on a GPU, and I'm attempting to use the bfloat16 datatype. However, I'm running into this TypeError because of the order in which the cast is performed:

with torch.inference_mode():
    pred = self.model(batch_input_ids, attention_mask=batch_attention_mask)
    logits = pred.logits.detach().cpu().numpy().astype(np.float32)
    predictions.extend(np.argmax(logits, axis=1).tolist())

This can be fixed by performing the cast before the tensors are moved to NumPy:

logits = pred.logits.to(torch.float32).detach().cpu().numpy()

When running on a GPU, this should also be faster, since the cast happens on the device instead of in NumPy on the CPU.
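For reference, here's a minimal standalone reproduction of the failure and the fix; the tensor below is just a stand-in for pred.logits with an arbitrary shape:

import numpy as np
import torch

# Stand-in for pred.logits: a bfloat16 tensor (shape chosen arbitrarily).
logits_bf16 = torch.randn(4, 10, dtype=torch.bfloat16)

# Failing path: NumPy has no bfloat16 dtype, so .numpy() raises
# "TypeError: Got unsupported ScalarType BFloat16" before astype ever runs.
# logits_bf16.detach().cpu().numpy().astype(np.float32)

# Casting to float32 while the data is still a torch tensor avoids the error.
logits = logits_bf16.to(torch.float32).detach().cpu().numpy()
predictions = np.argmax(logits, axis=1).tolist()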

VikParuchuri (Owner) commented
We usually use float16 for this model - any specific reason to use bfloat?

kevinhu (Contributor, Author) commented Jan 26, 2025

In a couple of our benchmarks, we saw that bfloat16 gives better accuracy than float16; it seems there are some precision errors with float16.
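Not from the benchmarks above, but one common source of such errors is float16's narrow dynamic range: it tops out around 65504, so large intermediate values overflow to inf, while bfloat16 keeps the full float32 exponent range (at the cost of fewer mantissa bits). A quick illustration:

import torch

x = torch.tensor(70000.0)  # exceeds float16's maximum of ~65504

print(x.to(torch.float16))   # tensor(inf, dtype=torch.float16) -- overflows
print(x.to(torch.bfloat16))  # tensor(70144., dtype=torch.bfloat16) -- coarser, but finite

Whether this is what's happening in these particular benchmarks is only a guess.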
