PDF中的图片和表格是否支持? #391
dream-in-night
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
像论文中的图片怎么支持?
![微信图片_20240612145429](https://private-user-images.githubusercontent.com/17213331/338854512-7e1e050d-0980-4065-9773-e9cba3fb31a7.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzg5OTYzNzcsIm5iZiI6MTczODk5NjA3NywicGF0aCI6Ii8xNzIxMzMzMS8zMzg4NTQ1MTItN2UxZTA1MGQtMDk4MC00MDY1LTk3NzMtZTljYmEzZmIzMWE3LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMDglMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjA4VDA2Mjc1N1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWEzM2U3YWZkMzBiMzcxYmUwN2NkYTNlMWFkYWU1YjEwNDA1MjdjZGE5MzM2NGQxMDk0ZTcyY2MwYjZiY2QyZDcmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.r2MQsDonBX4pEbHwDzVkuLCG6RjMqEggDCsa0Okup6c)
![d6169f2228aff6adbab0ccdf614c3d4](https://private-user-images.githubusercontent.com/17213331/338854696-4cc5089c-4e92-43a3-9630-4f545a83bbf2.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzg5OTYzNzcsIm5iZiI6MTczODk5NjA3NywicGF0aCI6Ii8xNzIxMzMzMS8zMzg4NTQ2OTYtNGNjNTA4OWMtNGU5Mi00M2EzLTk2MzAtNGY1NDVhODNiYmYyLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMDglMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjA4VDA2Mjc1N1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWUxM2IyNmJhMzg1ZDhjNDI1Nzg4NTE0NzY0Yjk4NTc5ZjM0MDRiM2ZkZGM3ODNmYmJhOTU4NTI2Njg1YTAwNGQmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.YXuk1crHAhOaxl26i9fcmm_j4CORX_njCmwNnFQZTQs)
因为PDF相关的库和OCR能够解析表格,但是图片咋整啊?
得先检测到图片,然后再用多模态去理解图片?
但是目前从PDF中提取图片的库,有的提取不全,这个咋解决呢?
像下面的图,我想截取这么大的区域的图片,
但是用库提取出来的是这样的
用膨胀腐蚀?还是用一大堆逻辑判断啊?
用通义千问VL模型貌似也无法提取这部分。
Beta Was this translation helpful? Give feedback.
All reactions