Add option to use local Nemotron instance #1236
-
|
Hey mate, thought I'd catch you while you're a roll writing this Nemotron stuff. I was wondering if you could please add an option to the PDF parser settings for it to use a web address that we specify? This one is actually feasible to self-host, like the embeddings, so you can save a decent amount of resources doing it yourself. Also good if you have private data you would not like corporations getting high quality scans of! Thanks for your work on this, had a play around with Nemotron last night and was immediately impressed/installed it in my server. Cheers, P.S. if too busy I can do a pull request in 2-3 weeks, but not sure if the way I'd go about coding it would work with how the team wants to refactor the code for v6. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
Hi @taylorwass thanks for the discussion, happy to help.
What is behind the web address, is it a PDF to read or a hosted nemotron-parse? As far as PDF to read, are you familiar with If you mean hosted nemotron-parse, we support either:
I believe actually what you want to do is already possible. Please let me know if I misinterpreted your post |
Beta Was this translation helpful? Give feedback.
Hi @taylorwass thanks for the discussion, happy to help.
What is behind the web address, is it a PDF to read or a hosted nemotron-parse?
As far as PDF to read, are you familiar with
aadd_url: https://github.com/Future-House/paper-qa/blob/v5.29.1/src/paperqa/docs.py#L190If you mean hosted nemotron-parse, we support either:
paper-qa/packages/paper-qa-nemotron/src/paperqa_nemotron/api.py
Lines 434 to 4…