@maddox-j maddox-j commented Nov 4, 2024

Adding a WIP PR to manage merging the codebase refactor into main.

shawseanyang and others added 30 commits August 16, 2024 21:11
* add init to panza to turn it into a package

* add pyproject.toml but no dependencies yet

* add the rest of the panzamail dependencies :)

* install dependencies based on pyproject.toml instead of raw pip and conda commands
* Add Ollama inference

* expose Panza as a web server

* add api keys to env variables and check in server

* check api key

* switch to fastapi to prevent model reloading

* Add ollama-backed streaming HTTP server

---------

Co-authored-by: Armand Nicolicioiu <[email protected]>
Eugenia Iofinova and others added 7 commits October 28, 2024 12:48
Revert "remove some unused .sh files"

This reverts commit 233083e.
- Fixed an error encountered in JSON dumps for Message and mboxMessage objects
- Clarified the email and username requirements
- Changed wandb_disabled default to true to match the README
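
For context on the JSON dumps fix: `json.dumps` raises a `TypeError` when it meets an `email.message.Message` or `mailbox.mboxMessage` object, because neither is a built-in JSON type. The commit itself is not shown here, so the following is only a minimal sketch of one common workaround, not the fix that was merged. The helper name `message_safe_default` is hypothetical.

```python
import json
import mailbox
from email.message import Message


def message_safe_default(obj):
    """Fallback for json.dumps: serialize email message objects as text.

    Hypothetical helper, not taken from the Panza codebase.
    """
    # mailbox.mboxMessage subclasses email.message.Message, so one check covers both.
    if isinstance(obj, Message):
        return obj.as_string()
    raise TypeError(f"Object of type {type(obj).__name__} is not JSON serializable")


# Build a small mboxMessage in memory and serialize a record that contains it.
msg = mailbox.mboxMessage()
msg["Subject"] = "Hello"
msg.set_payload("Body text")

encoded = json.dumps({"email": msg}, default=message_safe_default)
decoded = json.loads(encoded)
```

Passing the helper through `default=` keeps the rest of the record untouched and only affects values that `json` cannot encode on its own.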
Contributor Author


Need to rename, and to link back to the original README


If running with Ollama, then Ollama needs to be installed from the [web page](https://ollama.com/).

Then, you will need to convert your model into a GGUF file.
Contributor Author


is it beneficial to add more support for this?
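
If more support is wanted, one low-cost option might be a prerequisite check before inference starts. The sketch below assumes only two things: that the `ollama` executable should be on the `PATH`, and that a GGUF file begins with the four magic bytes `GGUF`. Both function names are hypothetical and are not part of Panza.

```python
import shutil
from pathlib import Path

# GGUF files start with this 4-byte magic number.
GGUF_MAGIC = b"GGUF"


def ollama_installed() -> bool:
    """Return True if an `ollama` executable is found on the PATH."""
    return shutil.which("ollama") is not None


def looks_like_gguf(path) -> bool:
    """Return True if `path` is a file whose first four bytes are the GGUF magic."""
    candidate = Path(path)
    if not candidate.is_file():
        return False
    with candidate.open("rb") as handle:
        return handle.read(4) == GGUF_MAGIC
```

This only verifies the file header, not that the model loads, so it is a sanity check rather than a validation of the conversion.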


- To run Panza after a full training run, try something like `CUDA_VISIBLE_DEVICES=0 python3 runner.py user=USERNAME interfaces=cli writer/llm=transformers`.
- To run Panza after a RoSA or LoRA training run, replace `writer/llm=transformers` with `writer/llm=peft` TODO Armand: can we fix this?

Contributor Author

@maddox-j maddox-j Nov 6, 2024


Integrate this with the inference markdown and resolve the TODO.
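
Until the TODO is resolved, the choice between the two `writer/llm` values could be captured in a small helper. This is a sketch under the assumption that the mapping in the two bullets above is complete: full training uses `transformers`, and RoSA or LoRA uses `peft`. The function name and the `training` labels are hypothetical.

```python
import shlex

# Training methods that, per the README text above, need the PEFT writer.
PEFT_METHODS = {"rosa", "lora"}


def build_runner_command(username: str, training: str, interface: str = "cli") -> list[str]:
    """Build the runner.py argument list for a given training method."""
    method = training.lower()
    if method == "full":
        llm = "transformers"
    elif method in PEFT_METHODS:
        llm = "peft"
    else:
        raise ValueError(f"Unknown training method: {training!r}")
    return [
        "python3",
        "runner.py",
        f"user={username}",
        f"interfaces={interface}",
        f"writer/llm={llm}",
    ]


# Render the command as a single shell-ready string.
full_command = shlex.join(build_runner_command("USERNAME", "full"))
```

The `CUDA_VISIBLE_DEVICES=0` prefix from the README is an environment variable, so it would be set separately, for example through the `env` argument of `subprocess.run`.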

@@ -0,0 +1,9 @@
email_address: "[email protected]" # Change this to your email address!
username: "abc" # TODO(armand): Use custom resolver to extract username from email address.

Contributor Author


Address TODO
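
One way the TODO might be addressed, sketched under assumptions: "username" is taken to mean the part of the address before the `@`, and the config is assumed to be loaded through OmegaConf, which the word "resolver" and the `key=value` overrides suggest but this diff does not confirm. The resolver name `username_from_email` is hypothetical.

```python
def username_from_email(email_address: str) -> str:
    """Return the local part of an email address (the text before the last '@')."""
    local_part, separator, domain = email_address.strip().rpartition("@")
    if not separator or not local_part or not domain:
        raise ValueError(f"Not a valid email address: {email_address!r}")
    return local_part


try:
    # Assumption: OmegaConf is the config backend. Skipped if it is not installed.
    from omegaconf import OmegaConf

    OmegaConf.register_new_resolver(
        "username_from_email", username_from_email, replace=True
    )
except ImportError:
    pass  # The plain function above still works without OmegaConf.
```

With the resolver registered before the config is composed, the YAML line could become `username: ${username_from_email:${email_address}}`, so users would only need to edit `email_address`.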

</div>


## TODO: Prerequisites
Contributor Author


Clean TODO
