This app is a simple topic modeling tool that uses Latent Dirichlet Allocation (LDA) to discover hidden topics in a corpus of text documents. It's designed for researchers, data scientists, and anyone interested in text analytics. Its basic version runs in the browser.
You can try out the app here. If you want to visualize the topics with the more advanced PyLDAvis library, you need to run the app locally.
This section provides instructions on how to install the dependencies required to run the app locally. Make sure you have Python 3.11 or higher and Node.js installed before proceeding.
Use the package manager poetry to install all Python dependencies. Run the following command in your terminal:
poetry installUse the package manager npm to install more dev dependencies like prettier. Run the following command in your terminal:
npm installTo run the app locally, use the following command.
poetry run streamlit run src/app.pyIf you want to contribute to this project, you can use the following commands to test the browser version of the app.
python3 -m http.server 8000 --directory .To format the code, use the following commands.
poetry run ruff format
npm formatThis project is maintained by @maehr. Please understand that we won't be able to provide individual support via email. We also believe that help is much more valuable if it's shared publicly, so that more people can benefit from it.
| Type | Platforms |
|---|---|
| 🚨 Bug Reports | GitHub Issue Tracker |
| 📚 Docs Issue | GitHub Issue Tracker |
| 🎁 Feature Requests | GitHub Issue Tracker |
| 🛡 Report a security vulnerability | See SECURITY.md |
| 💬 General Questions | GitHub Discussions |
No changes are currently planned.
Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.
We use SemVer for versioning. For the versions available, see the tags on this repository.
- Moritz Mähr - Initial work - maehr
See also the list of contributors who participated in this project.
This project is licensed under the GNU Affero General Public License v3.0 - see the LICENSE.md file for details.