Skip to content

Unleashing the power of local vlms with moondream and streamlit

License

Notifications You must be signed in to change notification settings

smaranjitghose/LunarSightAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

7204626 Β· Jan 11, 2025

History

3 Commits
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025
Jan 11, 2025

Repository files navigation

πŸŒ™ LunarSight AI

A versatile and user-friendly visual analysis interface powered by Moondream VLM, built with Python and Streamlit.

🌟 Features

  • πŸ“ Intelligent Image Captioning
  • 🎯 Precise Object Detection with Bounding Boxes
  • πŸ“ Object Pointing Capabilities
  • πŸ” Natural Language Visual Querying
  • 🎨 Clean, Tab-based User Interface
  • πŸ’Ύ Download Options for Analyzed Images
  • πŸ” Secure API Key Management

πŸ–ΌοΈ Demo

Image Captioning Demo

Visual Querying Demo

πŸ”§ Prerequisites

  • Python 3.11 or higher
  • Web Browser
  • Moondream API key from Moondream Console or download the model file from here

πŸ“₯ Installation

  1. Clone the repository:
git clone https://github.com/smaranjitghose/lunarsightai.git
cd lunarsightai
  1. Create and activate virtual environment:
# Windows
python -m venv env
.\env\Scripts\activate

# Linux/Mac
python3 -m venv env
source env/bin/activate
  1. Install required packages:
pip install -r requirements.txt

πŸš€ Usage

  1. Start the application:
streamlit run app.py
  1. Open your browser and navigate to:
http://localhost:8501

🎯 Example Use Cases

πŸ“ Image Captioning

  • Get detailed descriptions of any image
  • Perfect for accessibility features
  • Useful for content indexing

🎯 Object Detection

  • "Detect all people in the image"
  • "Find books on the shelf"
  • "Locate electronic devices"

πŸ“ Object Pointing

  • "Point to the main subject"
  • "Identify the location of logos"
  • "Mark all faces in the image"

πŸ” Visual Querying

  • "What colors are dominant in this image?"
  • "How many people are wearing glasses?"
  • "Describe the environment in the image"

πŸ› οΈ Troubleshooting

Common Issues:

  1. API Key Error

    • Verify API key is entered correctly
    • Check if API key has necessary permissions
    • Ensure API key is active
  2. Image Upload Issues

    • Check if image format is supported (JPG, JPEG, PNG)
    • Ensure image size is reasonable
    • Verify image is not corrupted
  3. Analysis Failures

    • Check internet connection
    • Verify API quota hasn't been exceeded
    • Ensure prompts are clear and specific

🀝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the project
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

πŸ“ License

This project is licensed under the MIT License - see the LICENSE file for details.


Made with ❀️ by Smaranjit Ghose

About

Unleashing the power of local vlms with moondream and streamlit

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks