🌐 The app is available online: Deploy to Vercel, Netlify, or any platform
A real-time collaborative image editing platform powered by AI, enabling multiple users to modify and enhance images simultaneously through natural language commands or voice instructions.
- Natural Language Image Editing: Transform your images using simple text commands
- Voice-Controlled Editing: Speak your editing instructions naturally
- Real-time Collaboration: Multiple users can work on the same image simultaneously
- Smart Image Processing: Powered by advanced AI models
- Remove image backgrounds
- Adjust contrast
- Change color schemes
- Generate new images from descriptions
- Edit existing images with natural language instructions
- Make images more realistic
- Navigate through image history
The platform is built around an intelligent Image Modifier Agent that:
- Receives instructions (text or transcribed audio)
- Processes them through AI to understand the intended modifications
- Applies the changes in real-time
- Maintains a history of modifications for easy navigation
/process-text: Accept text commands for image modification/process-audio: Accept voice commands for image modification
Both endpoints support:
- Uploading new images
- Modifying existing images
- Maintaining modification history
- Real-time collaboration through shared image states
- Python 3.12+
- OpenAI API key
- FastAPI
- Base64 image processing capabilities
- Clone the repository
- Set up your environment variables:
export OPENAI_API_KEY="your-api-key"
-
Run the app:
python src/collaborative-photoshop/app.py
This will start the FastAPI server on http://localhost:8000
-
API Documentation:
- Once the app is running, visit http://localhost:8000/docs for interactive API documentation
- You can test the
/process-audioendpoint through the Swagger UI
-
Usage:
- The API accepts audio files and image data
- Upload audio commands along with an image
- The agent will process your voice commands and modify the image
# Text-based editing
"Remove the background of the image. Then adjust the contrast by 50%. Then make it more realistic."
# Voice-based editing
"Make the colors more vibrant and remove the background"Contributions are welcome!