Multimodal Summarization | Seamlessly Summarize Youtube Videos | Documents | Analyze Complex Diagrams

get-rishabh/Summarize-Karo

Summarize-Karo

A multimodal data summarization application using Google's Gemini AI to summarize content from YouTube videos, images, and documents.

YouTube Summarizer Demo Video

video.mp4

Image Summarizer Demo Video

IMAGE.mp4

Document Summarizer Demo Video

compressed.DOCUMENT-DEMO.mp4

Features

  • YouTube Video Summarizer: Extract transcripts and generate comprehensive summaries
  • Image Analyzer & Summarizer: Analyze and describe images with AI
  • Document Summarizer: Process PDF documents and generate summaries
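
The YouTube flow listed above (URL, then transcript, then summary) can be sketched roughly as follows. This is a minimal sketch and not the repository's actual code: the helper names (`extract_video_id`, `build_prompt`, `fetch_transcript`, `summarize`), the prompt wording, and the model name `gemini-1.5-flash` are all assumptions. The library calls assume the pinned versions youtube-transcript-api 0.6.2 and google-generativeai 0.8.5.

```python
import os
from urllib.parse import urlparse, parse_qs

YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com"}


def extract_video_id(url: str) -> str:
    """Return the video ID from a watch, short-link, embed, or shorts URL."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    video_id = ""
    if host == "youtu.be":
        # Short links carry the ID as the first path segment.
        video_id = parsed.path.lstrip("/").split("/")[0]
    elif host in YOUTUBE_HOSTS:
        if parsed.path == "/watch":
            video_id = parse_qs(parsed.query).get("v", [""])[0]
        else:
            parts = [p for p in parsed.path.split("/") if p]
            if len(parts) >= 2 and parts[0] in {"embed", "shorts", "live"}:
                video_id = parts[1]
    if not video_id:
        raise ValueError(f"Could not find a video ID in: {url}")
    return video_id


def build_prompt(transcript: str) -> str:
    """Hypothetical prompt; the app's real wording is not shown in this README."""
    return "Summarize the following video transcript:\n\n" + transcript


def fetch_transcript(video_id: str) -> str:
    # Imported lazily so the pure helpers above work without the package.
    from youtube_transcript_api import YouTubeTranscriptApi

    # 0.6.x API: returns a list of dicts with 'text', 'start', 'duration'.
    segments = YouTubeTranscriptApi.get_transcript(video_id)
    return " ".join(segment["text"] for segment in segments)


def summarize(transcript: str, model_name: str = "gemini-1.5-flash") -> str:
    import google.generativeai as genai

    genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
    model = genai.GenerativeModel(model_name)  # model name is an assumption
    return model.generate_content(build_prompt(transcript)).text
```

Calling `summarize(fetch_transcript(extract_video_id(url)))` would chain the three steps; only `extract_video_id` and `build_prompt` run without network access or an API key.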

Cloud Deployment Issue

Problems on Cloud Deployment (Streamlit Cloud, Hugging Face Spaces)

When the app is deployed to a cloud platform such as Streamlit Cloud or Hugging Face Spaces, YouTube often flags automated tools like yt-dlp as bots and requires authentication, which causes the following error:

ERROR: [youtube] video_id: Sign in to confirm you're not a bot. Use --cookies-from-browser or --cookies for the authentication.
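
As the error message itself suggests, one possible workaround is to pass yt-dlp an exported cookies file. The sketch below shows how that could look through yt-dlp's Python API. It is illustrative only and has not been verified against this app: the helper names and the `YT_COOKIES_FILE` environment variable are hypothetical, and a cookies file is a credential, so whether it belongs on a shared host is a decision for the deployer.

```python
import os


def build_ydl_opts(cookie_file=None):
    """Build yt-dlp options, adding a cookies file only when one is given."""
    opts = {"quiet": True, "skip_download": True}
    if cookie_file:
        # 'cookiefile' is the Python-API counterpart of the --cookies flag.
        opts["cookiefile"] = cookie_file
    return opts


def is_bot_check_error(message: str) -> bool:
    """Detect the bot-check error by its wording (tolerates either apostrophe)."""
    lowered = message.lower()
    return "sign in to confirm" in lowered and "not a bot" in lowered


def fetch_video_info(url: str, cookie_file=None):
    # Imported lazily so the pure helpers above work without yt-dlp installed.
    import yt_dlp

    # YT_COOKIES_FILE is a hypothetical variable name used for this sketch.
    cookie_file = cookie_file or os.environ.get("YT_COOKIES_FILE")
    try:
        with yt_dlp.YoutubeDL(build_ydl_opts(cookie_file)) as ydl:
            return ydl.extract_info(url, download=False)
    except yt_dlp.utils.DownloadError as exc:
        if is_bot_check_error(str(exc)):
            raise RuntimeError(
                "YouTube asked for sign-in; supply a cookies file or run locally."
            ) from exc
        raise
```

Translating the raw yt-dlp failure into one clear message lets the Streamlit UI explain the problem instead of showing a stack trace.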

Installation

pip install -r requirements.txt

Usage

  1. Run the Streamlit app:
     streamlit run app.py
  2. Navigate to the YouTube Video Summarizer section
  3. Paste a YouTube URL
  4. Click "Summarise Now"

Environment Variables

Create a .env file with:

GOOGLE_API_KEY=your_google_api_key_here
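
A minimal sketch of how the key might be loaded and checked at startup, assuming python-dotenv and google-generativeai as pinned in requirements.txt. The function names `get_api_key` and `configure_gemini` are hypothetical and are not taken from the app's code.

```python
import os

PLACEHOLDER = "your_google_api_key_here"


def get_api_key(env=None) -> str:
    """Return GOOGLE_API_KEY, failing early if it is missing or a placeholder."""
    env = os.environ if env is None else env
    key = env.get("GOOGLE_API_KEY", "").strip()
    if not key or key == PLACEHOLDER:
        raise RuntimeError("GOOGLE_API_KEY is not set; add it to your .env file.")
    return key


def configure_gemini() -> None:
    # Imported lazily so get_api_key() can be used without these packages.
    from dotenv import load_dotenv
    import google.generativeai as genai

    load_dotenv()  # loads variables from a .env file into os.environ
    genai.configure(api_key=get_api_key())
```

Checking the key once at startup surfaces a missing or placeholder value immediately, rather than on the first summarization request.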

Dependencies

  • streamlit==1.48.0
  • google-generativeai==0.8.5
  • youtube-transcript-api==0.6.2
  • yt-dlp==2025.8.11
  • python-dotenv==1.1.1
  • And more (see requirements.txt)
