Added example Podcast_and_Audio_Transcription #665

SonjeVilas · 2025-04-04T09:28:51Z

Adds automated audio transcription using Gemini 2.0 with:

✅ Speaker identification (labeled or as Speaker A/B)
✅ Precision timestamps ([HH:MM:SS])
✅ Music/sound effect detection (e.g., [Jingle] or [Song Name])
✅ Clean text output with [END] marker

Testing: Verified with podcasts & call recordings.
Deps: jinja2, Gemini API client.

Useful for podcasts, interviews, and call analysis.

Adds automated audio transcription using Gemini 2.0 with: ✅ Speaker identification (labeled or as Speaker A/B) ✅ Precision timestamps ([HH:MM:SS]) ✅ Music/sound effect detection (e.g., [Jingle] or [Song Name]) ✅ Clean text output with [END] marker Testing: Verified with podcasts & call recordings. Deps: jinja2, Gemini API client. Useful for podcasts, interviews, and call analysis.

review-notebook-app · 2025-04-04T09:28:56Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

Giom-V · 2025-04-04T13:33:02Z

Thanks @SonjeVilas, that's an interesting example. I won't have time to review it today but I'll try to do it next week.

Podcast_and_Audio_Transcription.ipynb

Giom-V

Hello @SonjeVilas,

That's a nice example. On top of what @andycandy already reported, and the minor stuff I pointed out, can you:

move the notebook in the examples/ directory
add a link to it in the examples' README
add a "what's next" section at the end of the notebook, pointing to similar notebooks (or just you preferred ones).
run the formatting script (cf. https://github.com/google-gemini/cookbook/actions/runs/14262332934/job/39989516304?pr=665)

Thanks again!

SonjeVilas · 2025-04-20T10:34:42Z

@Giom-V Thanks For the Review... :)

examples/Podcast_and_Audio_Transcription.ipynb

SonjeVilas · 2025-04-29T09:42:51Z

@nikitamaia Thanks for Review :)

Giom-V · 2025-05-05T14:24:40Z

examples/Podcast_and_Audio_Transcription.ipynb

@@ -0,0 +1,531 @@
+{


Line #4. file_path = "https://storage.googleapis.com/generativeai-downloads/data/State_of_the_Union_Address_30_January_1961.mp3"
I think we need to find a better example with 2 speakers to showcase the diarization. What about something like https://archive.org/details/Apollo11Audio (not the whole recording but a specific part). They also have some open-sourced podcasts I think.
In any case, whatever the source of your audio file If you do, don't forget to cite where it comes from.

Reply via ReviewNB

Giom-V · 2025-06-04T20:14:15Z

Hello @SonjeVilas, do you still want to push that example?

SonjeVilas · 2025-06-05T03:37:53Z

Thanks for reminder ! I will complete this PR on this weekend.

github-actions bot added the status:awaiting review PR awaiting review from a maintainer label Apr 4, 2025

andycandy reviewed Apr 6, 2025

View reviewed changes

Podcast_and_Audio_Transcription.ipynb Outdated Show resolved Hide resolved

Podcast_and_Audio_Transcription.ipynb Outdated Show resolved Hide resolved

Giom-V reviewed Apr 7, 2025

View reviewed changes

Podcast_and_Audio_Transcription.ipynb Outdated Show resolved Hide resolved

Podcast_and_Audio_Transcription.ipynb Outdated Show resolved Hide resolved

Podcast_and_Audio_Transcription.ipynb Outdated Show resolved Hide resolved

Giom-V reviewed Apr 7, 2025

View reviewed changes

Podcast_and_Audio_Transcription.ipynb Outdated Show resolved Hide resolved

Giom-V requested changes Apr 7, 2025

View reviewed changes

Giom-V self-assigned this Apr 7, 2025

SonjeVilas added 5 commits April 20, 2025 15:47

Created using Colab

2d92d00

Created using Colab

9607ea3

Delete Podcast_and_Audio_Transcription.ipynb

a5c868a

Add files via upload

92ef723

Update README.md

a6dfb66

nikitamaia reviewed Apr 25, 2025

View reviewed changes

examples/Podcast_and_Audio_Transcription.ipynb Show resolved Hide resolved

examples/Podcast_and_Audio_Transcription.ipynb Show resolved Hide resolved

examples/Podcast_and_Audio_Transcription.ipynb Show resolved Hide resolved

Created using Colab

a7ebd9b

SonjeVilas requested a review from Giom-V May 2, 2025 16:01

Giom-V reviewed May 5, 2025

View reviewed changes

Giom-V added the component:examples Issues/PR referencing examples folder label Jun 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added example Podcast_and_Audio_Transcription #665

Added example Podcast_and_Audio_Transcription #665

SonjeVilas commented Apr 4, 2025

Uh oh!

review-notebook-app bot commented Apr 4, 2025

Uh oh!

Giom-V commented Apr 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Giom-V left a comment

Uh oh!

SonjeVilas commented Apr 20, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SonjeVilas commented Apr 29, 2025

Uh oh!

Giom-V May 5, 2025 •

edited

Loading

Uh oh!

Giom-V commented Jun 4, 2025

Uh oh!

SonjeVilas commented Jun 5, 2025

Uh oh!

Uh oh!

Added example Podcast_and_Audio_Transcription #665

Are you sure you want to change the base?

Added example Podcast_and_Audio_Transcription #665

Conversation

SonjeVilas commented Apr 4, 2025

Uh oh!

review-notebook-app bot commented Apr 4, 2025

Uh oh!

Giom-V commented Apr 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Giom-V left a comment

Choose a reason for hiding this comment

Uh oh!

SonjeVilas commented Apr 20, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SonjeVilas commented Apr 29, 2025

Uh oh!

Giom-V May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Giom-V commented Jun 4, 2025

Uh oh!

SonjeVilas commented Jun 5, 2025

Uh oh!

Uh oh!

Giom-V May 5, 2025 •

edited

Loading