Captions - webvtt guide, subtitleedit, whisperai #4

Merged
merged 2 commits into from
May 24, 2024
2 changes: 1 addition & 1 deletion README.md
@@ -1,2 +1,2 @@
# Digital Services Documentation
Documentation for [LAI Digital Services unit](https://library.gwu.edu/digital-collections-and-services). The documentation can be accessed at: (https://gwu-libraries.github.io/DigitalServicesDocs/)
Documentation for [LAI Digital Services unit](https://library.gwu.edu/digital-collections-and-services). The documentation can be accessed at: https://gwu-libraries.github.io/DigitalServicesDocs/
2 changes: 1 addition & 1 deletion digitization/av/av_records.md
@@ -4,7 +4,7 @@ title: "Enhancing AV records"
permalink: /digitization/avrecords/
parent: "Digitization: Audiovisual Carriers"
grand_parent: Digitization
nav_order: 1
nav_order: 3
---

When digitizing audiovisual carriers we should update finding aids to reflect new information gained from the digitization process. Use the following fields in ArchivesSpace to hold new information.
37 changes: 29 additions & 8 deletions digitization/av/captions.md
@@ -1,24 +1,45 @@
---
layout: page
title: "Creating Captions and Transcripts for A/V material"
title: "Captions and Transcripts for AV material"
permalink: /digitization/captions/
parent: "Digitization: Audiovisual Carriers"
grand_parent: Digitization
nav_order: 1
nav_order: 2
---
# Creating Captions and Transcripts for Audiovisual material
Captions and transcripts for audiovisual material make collections material more accessible and enhance the over-all level of access. Captions should be created as VTT (WEBVTT) format. Captions can be embedded into the content, but only for access files.
# Captions and Transcripts for Audiovisual Material
Captions and transcripts for audiovisual material make collections material more accessible and enhance the overall level of access.[^1] Captions should be created in the WebVTT format. Captions can be embedded or burned into access files, but they should not be embedded into master files.

## Tools for Creating and Reviewing Captions
# Tools for Creating and Reviewing Captions

## WhisperAI - Jupyter Notebook Python Script

The Digital Services unit uses [WhisperAI](https://github.com/openai/whisper) via a Python script run in a Jupyter Notebook environment to generate WebVTT caption files. This script sets WebVTT metadata values as described in the [Guidelines for Embedding Metadata in WebVTT Files (FADGI)](https://www.digitizationguidelines.gov/guidelines/accessibilty_WebVTT.html).

The WebVTT files generated through this script can be edited in SubtitleEdit.
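
To illustrate what such a script produces, here is a minimal sketch of turning transcription segments into WebVTT cue text. It is not the unit's actual script: the function names `format_timestamp` and `segments_to_webvtt` are hypothetical, and the sketch assumes each segment is a dictionary with `start`, `end` (in seconds), and `text` keys, which is the shape of the segments Whisper returns.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def segments_to_webvtt(segments: list[dict]) -> str:
    """Build WebVTT text from segments with 'start', 'end', and 'text' keys.

    Hypothetical helper: assumes times are given in seconds.
    """
    lines = ["WEBVTT", ""]
    for segment in segments:
        start = format_timestamp(segment["start"])
        end = format_timestamp(segment["end"])
        lines.append(f"{start} --> {end}")
        lines.append(segment["text"].strip())
        lines.append("")  # a blank line ends each cue
    return "\n".join(lines)
```

The resulting text can be saved with a `.vtt` extension and then reviewed in a caption editor.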

## SubtitleEdit
SubtitleEdit is an open source tool for creating and editing subtitles and captions. SubtitleEdit can also be used as a GUI for WhisperAI.

## Adobe Premiere
The SCRC digitization lab provides access to Adobe Premiere. You may also request a license for your individual workstation via GWU IT.

- [Guide: Creating Captions using Adobe Premiere](https://docs.google.com/document/d/1ZRbPUFNSYGfbZA5lTdnkvejmCeflvuGvMg0Ot4aBwyU/edit?usp=sharing)

## SubtitleEdit
SubtitleEdit is an open source tool for creating and editing subtitles and captions. SubtitleEdit can also be used as a GUI for WhisperAI.

[^1]: Dave Rodriguez, Bryan J. Brown, and Florida State University Libraries, “Comparative Analysis of Automated Speech Recognition Technologies for Enhanced Audiovisual Accessibility,” The Code4Lib Journal, no. 58 (December 4, 2023), https://journal.code4lib.org/articles/17820.
# File Naming for Caption and Subtitle Files
WebVTT files should inherit the file name of the AV source file. The file name should also include information about the language to improve accessibility.

Example:
- *Source AV File*: ms0000_s01_ss02_c03_i01.mp4
- *Language*: eng
- **WebVTT file**: ms0000_s01_ss02_c03_i01_eng.vtt
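
The naming pattern in the example above can be sketched in Python. The helper name `caption_filename` is hypothetical, and the sketch assumes the language is supplied as a short code such as `eng` and appended after an underscore, as in the example.

```python
from pathlib import Path


def caption_filename(source_file: str, language: str) -> str:
    """Derive a WebVTT file name from the AV source file name.

    Hypothetical helper: keeps the source file's base name, appends the
    language code, and uses the .vtt extension.
    """
    stem = Path(source_file).stem  # file name without its extension
    return f"{stem}_{language}.vtt"


print(caption_filename("ms0000_s01_ss02_c03_i01.mp4", "eng"))
```

Deriving the name from the source file keeps the caption file and the AV file sorted together and makes the pairing obvious.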

# Metadata for WebVTT
See [Guidelines for Embedding Metadata in WebVTT Files (FADGI)](https://www.digitizationguidelines.gov/guidelines/accessibilty_WebVTT.html). Required elements should be used when possible. The following elements are *always recommended*:

- Type
- Language
- Originating File
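
As a rough sketch, the three always-recommended elements could be written into a WebVTT comment block like this. The helper names are hypothetical, and placing the elements as `Key: value` lines in a `NOTE` block directly after the `WEBVTT` line is an assumption; check the FADGI guidelines for the exact placement, element wording, and permitted values (the `caption` value used for Type in the usage line is only a placeholder).

```python
def metadata_note(type_: str, language: str, originating_file: str) -> str:
    """Build a WebVTT NOTE comment block holding three metadata elements.

    Assumption: elements are written as "Key: value" lines; confirm the
    exact layout against the FADGI guidelines.
    """
    values = {
        "Type": type_,
        "Language": language,
        "Originating File": originating_file,
    }
    for key, value in values.items():
        # WebVTT comment blocks must not contain the "-->" sequence.
        if "-->" in value:
            raise ValueError(f"{key} must not contain '-->'")
    lines = ["NOTE"] + [f"{key}: {value}" for key, value in values.items()]
    return "\n".join(lines) + "\n"


def webvtt_header(type_: str, language: str, originating_file: str) -> str:
    """Return the WEBVTT signature line followed by the metadata block."""
    return "WEBVTT\n\n" + metadata_note(type_, language, originating_file) + "\n"


print(webvtt_header("caption", "eng", "ms0000_s01_ss02_c03_i01.mp4"))
```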

---
[^1]: Dave Rodriguez, Bryan J. Brown, and Florida State University Libraries, “Comparative Analysis of Automated Speech Recognition Technologies for Enhanced Audiovisual Accessibility,” The Code4Lib Journal, no. 58 (December 4, 2023), https://journal.code4lib.org/articles/17820.