App should automatically split audio files >25 MB and transcribe each part #8

Bklieger · 2024-06-23T18:34:54Z

Currently, the app can only handle audio files that are less than 25 MB. This is a limitation due to Whisper's max input file size of 25 MB. However, we can get around this limitation by splitting audio files greater than 25 MB into several files which can each be transcribed by the API. Then, the results can be combined into one transcript.

It should be noted that I believe there still needs to be an upper limit on the file size to preserve Whisper API cost. In addition, if the transcript becomes too large (# of tokens), then Groq API rate limits may cause errors on Groq API calls. There should be a check on this size as well.

Split audio files greater than 25 MB into several files which are each transcribed by the API. The results are combined into one transcript. An upper limit of 100 MB is applied.

Bklieger · 2024-06-27T22:49:47Z

@MentatBot

Currently, the app can only handle audio files that are less than 25 MB. This is a limitation due to Whisper's max input file size of 25 MB. However, we can get around this limitation by splitting audio files greater than 25 MB into several files which can each be transcribed by the API. Then, the results can be combined into one transcript.

It should be noted that I believe there still needs to be an upper limit on the file size to preserve Whisper API cost. In addition, if the transcript becomes too large (# of tokens), then Groq API rate limits may cause errors on Groq API calls. There should be a check on this size as well.

mentatbot · 2024-06-27T22:49:57Z

I will start working on this issue

Bklieger added the enhancement New feature or request label Jun 23, 2024

Bklieger referenced this issue Jun 23, 2024

Add segmenting of files greater than 25 MB.

ae0c824

Split audio files greater than 25 MB into several files which are each transcribed by the API. The results are combined into one transcript. An upper limit of 100 MB is applied.

mentatbot bot mentioned this issue Jun 27, 2024

Add support for splitting and transcribing large audio files #20

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

App should automatically split audio files >25 MB and transcribe each part #8

App should automatically split audio files >25 MB and transcribe each part #8

Bklieger commented Jun 23, 2024

Bklieger commented Jun 27, 2024

mentatbot bot commented Jun 27, 2024

App should automatically split audio files >25 MB and transcribe each part #8

App should automatically split audio files >25 MB and transcribe each part #8

Comments

Bklieger commented Jun 23, 2024

Bklieger commented Jun 27, 2024

mentatbot bot commented Jun 27, 2024