Skip to content

Commit 53008a1

Browse files
authored
Minor-fixes (#415)
* switch to new fairseq * clean up requirements * update changelog * fix decimals in avg time extension * add voices and piano model recommendation * fix fairseq version * switch fairseq to new mac wheel * fix tortoise voices for React UI * note additional torch version
1 parent 2fe746c commit 53008a1

File tree

7 files changed

+35
-11
lines changed

7 files changed

+35
-11
lines changed

README.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -74,6 +74,24 @@
7474

7575
## Changelog
7676

77+
Nov 11:
78+
* Switch to a fixed fairseq version for windows reducing installation conflicts and speeding up updates.
79+
80+
## October 2024
81+
82+
Oct 28:
83+
* Added installer tests, model downloader, and pip CPU-only option for Torch.
84+
85+
Oct 24:
86+
* Downgraded Gradio to 5.1.0 due to a bug.
87+
* Added test workflows and fixed minor bugs.
88+
89+
Oct 22:
90+
* Fixed Dockerfile issues for smoother deployment.
91+
92+
Oct 21:
93+
* Redesigned README: improved Whisper extension, added changelogs for August, September, and October, updated screenshots, and reorganized content.
94+
7795
Oct 19:
7896
* Fixed extension logs and added new extensions.
7997

extensions/builtin/extension_decorator_average_execution_time/main.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ def wrapper(*args, **kwargs):
2727
result = fn(*args, **kwargs)
2828
end = time.time()
2929
times.append(end - start)
30-
print("Average execution time: ", sum(times) / len(times))
30+
print(f"Average execution time: {sum(times) / len(times):.3f}")
3131
return result
3232

3333
return wrapper

installer_scripts/js/initializeApp.js

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,11 +8,9 @@ const { applyDatabaseConfig } = require("./applyDatabaseConfig.js");
88

99
const DEBUG_DRY_RUN = false;
1010

11-
const torchVersion = "2.3.1";
11+
const torchVersion = "2.3.1"; // 2.4.1+cu118
1212
const cudaVersion = "11.8";
1313

14-
// xformers==0.0.19 # For torch==2.0.0 project plane
15-
// xformers==xformers-0.0.22.post7 # For torch==2.1.0 project plane
1614
const pythonVersion = `3.10.11`;
1715
const pythonPackage = `python=${pythonVersion}`;
1816
const ffmpegPackage = `conda-forge::ffmpeg=4.4.2[build=lgpl*]`;

requirements.txt

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,5 @@
11
safetensors
2-
# safetensors==0.3.1 # Until they fix it
32
ffmpeg-python # Apache 2.0
4-
# gradio==3.48.0
53
gradio==5.1.0 # MIT License
64
python-dotenv==1.0.0
75
soundfile==0.12.1; sys_platform == 'win32' # torchaudio
Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,7 @@
11
# pre-install fairseq for bark_hubert_quantizer
2-
fairseq @ https://github.com/Sharrnah/fairseq/releases/download/v0.12.4/fairseq-0.12.4-cp310-cp310-win_amd64.whl ; sys_platform == 'win32' # MIT License
3-
fairseq==0.12.2 ; sys_platform == 'linux' or sys_platform == 'darwin' # MIT License
2+
fairseq @ https://github.com/rsxdalv/fairseq/releases/download/v0.12.3/fairseq-0.12.13-cp310-cp310-win_amd64.whl ; sys_platform == 'win32' # MIT License
3+
# fairseq==0.12.2 ; sys_platform == 'linux' or sys_platform == 'darwin' # MIT License
4+
fairseq==0.12.2 ; sys_platform == 'linux' # MIT License
5+
# https://github.com/rsxdalv/fairseq/releases/download/v0.12.3/fairseq-0.12.13-cp310-cp310-macosx_11_0_universal2.whl
6+
fairseq @ https://github.com/rsxdalv/fairseq/releases/download/v0.12.3/fairseq-0.12.13-cp310-cp310-macosx_11_0_universal2.whl ; sys_platform == 'darwin' # MIT License
47
bark_hubert_quantizer @ git+https://github.com/rsxdalv/bark-voice-cloning-HuBERT-quantizer@bark_hubert_quantizer#egg=bark_hubert_quantizer # MIT License

tts_webui/stable_audio/stable_audio_tab.py

Lines changed: 9 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -257,9 +257,15 @@ def model_select_ui():
257257

258258

259259
def model_download_ui():
260-
gr.Markdown(
261-
"Models can be found on the [HuggingFace model hub](https://huggingface.co/models?search=stable-audio-open-1.0)."
262-
)
260+
gr.Markdown("""
261+
Models can be found on the [HuggingFace model hub](https://huggingface.co/models?search=stable-audio-open-1.0).
262+
263+
Recommended models:
264+
265+
voices: RoyalCities/Vocal_Textures_Main
266+
piano: RoyalCities/RC_Infinite_Pianos
267+
original: stabilityai/stable-audio-open-1.0
268+
""")
263269
pretrained_name_text = gr.Textbox(
264270
label="HuggingFace repo name, e.g. stabilityai/stable-audio-open-1.0",
265271
value="",

tts_webui/tortoise/tortoise_tab.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -36,6 +36,7 @@ def tortoise_ui():
3636
value="Press refresh to load the list",
3737
show_label=False,
3838
container=False,
39+
allow_custom_value=True,
3940
)
4041
gr_open_button_simple(
4142
TORTOISE_VOICE_DIR_ABS, api_name="tortoise_open_voices"

0 commit comments

Comments
 (0)