AI4Bharat TTS Model Inference for Tibetan (Continuing Post-Training) #3

tenzinchoedon · 2025-01-20T06:36:09Z

Link to previous card: #2 (comment)

Description

This phase focuses on running inference with the fine-tuned AI4Bharat Indic-TTS model. The objective is to generate high-quality, natural-sounding Tibetan speech and validate the model's performance. Any issues that arise during inference will be debugged and the setup adjusted accordingly.

Completion Criteria

Generate Tibetan speech audio files from textual inputs.
Resolve any issues related to phoneme mismatches or inference configurations.

Implementation

Inference Execution

Run inference using the fine-tuned Tibetan TTS model and prepared Tibetan text input.
Save generated audio files for evaluation.
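
A minimal sketch of the two steps above, assuming the fine-tuned checkpoint loads through the Coqui TTS `TTS.bin.synthesize` entry point that Indic-TTS builds on. The checkpoint and config file names, the output directory, and the helper names are hypothetical. If the model needs a separate vocoder, `--vocoder_path` and `--vocoder_config_path` would have to be added to the command.

```python
import subprocess
import sys
from pathlib import Path


def build_synthesize_cmd(text, model_path, config_path, out_path):
    """Build the synthesize command for a single sentence (hypothetical helper)."""
    return [
        sys.executable, "-m", "TTS.bin.synthesize",
        "--text", text,
        "--model_path", str(model_path),
        "--config_path", str(config_path),
        "--out_path", str(out_path),
    ]


def synthesize_all(sentences, model_path, config_path, out_dir, dry_run=True):
    """Create one wav per sentence; with dry_run=True only return the commands."""
    out_dir = Path(out_dir)
    commands = []
    for i, text in enumerate(sentences):
        # Numbered file names keep outputs aligned with the input order.
        out_path = out_dir / f"sample_{i:03d}.wav"
        cmd = build_synthesize_cmd(text, model_path, config_path, out_path)
        commands.append(cmd)
        if not dry_run:
            out_dir.mkdir(parents=True, exist_ok=True)
            subprocess.run(cmd, check=True)  # raises if synthesis fails
    return commands
```

Running with `dry_run=True` first makes it easy to inspect the exact commands before any audio is generated; switching to `dry_run=False` saves the wav files under `out_dir` for evaluation.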

Debugging

Investigate and resolve issues such as an unsupported phoneme language (bo, the language code for Tibetan) or incorrect configuration values.
Make adjustments to phoneme language or preprocessing as required.
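
The kind of check described above could be sketched as follows. It assumes the model's config follows the Coqui TTS layout, with `use_phonemes`, `phoneme_language`, and a `characters` section; the function name is hypothetical, and the set of supported phoneme languages must be supplied by the caller from whatever phonemizer backend is installed.

```python
def check_inference_config(config, text, supported_phoneme_langs):
    """Return a list of human-readable problems found in a TTS config dict."""
    problems = []
    if config.get("use_phonemes"):
        # A phoneme-based model needs a phonemizer that knows the language.
        lang = config.get("phoneme_language")
        if lang not in supported_phoneme_langs:
            problems.append(
                f"phoneme_language {lang!r} is not supported by the phonemizer"
            )
    else:
        # A character-based model needs every input character in its vocabulary.
        chars = config.get("characters") or {}
        vocab = set(chars.get("characters", "")) | set(chars.get("punctuations", ""))
        missing = sorted({c for c in text if c not in vocab and not c.isspace()})
        if missing:
            # Report code points, since Tibetan combining marks render poorly alone.
            listed = ", ".join(f"U+{ord(c):04X}" for c in missing)
            problems.append("characters missing from vocabulary: " + listed)
    return problems
```

In practice `config` would come from `json.load` on the config file used for inference. If the phoneme language turns out to be unsupported, one possible adjustment is to switch to character-based input, in which case the second branch helps confirm that the Tibetan script is fully covered.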

Evaluation

Evaluate the quality of the generated speech.
Compare the generated outputs with the expected results to spot any differences.
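
Before listening tests, a quick automated pass can flag outputs that are obviously wrong. The sketch below uses only the Python standard library and assumes 16-bit PCM wav output; the function names, the duration limits, and the silence and clipping thresholds are illustrative choices, and the expected sample rate has to be taken from the model's own audio configuration.

```python
import sys
import wave
from array import array


def wav_stats(path):
    """Return sample rate, duration in seconds, and peak level (0..1) of a wav."""
    with wave.open(str(path), "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        rate = wf.getframerate()
        n_frames = wf.getnframes()
        samples = array("h", wf.readframes(n_frames))
    if sys.byteorder == "big":
        samples.byteswap()  # wav data is little-endian
    peak = max((abs(s) for s in samples), default=0) / 32768
    return {"sample_rate": rate, "duration_s": n_frames / rate, "peak": peak}


def flag_suspect(stats, expected_rate, min_s=0.3, max_s=30.0):
    """List reasons why a generated file deserves a closer look."""
    issues = []
    if stats["sample_rate"] != expected_rate:
        issues.append(f"sample rate {stats['sample_rate']} != {expected_rate}")
    if stats["duration_s"] < min_s:
        issues.append("audio is suspiciously short")
    if stats["duration_s"] > max_s:
        issues.append("audio is suspiciously long")
    if stats["peak"] < 0.01:
        issues.append("audio is nearly silent")
    if stats["peak"] >= 0.999:
        issues.append("audio may be clipped")
    return issues
```

These checks only catch gross failures such as empty or truncated output. Naturalness and pronunciation still need to be judged by listening and by comparison against the expected results.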

Subtasks

Validate inference setup and configurations.
Run inference on sample Tibetan text input.
Save and analyze generated audio outputs for debugging and quality assessment.
Document issues, if any, and implement required adjustments.

Card Reviewer

@gangagyatso4364

tenzinchoedon · 2025-01-24T06:19:39Z

Wav files from training on Weights & Biases (WandB):

TrainAudio
EvalAudio

tenzinchoedon · 2025-03-19T06:17:46Z

Link to the documentation: https://docs.google.com/document/d/1pj5ZWiiUJ0lLKvaBC76ITcAtmGYLGklbknXH6KCE5Oo/edit?usp=sharing

tenzinchoedon added this to STT & TTS Dev Jan 16, 2025

tenzinchoedon self-assigned this Jan 20, 2025

tenzinchoedon converted this from a draft issue Jan 20, 2025

tenzinchoedon moved this from IN PROGRESS to DONE in STT & TTS Dev Mar 19, 2025
