forked from MoonInTheRiver/DiffSinger
-
Notifications
You must be signed in to change notification settings - Fork 315
Open
Milestone
Description
Branches
mainbranch will be frozen and will not update until v3 release. For latest updates (if there are any), please turn tov2-backport.- All refactoring and new features, including bug fixes to the last v2 release, goes to
v3. This branch will be merged intomainoncev3is ready for release. v2-backportwill contain all bug fixes to the last v2 release and backported features from v3, and will be discarded oncev3is ready for public testing.v3-devis for daily development. Commits shall be squashed and merged intov3.
TODO list
Framework
- New configuration system based on OmegaConf and Pydantic
- Optimized binarizer workflows
- Refactor NN modules
- New training framework
- Acoustic model training
- Variance model training
- Simple inference and composed inference
- ONNX exporting with latest PyTorch versions
Features
- Muon optimizer + LYNXNet 2
- EMA (framework support)
- LoRA (framework support)
- Remake tension
- Falsetto parameter
- Mouth opening parameter
- Inpainting (retaking) in acoustic models
- Generative duration predictor
- Latent NSF vocoder
hrukalive
Metadata
Metadata
Assignees
Labels
No labels