Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inquiry on Semantic Richness and Acoustic Fidelity Variation with n_q in XCodec, and Challenges in Scaling to 44kHz #15

Open
LiuZH-19 opened this issue Dec 6, 2024 · 0 comments

Comments

@LiuZH-19
Copy link

LiuZH-19 commented Dec 6, 2024

Great work!
I would like to inquire if there are any results available regarding the variation of semantic richness and acoustic fidelity as the number of n_q changes in XCodec. Specifically, I am interested in understanding how these two factors (semantic richness and acoustic fidelity) behave as n_q is increased or decreased.

Additionally, I have observed that XCodec operates at a sampling rate of 16kHz, and the reconstructed WAV files lose many acoustic details compared to raw 44.1kHz audio. I am curious about the challenges involved when applying XCodec's technology to a 44kHz codec. For instance, would it be feasible to enhance the DAC by integrating Hubert-based representations?

Any insights or experiences would be greatly appreciated.

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant