-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Speech synthesis and machine learning #31
Comments
Thanks for your review and comment! Speech synthesis with deep learning is another interesting and developing area, spearheaded by WaveNet. Generation of raw audio waveforms has many applications. The list in the Motivation and Background section was not meant to be all inclusive, but mention a few use cases that are widely understood to set the context. The intent is certainly not to discourage the use of the WebNN API for other use cases it may be applicable to. Given the charter is under AC review I leave it to @dontcallmedom to tell whether such informative tweaks are appropriate at this stage, or whether we should keep this text as is for now. |
I agree that it's a bit late to consider this addition to the charter at this stage of the process. With that said:
|
[@dontcallmedom suggested that i should raise this here. It was originally a comment at https://github.com/w3c/strategy/issues/367#issuecomment-1431300909]
The description of areas in scope in the charter appears a little lop-sided to me, since Speech Recognition is called out but not Speech Synthesis. Specifically, just as you need to infer text from wave input in speech recognition, you need to infer meaning and pronunciation guides ('text understanding') as a prep for speech synthesis, and i wonder why machine learning is not being applied to that and included in the scope of the charter (ie. for the linguistic analysis of the text, rather than what drives the audio hardware, and could provide input to other TTS specs at the W3C - such as SSML, CSS, etc). If that doesn't fall within the scope of the work, I think the charter should probably indicate briefly why it is not addressing the use of machine learning for that function, whereas it does make inferences for speech recognition.
The text was updated successfully, but these errors were encountered: