2 People - Japanese Average Tone Speech Synthesis Corpus. The corpus was recorded by native Japanese speakers with authentic accents. It contains news and colloquial-style general text, and its phoneme coverage is balanced. Professional phoneticians participated in the annotation. The corpus is designed to meet the research and development needs of speech synthesis.
For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1411?source=Github
48,000 Hz, 24-bit, uncompressed WAV, mono channel;
professional recording studio;
contains news and general corpus;
professional voice actors, one male and one female, aged 25-35, 10 hours per person;
word and phoneme transcription, four-level prosodic boundary annotation;
microphone;
Japanese
speech synthesis.
Commercial License
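
The audio format listed above (48,000 Hz, 24-bit, mono, uncompressed WAV) can be checked on a downloaded file before it goes into a training pipeline. The following is a minimal sketch using only Python's standard `wave` module. The function name `check_wav_format` and the example file name are hypothetical and are not part of the dataset or of any Nexdata tooling.

```python
import wave

# Expected values, taken from the format specification listed above.
EXPECTED_RATE_HZ = 48_000
EXPECTED_SAMPLE_WIDTH_BYTES = 3  # 24 bits per sample
EXPECTED_CHANNELS = 1            # mono


def check_wav_format(path):
    """Return a list of format mismatches; an empty list means the file conforms.

    Hypothetical helper for illustration only.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
    return problems
```

Calling `check_wav_format("sample.wav")` on a conforming file (the file name is a placeholder) would return an empty list; any other result names the properties that differ from the specification.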