Nexdata-AI/2-People-Japanese-Average-Tone-Speech-Synthesis-Corpus

2-People-Japanese-Average-Tone-Speech-Synthesis-Corpus


Description

2 People - Japanese Average Tone Speech Synthesis Corpus. Recorded by native Japanese speakers with authentic accents, the corpus contains news and colloquial-style general text with balanced phoneme coverage. Professional phoneticians participated in the annotation. The corpus is intended to meet the research and development needs of speech synthesis.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1411?source=Github

Specifications

Format

48,000 Hz, 24-bit, uncompressed WAV, mono channel;
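
As a rough illustration of the format stated above, the following sketch checks whether a WAV file matches 48,000 Hz, 24-bit, mono. It uses only Python's standard `wave` module; the function name `check_wav_format` and any file path passed to it are hypothetical and not part of the dataset. Note that the `wave` module reads uncompressed PCM only, and older Python versions may reject files that use a WAVE_FORMAT_EXTENSIBLE header.

```python
import wave

# Expected values taken from the "Format" specification above.
EXPECTED_RATE_HZ = 48_000
EXPECTED_SAMPLE_WIDTH_BYTES = 3  # 24-bit samples
EXPECTED_CHANNELS = 1  # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the stated format.

    An empty list means the file header matches the specification.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
    return problems
```

Calling `check_wav_format("some_file.wav")` on a conforming file should return an empty list; the path here is a placeholder, since the actual file layout of the corpus is not described in this document.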

Recording environment

professional recording studio;

Recording content

news and colloquial-style general text;

Speaker

professional voice actors, one male and one female, aged 25-35, 10 hours per person;
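
For planning storage, the stated format and durations give a rough size estimate. The sketch below assumes that "10 hours per person" refers to total audio duration per speaker, and it ignores WAV headers, transcription files, and annotation files, so the real download size will differ.

```python
# Values from the specification: 48,000 Hz, 24-bit (3 bytes), mono,
# two speakers with 10 hours each (assumed to be audio duration).
SAMPLE_RATE_HZ = 48_000
BYTES_PER_SAMPLE = 3
CHANNELS = 1
HOURS_PER_SPEAKER = 10
SPEAKERS = 2

# Uncompressed PCM data rate in bytes per second.
BYTES_PER_SECOND = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS

per_speaker_bytes = BYTES_PER_SECOND * HOURS_PER_SPEAKER * 3600
total_bytes = per_speaker_bytes * SPEAKERS

print(f"data rate:   {BYTES_PER_SECOND} bytes/s")
print(f"per speaker: {per_speaker_bytes / 1e9:.3f} GB")
print(f"total:       {total_bytes / 1e9:.3f} GB")
```

Under these assumptions the raw audio comes to about 5.2 GB per speaker and about 10.4 GB in total (decimal gigabytes).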

Annotation

word and phoneme transcription, four-level prosodic boundary annotation;

Device

microphone;

Language

Japanese

Application scenarios

speech synthesis.

Licensing Information

Commercial License