Skip to content

Latest commit

 

History

History
17 lines (13 loc) · 857 Bytes

README.md

File metadata and controls

17 lines (13 loc) · 857 Bytes

Digit2Speech

This project is an attempt to create a deep neural network which generates audio snippets of spoken digits. Primarily, it acts as a way to experiment with different neural network architectures when working with audio.

The documentation is structured into subpages further explaining the corresponding subject, which can be accessed from the table of contents.

Table of Contents

  1. Metadata
  2. Preprocessing
  3. Dataset
  4. Concepts and Architectures
  5. Development and usage
  6. Future Work
  7. References