This project is an attempt to build a deep neural network that generates audio snippets of spoken digits. Its main purpose is to serve as a testbed for experimenting with different neural network architectures for audio.
The documentation is divided into subpages, each explaining one topic in more detail. All of them can be reached from the table of contents.