Hear Me If You Can!

This repository contains the code for our project on audio steganography. Most of our work is based on this paper. This work was done as part of CS753: Automatic Speech Recognition at IIT Bombay.

A major chunk of the repo has been forked from here.

Contributors: Samyak Shah, Rishabh Dahale and Mithilesh Vaidya

Installation

We recommend creating a virtual environment and installing the python requirements there.

virtualenv <path_to_your_env>
source <path_to_your_env>/bin/activate
pip install -r requirements.txt

Directory structure

ctc_best: Trained ASR models

examples: a few of the examples shown in the presentation. Each recording has its own folder, which contains:

name.wav: original clean recording e.g. walter.wav
name_encoded_text.wav: perturbed recording for the encoded text
name_encoded_text.pkl: pickle file containing loss and PESQ score as a function of the number of iterations

speech: main codebase which contains the ASR model and preprocessing steps

final_presentation.pptx: a brief presentation of our project

stego.py: contains the actual stego algorithm

train.py: file for training the ASR model
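The .pkl files under examples store the loss and PESQ score per iteration, but this README does not specify how they are laid out internally. The snippet below is a minimal sketch for inspecting one without assuming a layout: it loads the file and reports the top-level type and, for a dict, the type and length of each value. The helper name summarize_pickle is hypothetical and not part of this repository. If the file holds NumPy arrays or tensors, the corresponding library must be installed for unpickling to succeed.

```python
import pickle
from pathlib import Path


def summarize_pickle(path):
    """Load a pickle file and describe its top-level structure.

    Hypothetical helper (not part of the repo); it makes no assumption
    about how loss and PESQ values are stored inside the file.
    """
    with Path(path).open("rb") as f:
        obj = pickle.load(f)  # only unpickle files from a source you trust

    summary = {"type": type(obj).__name__}
    if isinstance(obj, dict):
        # For each key, record the value's type and its length if it has one.
        summary["entries"] = {
            str(key): (
                type(value).__name__,
                len(value) if hasattr(value, "__len__") else None,
            )
            for key, value in obj.items()
        }
    elif hasattr(obj, "__len__"):
        # Lists, tuples, and arrays: report how many elements they hold.
        summary["length"] = len(obj)
    return summary
```

For example, print(summarize_pickle("examples/<name>/<name>_encoded_text.pkl")), with the placeholders replaced by an actual recording folder and file name, shows what the file contains before you write any plotting code against it.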

Running the code

stego.py contains both algorithms for computing the perturbation: one in the time domain and one in the spectral domain.

It takes as input a path to the audio recording and a list of phones to encode.
