ITMO Digital Signal Processing Course

Lab work 1. Signals (link).
Lab work 2. Signal analysis (link).
Lab work 3. Filters (link).
Lab work 4. Acoustic features (link).
Lab work 5. Gender detection (link).
Lab work 6. Speech recognition (link).
Lab work 7. Griffin-Lim algorithm (link).

A draft description of these labs (currently available only in Russian) can be found here.

ITMO Acoustic Event Detection Course

Lab work 1. VAD (link).
Lab work 2. Shot detection (link).
Lab work 3. CNN for AED (link).
Lab work 4. Transformers for AED (link).

Kaggle leaderboard (4th place)

Presentation

ITMO Speaker Recognition Course

Description: this project develops the labs for the ITMO Speaker Recognition Course.

Keywords: voice biometrics, speaker recognition, speaker verification, speaker identification, acoustic features, speech activity detector, machine learning, speaker embedding extractor, deep neural network, decision theory, domain adaptation and calibration.

Datasets: the main database used in the labs is the VoxCeleb corpus.

Content: the repository contains materials (currently available only in Russian) for completing five labs independently. The lab titles are listed below.

Lab work 1. Informative features of speech signals: feature extraction (link).
Lab work 2. Voice activity detector training (link).
Lab work 3. Creating and comparing speaker models (link).
Lab work 4. Decision criteria and quality metrics (link).
Lab work 5. Adaptation and calibration of speaker recognition system (link).

Some ideas for the labs were borrowed from here (training a voice activity detector model), here (training and testing a speaker embedding extractor) and here (training a calibration model for a voice biometrics system).

A published version of these labs (currently available only in Russian) can be found here. Publication date: 05/24/2022.

The latest updated version of these labs (currently available only in Russian) can be found here. Publication date: 05/24/2022.


qocharian/digital_signal_processing_labs
