Authors: Novoselov S., Lavrentyeva G., Volokhov V., Matveev Y.
Description: the project is related to the development of Basics of Voice Biometrics lecture book for the ITMO Speaker Recognition Course.
Keywords: voice biometrics, speaker recognition, speaker verification, speaker identification, acoustic features, speech activity detector, machine learning, speaker embedding extractor, deep neural network, decision theory, domain adaptation and calibration, speaker diarization.
Content: the repository contains theoretical materials (now only in russian language) for self-study in the speaker recognition area. This book is a theoretical supplement to the lab work here. Overleaf project of the book is here. The titles of the book chapters are listed below.
- Introduction (link).
- Chapter 1. Introduction to voice biometrics (link).
- Chapter 2. Preprocessing of speech signals (link).
- Chapter 3. Classical methods for speaker model computing (link).
- Chapter 4. State of the art methods for speaker model computing (link).
- Chapter 5. Comparison of speaker models (link).
- Chapter 6. Decision criteria (link).
- Chapter 7. Quality assessment of biometric systems (link).
- Chapter 8. Domain adaptation (link).
- Chapter 9. Calibration of speaker recognition system (link).
- Chapter 10. Speaker diarization (link).
- Chapter 11. Prospective directions for the voice biometrics development (link).
- Subject index (link).
- Contents (link).