Skip to content

The project is related to the development of Basics of Voice Biometrics lecture book for the ITMO Speaker Recognition Course.

Notifications You must be signed in to change notification settings

itmo-mbss-lab/sr_lectures_book

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ITMO Speaker Recognition Course

Authors: Novoselov S., Lavrentyeva G., Volokhov V., Matveev Y.

Description: the project is related to the development of Basics of Voice Biometrics lecture book for the ITMO Speaker Recognition Course.

Keywords: voice biometrics, speaker recognition, speaker verification, speaker identification, acoustic features, speech activity detector, machine learning, speaker embedding extractor, deep neural network, decision theory, domain adaptation and calibration, speaker diarization.

Content: the repository contains theoretical materials (now only in russian language) for self-study in the speaker recognition area. This book is a theoretical supplement to the lab work here. Overleaf project of the book is here. The titles of the book chapters are listed below.

  • Introduction (link).
  • Chapter 1. Introduction to voice biometrics (link).
  • Chapter 2. Preprocessing of speech signals (link).
  • Chapter 3. Classical methods for speaker model computing (link).
  • Chapter 4. State of the art methods for speaker model computing (link).
  • Chapter 5. Comparison of speaker models (link).
  • Chapter 6. Decision criteria (link).
  • Chapter 7. Quality assessment of biometric systems (link).
  • Chapter 8. Domain adaptation (link).
  • Chapter 9. Calibration of speaker recognition system (link).
  • Chapter 10. Speaker diarization (link).
  • Chapter 11. Prospective directions for the voice biometrics development (link).
  • Subject index (link).
  • Contents (link).