Binaural-Source-Localization-CNN

Basic Information

Author: Gregory Hunkins

Organization: University of Rochester

License: MIT

Abstract: A Convolutional Neural Network (CNN) classification system was designed to localize human voices in 3-D space. A new dataset, VoiceBin100K, is introduced for this task and for future work in the field. The CNN takes variable-length binaural short-time Fourier transform (STFT) magnitude and phase features as input and predicts the location of the speaker’s voice as one of 168 location classes.
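
To make the input features described in the abstract concrete, here is a minimal sketch of how binaural STFT magnitude and phase features could be computed with NumPy. It is not taken from this repository: the function name `binaural_stft_features`, the frame size `n_fft=512`, the hop `hop=256`, the Hann window, and the channel ordering of the output are all assumptions chosen for illustration.

```python
import numpy as np


def binaural_stft_features(stereo, n_fft=512, hop=256):
    """Compute STFT magnitude and phase for a two-channel signal.

    Hypothetical helper, not the repository's own preprocessing.
    `stereo` has shape (2, n_samples): row 0 is the left ear, row 1 the right.
    Returns an array of shape (4, n_fft // 2 + 1, n_frames) ordered as
    [left magnitude, right magnitude, left phase, right phase].
    """
    stereo = np.asarray(stereo, dtype=np.float64)
    if stereo.ndim != 2 or stereo.shape[0] != 2:
        raise ValueError("expected an array of shape (2, n_samples)")
    n_samples = stereo.shape[1]
    if n_samples < n_fft:
        raise ValueError("signal is shorter than one analysis frame")

    # Number of full frames that fit without padding (assumed framing scheme).
    n_frames = 1 + (n_samples - n_fft) // hop
    window = np.hanning(n_fft)

    # Index matrix of shape (n_frames, n_fft): row i selects frame i's samples.
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = stereo[:, idx] * window         # (2, n_frames, n_fft)

    spec = np.fft.rfft(frames, axis=-1)      # (2, n_frames, n_fft // 2 + 1)
    spec = spec.transpose(0, 2, 1)           # (2, n_freq, n_frames)

    # Stack magnitudes first, then phases (in radians, within [-pi, pi]).
    return np.concatenate([np.abs(spec), np.angle(spec)], axis=0)
```

Because the number of frames grows with the recording length, the time axis of the result is variable, which matches the variable-length input mentioned above. With these assumed defaults, one second of audio at 16 kHz yields 61 frames of 257 frequency bins each.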

Running The Code

Reference: https://cs.rochester.edu/~cxu22/t/577F17/bluehive_tutorial.html

Data

Please contact [email protected] for access to the data. A public link will be available shortly.
