-
I've made some recordings at 48kHz sample rate which gives a frequency range up to at least 20kHz, but the spectrogram only shows up to ~12kHz. Is this correct? A spectrogram from Audacity shows bird sounds in these recordings well up to 17kHz. (File attached; Audacity at top) We've had a discussion about the accuracy of detections based on different sample rates, and the Audacity spectrogram clearly shows information exists in the 12kHz-17kHz range, information that would be missed in recordings at 24kHz (which I've been using). Do we know if BirdNET uses information in this range? It would certainly be a good reason for recording at 48kHz if it does. |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 4 replies
-
The BirdNET model does not use frequencies above 15KHz. As far as I know, what you see in this spectrogram are just harmonics or overtones. |
Beta Was this translation helpful? Give feedback.
-
Yes, that is correct, the spectrogram view is capped at 12kHz. BirdNET is sensitive to sounds up to 15kHz and Chirpity sends it audio with a sample rate of 48kHz regardless of the source sample rate of your audio. Obviously, if your audio sample rate is 24kHz, there will be no information above 12kHz, so for BirdNET analysis it will be better to use audio with a sample rate of at least 30kHz. 48kHz would be the ideal rate. |
Beta Was this translation helpful? Give feedback.
-
We have to distinguish between:
1.) the BirdNET model uses only frequencies up to 15kHz (as has already been correctly stated). Internally, BirdNET uses 48ksps sample rate. If you use a differing sample rate, your WAV files have to be resampled which takes some time and could potentially lead to artifacts. However the latter point is very minor and should not result in fundamentally different results in the classification. 2.) Birds (at least in Europe) mainly use frequencies up to 10kHz. All higher frequencies are non-relevant for the ID. So, for a manual inspection of spectrograms for ID purposes, we need a span from 100Hz to 10kHz. All higher frequencies are non-relevant for the ID. In order to have a max freq resolution in a given spectrogram, we should scale the spectrogram for this freq span. For example, have a look into Stanislaw Wrozas book "Identifying migratory bird sounds in Britain and Europe": the spectrograms there show freqs between 0 and 11kHz. So, in conclusion in order to answer this threads question "Should Chirpity show the whole recorded frequency range in the spectrogram?": No, Chirpity should not show frequencies above 12kHz in the spectrogram (at least if we are talking about bird distinction) I would further recommend using 48ksps sample rate in order to give BirdNET the whole spectrum it uses (up to 15kHz) AND to simultaneously save time and processing power, because no resampling is needed. P.S.: your recording, ceperman, looks quite strange . . . the harmonics seem to indicate an overload of the ADC of your recorder or something else. What is your recorder model ? Was the bird very close ? Was it a field recording or did you record from a playback with a low quality speaker (eg. a smartphone)? The latter would explain the strange looking spectrogram |
Beta Was this translation helpful? Give feedback.
Yes, that is correct, the spectrogram view is capped at 12kHz. BirdNET is sensitive to sounds up to 15kHz and Chirpity sends it audio with a sample rate of 48kHz regardless of the source sample rate of your audio. Obviously, if your audio sample rate is 24kHz, there will be no information above 12kHz, so for BirdNET analysis it will be better to use audio with a sample rate of at least 30kHz. 48kHz would be the ideal rate.