Description
I noticed that the PPG extractor sometimes doesn't work properly when there is silence before or after the audio section. I'm using the latest version installed via pip.
Does anyone have insights into what might be causing this issue?
Observations
The input audio waveform:
I processed this audio and got the following PPG output:
However, the PPG output does not align with the actual audio section, which starts from frame 85. The problem is not limited to the beginning: the entire output appears out of sync with the audio timeline.
The Mel spectrogram aligns correctly, so it seems the issue might be with the model:
To investigate, I cut the audio to frame range [65, 293], then got this PPG output:
The output aligns with the input audio and seems more accurate than the non-cut version.
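For anyone trying to reproduce the cut, here is a minimal sketch of how the active frame range could be located and the waveform trimmed to it before extraction. It does not call ppgs itself. The hop size of 160 samples per frame, the RMS threshold, the inclusive end frame, and the helper names `frame_rms`, `active_frame_range`, and `trim_to_frames` are all assumptions for illustration, so adjust them to your setup.

```python
import math

# Assumption: 160 samples per frame. Check this against your ppgs configuration.
HOP_SIZE = 160


def frame_rms(samples, hop_size=HOP_SIZE):
    """Return the RMS energy of each non-overlapping hop-sized frame."""
    n_frames = len(samples) // hop_size
    return [
        math.sqrt(
            sum(x * x for x in samples[i * hop_size:(i + 1) * hop_size]) / hop_size
        )
        for i in range(n_frames)
    ]


def active_frame_range(samples, threshold=0.01, hop_size=HOP_SIZE):
    """Return (first, last) frame indices whose RMS exceeds threshold, or None."""
    energies = frame_rms(samples, hop_size)
    active = [i for i, energy in enumerate(energies) if energy > threshold]
    if not active:
        return None
    return active[0], active[-1]


def trim_to_frames(samples, start_frame, end_frame, hop_size=HOP_SIZE):
    """Keep the samples covering frames start_frame..end_frame (inclusive)."""
    return samples[start_frame * hop_size:(end_frame + 1) * hop_size]
```

The trimmed samples can then be passed to the extractor in place of the full recording, which is roughly what the cut version above does by hand.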
Test Data
- Original Audio: https://drive.google.com/file/d/1jW9qkXABDvef39KWWZv8TL3lP7GOzLRm/view?usp=sharing
- Cut version: https://drive.google.com/file/d/1sp6tcLNUdXhj5pk1RiT9kWRoorINNluK/view?usp=sharing
Environment
- Python: 3.10.12
- ppgs: 0.0.8