Timeline misalignment in PPG extraction for audio with silence

I noticed that the PPG extractor sometimes doesn't work properly when there is silence before or after the audio section. I'm using the latest version installed via pip. 

Does anyone have insights into what might be causing this issue?

## Observations
The input audio waveform:
![image](https://github.com/user-attachments/assets/ab40f1e7-ec52-4b92-b1bf-0b257ec66c47)
I processed this audio and got the following ppg output:
![debug01](https://github.com/user-attachments/assets/746112a4-d8e8-438c-9f0f-2daab850a23a)
However, the PPG output does not align with the actual audio section, which starts from frame 85. It's not only about the beginning, but the whole thing seems out of sync on the timeline.
The Mel spectrogram aligns correctly, so it seems the issue might be with the model:
![debug01_mel_spectrogram](https://github.com/user-attachments/assets/a012ccb0-314b-4335-9993-8f15577b8bb0)
To investigate, I cut the audio to frame range [65, 293], then got this PPG output:
![debug01_cut](https://github.com/user-attachments/assets/056b3ba0-ad7d-4346-8c55-b4df5ffe8b29)
The output aligns with the input audio and seems more accurate than non-cut version.

## Test Data
- Original Audio:  https://drive.google.com/file/d/1jW9qkXABDvef39KWWZv8TL3lP7GOzLRm/view?usp=sharing
- Cut version: https://drive.google.com/file/d/1sp6tcLNUdXhj5pk1RiT9kWRoorINNluK/view?usp=sharing

## Environment
- Python: 3.10.12
- ppgs: 0.0.8



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Timeline misalignment in PPG extraction for audio with silence #18

Observations

Test Data

Environment

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Timeline misalignment in PPG extraction for audio with silence #18

Description

Observations

Test Data

Environment

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions