Support: Memory Efficiency for long recordings #226

Evan8456 · 2024-12-20T17:22:50Z

Issue Description

I had trouble running feature extraction locally on audio recordings longer than a minute.

Environment Details

RAM: 16 GB
OS: Ubuntu 20.04.3 LTS
Python: 3.10.9

Attempted Solutions

No response

Reproduction Steps

No response

Additional Notes

No response

github-actions · 2024-12-20T17:23:13Z

👋 Welcome to Senselab!

Thank you for your interest and contribution. Senselab is a comprehensive Python package designed to process behavioral data, including voice and speech patterns, with a focus on reproducibility and robust methodologies. Your issue will be reviewed soon. Stay tuned!

fabiocat93 · 2025-01-15T17:40:02Z

Hi @Evan8456, thank you for reporting the issue! Could you please provide more details on the problem you're experiencing? Are you seeing any specific error messages or unusual behavior during the process? Does it just hang indefinitely?

Additionally, could you share the steps or code needed to reproduce the issue on our end? This will help us understand the problem better and assist you more effectively.

Evan8456 · 2025-01-21T19:59:28Z

Hi, you can see the context of my issue here. The bug seems to occur when I attempt to run extract_features_from_audios on audio recordings that are 30+ seconds long.

ibevers · 2025-03-11T19:44:02Z

We've agreed it should be a lazy object.

ibevers · 2025-03-11T19:44:49Z

Laziness possibly optional

ibevers · 2025-03-11T19:47:15Z

Is there a version of torchaudio load that is lazy? Possibly, similar to: librosa, cv2

fabiocat93 · 2025-03-14T17:16:27Z

> Is there a version of torchaudio load that is lazy? Possibly, similar to: librosa, cv2

I have checked and unfortunately librosa doesn't seem to have lazy load 👎

900miles · 2025-03-14T17:28:03Z

> I have checked and unfortunately librosa doesn't seem to have lazy load 👎

librosa.stream seems to be fairly similar -- it returns a generator of audio chunks rather than loading the whole audio into memory at once.

The library I was thinking of during the meeting was soundfile (which it looks like librosa uses underneath). The SoundFile class opens an audio file, then you can use various methods to seek around in that audio file and read/write a given amount of frames.

fabiocat93 · 2025-03-14T18:35:09Z

> > I have checked and unfortunately librosa doesn't seem to have lazy load 👎
>
> librosa.stream seems to be fairly similar -- it returns a generator of audio chunks rather than loading the whole audio into memory at once.
>
> The library I was thinking of during the meeting was soundfile (which it looks like librosa uses underneath). The SoundFile class opens an audio file, then you can use various methods to seek around in that audio file and read/write a given amount of frames.

Maybe StreamReader is the corresponding tool in torchaudio: https://pytorch.org/audio/main/tutorials/streamreader_basic_tutorial.html

Evan8456 added the "question" (Further information is requested) label on Dec 20, 2024
