-
Notifications
You must be signed in to change notification settings - Fork 20
Description
First of all, I want to express my gratitude for your work on developing this algorithm. It has been very helpful for my research.
I've successfully run about 40 samples using Realign. However, I'm encountering some difficulties with the remaining half of my samples. The process seems to stall, making little progress. For example, one sample (totaling about 20 GB for the three BAM files used when running Realign) has read about 300 TB of data but keeps returning to the same place in the same file. It's using all the computational power to unpack BAM files and almost none on analysis.
I've attempted to address this by increasing the number of cores, but it doesn't seem to make a difference. My last attempt used 32 cores and 128 GB of RAM, with the maximum runtime available to me on the cluster being 7 days, which was not enough time.
I'm puzzled as to why about half of my samples were able to run successfully while the other half seems unable to finish. There are no major differences in the size of the input BAMs between the samples that ran successfully and those that are not able to run.
Is there a known reason for this problem? Are there any suggestions or workarounds you could provide to help process these remaining samples?
Thank you in advance for any assistance you can offer.