slurmstepd: error: Detected 1 oom_kill event in StepId=11300781.0. #5933
Unanswered
thalesChagas asked this question in Q&A
Replies: 0 comments
Hello, my name is Vieira.
I am having trouble running a calculation. The job fails with the following error:
slurmstepd: error: Detected 1 oom_kill event in StepId=11300781.0. Some of the step tasks have been OOM Killed.
srun: error: sdumont6156: task 0: Out Of Memory
How can I get rid of this error? Here is my submission script:
#!/bin/bash
#SBATCH --nodes=4 #Number of Nodes
#SBATCH --ntasks-per-node=16 #Number of tasks per Node
#SBATCH --ntasks=64 #Total number of MPI tasks
#SBATCH -p sequana_cpu_dev #Queue (partition) to be used
#SBATCH -J qe_test #Job name
#SBATCH --time=744:00:00 #duration time
#SBATCH --mem=64Gb
#Display the nodes allocated to the Job
echo $SLURM_JOB_NODELIST
nodeset -e $SLURM_JOB_NODELIST
cd $SLURM_SUBMIT_DIR
#Configure compilers
#------------------------#
# 2) Using OpenMPI with Intel
module load quantum-espresso/7.3_+cuda-11.1+intel_2020_sequana
#Configure the executable
EXEC=/scratch/app/quantum-espresso/7.3+cuda-11.1_intel_sequana/bin/pw.x
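
For reference, it may help to see how much memory the directives above leave for each MPI task. In Slurm, `--mem` is a per-node request, so the 64 GB is shared by the 16 tasks on each node. The snippet below is only a rough sketch of that arithmetic. It assumes memory use is spread evenly across tasks, which Slurm does not enforce, and the variable names are made up for illustration:

```shell
#!/bin/bash
# Values copied by hand from the #SBATCH lines in the script above.
MEM_PER_NODE_GB=64      # from --mem=64Gb (a per-node limit)
TASKS_PER_NODE=16       # from --ntasks-per-node=16

# Assumption: every task uses about the same amount of memory.
MEM_PER_TASK_GB=$(( MEM_PER_NODE_GB / TASKS_PER_NODE ))
echo "Approximate memory per MPI task: ${MEM_PER_TASK_GB} GB"

# On the cluster, the peak memory actually used by the failed job could be
# compared against this figure with Slurm's accounting tool, for example:
#   sacct -j 11300781 --format=JobID,ReqMem,MaxRSS,State
```

If the peak usage reported for task 0 is well above this per-task share, that would be consistent with the OOM kill. Whether the fix is more memory per node, fewer tasks per node, or a smaller calculation depends on the cluster limits and the input, which are not shown here.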