deepspeed using multi - GPUs #3743
Unanswered
zhangyuanxun
asked this question in
Q&A
Replies: 1 comment
-
this's normal. even slower than using one single gpu since the calculation must be done step by step when batch size is one. unless the cuda cores aren't enough and MP use tensor-parallelism and the bandwidth is much high, maybe it will be faster than using one single gpu. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I'm using deepspeed inference with multiple GPUs. However, I have observed that there is no difference in performance between using a single GPU and multiple GPUs when the data (batch size = 1). Is this expected behavior, or could there be an issue with my code?
Thank you.
Beta Was this translation helpful? Give feedback.
All reactions