A tutorial on pin_memory and non_blocking usage #2983

vmoens · 2024-07-24T15:51:43Z

This is a draft PR on the proper usage of pin_memory and non_blocking when sending data from CPU to GPU (and GPU to CPU).

Description

There is some confusion about the proper use of pin_memory and non_blocking in the community.
To be convinced about it, one can simply look for the .pin_memory.to( pattern on github (about 2k results), when this command is always slower than a simple call to to(device).

The responsibility lies in part in the pytorch doc itself where it is recommended (implicitly) to call pin_memory before calling to with non_blocking=True and several such occurrences on the pytorch forum too.

Some refs on the topic:
https://discuss.pytorch.org/t/what-is-the-disadvantage-of-using-pin-memory/1702/13
https://discuss.pytorch.org/t/non-blocking-memory-transfer-to-gpu/188941
https://discuss.pytorch.org/t/should-we-set-non-blocking-to-true/38234
https://discuss.pytorch.org/t/how-is-explicit-pin-memory-different-from-just-calling-to-and-let-cuda-handle-it/197422

I use TensorDict to demonstrate how to use pin_memory across threads.

A follow-up will be to link back the pytorch doc / docstrings of to / pin_memory etc to this tutorial.

The syntax should be fixed and integrated better in the lib. Conclusion and additional resources are missing.
Still, feedback is more than welcome!

The tutorial requires tensordict v0.5 which will be released in the coming days.

cc @shagunsodhani @albanD @janeyx99 @dstaay-fb @mikaylagawarecki @ptrblck

pytorch-bot · 2024-07-24T15:51:46Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/2983

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 07f9932 with merge base c3882db ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

svekars · 2024-07-24T17:06:35Z

intermediate_source/pinmem_nonblock.py

@@ -0,0 +1,429 @@
+# -*- coding: utf-8 -*-
+"""
+A guide on good usage of `non_blocking` and `pin_memory()` in PyTorch


We need to use .rst syntax:

double backticks instead of single ones

Links should be Text <link>_ not

Good point, I was planning on doing that - currently the script is a mere export from a regular ipynb, I should make a second pass and correct all the links etc!

svekars · 2024-07-24T17:08:27Z

intermediate_source/pinmem_nonblock.py

+TL;DR
+-----


I suggest we remove the TL;DR - the text below can be just the intro abstract right after the title. Can you follow this template: https://github.com/pytorch/tutorials/blob/main/beginner_source/template_tutorial.py
Add What you will learn and Prerequisites and Author

vmoens · 2024-07-25T20:07:22Z

@svekars Happy to get some more feedback.
On my side, I only see these items to be done:

edit figures to make them look prettier
add additional info (links etc)
check that we have the most demonstrative examples (ie play a bit with the setups to highlight the differences better)

svekars

Just a couple very minor editorial nits - looks great, otherwise!

intermediate_source/pinmem_nonblock.py

svekars · 2024-07-30T16:24:00Z

intermediate_source/pinmem_nonblock.py

+#   .. _pinned_memory_resources:
+#
+#  If you are dealing with issues with memory copies when using CUDA devices or want to learn more about
+#  what was discussed in this tutorial, check the following references:
+#
+#  - `CUDA toolkit memory management doc <https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__MEMORY.html>`_
+#  - `CUDA pin-memory note <https://forums.developer.nvidia.com/t/pinned-memory/268474>`_
+#  - `How to Optimize Data Transfers in CUDA C/C++ <https://developer.nvidia.com/blog/how-optimize-data-transfers-cuda-cc/>`_
+#  - tensordict :meth:`~tensordict.TensorDict.to` method;
+#


I thin there is a bit of an issue with rendering this part because of an extra space in all those liens. If you have just one space instead of two, the indentation will be correct.

Co-authored-by: Svetlana Karslioglu <[email protected]>

intermediate_source/pinmem_nonblock.py

…m-nonblock-tuto

mikaylagawarecki

Please address the nits though

intermediate_source/pinmem_nonblock.py

init

0974b34

facebook-github-bot added the cla signed label Jul 24, 2024

vmoens marked this pull request as draft July 24, 2024 15:51

Vincent Moens added 10 commits July 24, 2024 17:01

index.rst

5f0b6ae

index.rst

72f8951

spelling

4fe1b2d

black

5f57f50

pegeable -> pageable

67f4d1a

update requirements

07456f9

update requirements

ffdcef9

update requirements

f6ba568

update requirements

85d07be

amend

9b4640a

svekars reviewed Jul 24, 2024

View reviewed changes

Vincent Moens added 12 commits July 24, 2024 19:47

amend

a688b90

amend

d706405

amend

5ae66ec

amend

dc86259

amend

4af8cae

amend

73ab3ba

black

5cb0510

amend

01e580d

amend

3ad73ad

amend

8b2ec64

amend

d980553

amend

ac5b5e4

Vincent Moens added 2 commits July 26, 2024 07:35

amend

68241fe

amend

96e1582

amend

3869413

svekars added the tensordict label Jul 29, 2024

Vincent Moens added 3 commits July 29, 2024 18:08

amend

b1488d5

amend

33236ec

amend

2fd193a

vmoens marked this pull request as ready for review July 29, 2024 22:23

Vincent Moens added 11 commits July 29, 2024 18:25

amend

392230a

amend

bff42d1

Merge remote-tracking branch 'origin/main' into pinmem-nonblock-tuto

c8f7e41

amend

e6b20b1

amend

d6318f7

amend

83abe5f

amend

4fe4bde

amend

ea53204

amend

0d6cba7

amend

69e98ea

amend

ed465bd

svekars reviewed Jul 30, 2024

View reviewed changes

Vincent Moens and others added 2 commits July 30, 2024 19:28

Update intermediate_source/pinmem_nonblock.py

d4169d4

Co-authored-by: Svetlana Karslioglu <[email protected]>

Apply suggestions from code review

2f55eb8

Co-authored-by: Svetlana Karslioglu <[email protected]>

vmoens commented Jul 30, 2024

View reviewed changes

intermediate_source/pinmem_nonblock.py Outdated Show resolved Hide resolved

Update intermediate_source/pinmem_nonblock.py

12d1b69

svekars approved these changes Jul 30, 2024

View reviewed changes

Vincent Moens added 3 commits July 30, 2024 16:59

edit index.rst

8f4d6d7

Merge remote-tracking branch 'origin/pinmem-nonblock-tuto' into pinme…

d3befe4

…m-nonblock-tuto

edit tensordict to() link

1dfe315

mikaylagawarecki approved these changes Jul 30, 2024

View reviewed changes

address comments

07f9932

vmoens merged commit a66464b into main Jul 31, 2024
20 checks passed

vmoens deleted the pinmem-nonblock-tuto branch July 31, 2024 00:16

		TL;DR
		-----

A tutorial on pin_memory and non_blocking usage #2983

A tutorial on pin_memory and non_blocking usage #2983

Uh oh!

Conversation

vmoens commented Jul 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

pytorch-bot bot commented Jul 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/2983

✅ No Failures

Uh oh!

svekars Jul 24, 2024

Choose a reason for hiding this comment

Uh oh!

vmoens Jul 24, 2024

Choose a reason for hiding this comment

Uh oh!

svekars Jul 24, 2024

Choose a reason for hiding this comment

Uh oh!

vmoens Jul 24, 2024

Choose a reason for hiding this comment

Uh oh!

vmoens commented Jul 25, 2024

Uh oh!

svekars left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

svekars Jul 30, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mikaylagawarecki left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vmoens commented Jul 24, 2024 •

edited

Loading

pytorch-bot bot commented Jul 24, 2024 •

edited

Loading