predict_action function performs pyTorch conversion using GPU #37

demobo-com · 2025-06-09T23:17:50Z

What this does

(⚡️ Performance)
Optimizes the PyTorch conversion of observations in predict_action by performing it on the CUDA GPU instead of the CPU.

How to checkout & try? (for the reviewer)

Run the same ACT policy evaluation in these 4 scenarios:

original predict_action without other CPU-intensive applications running (e.g. Zoom): baseline
original predict_action with other CPU-intensive applications running (e.g. Zoom): expect very low inference FPS
new predict_action without other CPU-intensive applications running (e.g. Zoom): expect a small performance improvement
new predict_action with other CPU-intensive applications running (e.g. Zoom): expect 10x better performance

Examples:

python lerobot/scripts/control_robot.py --control.type=record --control.policy.path=someACT_model --some.option=true
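
To compare the four scenarios with numbers rather than by eye, a small timing helper can be wrapped around one inference step. This is a minimal sketch, not part of the PR: `measure_fps` and the stand-in workload are hypothetical names. When timing on CUDA, keep in mind that kernels launch asynchronously, so the timed step should end with something that waits for the result (for example `torch.cuda.synchronize()`), otherwise the measured FPS will be too optimistic.

```python
import time


def measure_fps(step, n_iters=100, warmup=10):
    """Average calls per second of `step`, excluding `warmup` initial calls."""
    for _ in range(warmup):
        step()  # let caches and lazy initialization settle before timing
    start = time.perf_counter()
    for _ in range(n_iters):
        step()
    elapsed = time.perf_counter() - start
    return n_iters / max(elapsed, 1e-9)  # guard against a zero-length interval


# Stand-in workload (assumption): replace with one inference step of the policy.
fps = measure_fps(lambda: sum(range(10_000)), n_iters=50, warmup=5)
print(f"{fps:.1f} steps/s")
```

Running the helper once per scenario, with the same `n_iters`, gives four comparable figures.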

Copilot

Pull Request Overview

This PR optimizes predict_action by moving the tensor device transfer to before data transformations so that normalization and reshaping occur on the GPU, improving inference performance under CPU load.

Moves observation[name].to(device) to the start of the conversion loop and removes the redundant transfer at the end
Ensures all type casting, normalization, permute, and unsqueeze operations happen on the CUDA device
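
The reordering described above can be sketched in isolation. This is an illustrative stand-alone helper, not the code from control_utils.py: `to_policy_input` and the observation keys are hypothetical, and the device falls back to CPU when CUDA is unavailable. One side effect of transferring first is that images cross to the device as uint8 (1 byte per value) rather than float32 (4 bytes per value).

```python
import torch


def to_policy_input(observation, device):
    """Hypothetical helper mirroring the conversion loop described above."""
    out = {}
    for name, tensor in observation.items():
        tensor = tensor.to(device)  # transfer first, so later ops run on `device`
        if "image" in name:
            # uint8 HWC in [0, 255] -> float32 CHW in [0, 1]
            tensor = tensor.type(torch.float32) / 255
            tensor = tensor.permute(2, 0, 1).contiguous()
        out[name] = tensor.unsqueeze(0)  # add the batch dimension
    return out


device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
obs = {
    # Key names are assumptions chosen for illustration.
    "observation.images.top": torch.randint(0, 256, (48, 64, 3), dtype=torch.uint8),
    "observation.state": torch.zeros(6),
}
batch = to_policy_input(obs, device)
print({name: (tuple(t.shape), t.dtype) for name, t in batch.items()})
```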

Comments suppressed due to low confidence (1)

lerobot/common/robot_devices/control_utils.py:112

Add a unit test to verify that after predict_action, all observation tensors are on the specified device and have the correct shape and dtype.

            observation[name] = observation[name].to(device)
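
One possible shape for such a test is sketched below. It assumes that predict_action hands the converted dict to a `select_action` method on the policy, which the diff does not show, and that images have 3 channels. `RecordingPolicy`, `check_batch`, and the observation keys are hypothetical. To stay self-contained, the sketch feeds a hand-built batch to the test double instead of importing predict_action.

```python
import torch


class RecordingPolicy:
    """Hypothetical test double that remembers the batch it was given."""

    def __init__(self, action_dim=6):
        self.action_dim = action_dim
        self.seen = None

    def select_action(self, batch):
        self.seen = batch
        return torch.zeros(1, self.action_dim)


def check_batch(batch, device):
    """Raise AssertionError unless every tensor is on `device`, batched, and well typed."""
    for name, t in batch.items():
        assert t.device.type == device.type, f"{name} is on {t.device}"
        assert t.shape[0] == 1, f"{name} lacks a batch dimension"
        if "image" in name:
            assert t.dtype == torch.float32, f"{name} has dtype {t.dtype}"
            assert t.ndim == 4 and t.shape[1] == 3, f"{name} is not (1, 3, H, W)"
            assert 0.0 <= t.min().item() and t.max().item() <= 1.0, f"{name} not in [0, 1]"


# In a real test one would call predict_action(obs, policy, device, use_amp=False)
# and then inspect policy.seen. Here a hand-built batch stands in for that call.
device = torch.device("cpu")
policy = RecordingPolicy()
policy.select_action({
    "observation.images.top": torch.rand(1, 3, 48, 64),
    "observation.state": torch.zeros(1, 6),
})
check_batch(policy.seen, device)
```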

Copilot · 2025-07-07T14:20:47Z

lerobot/common/robot_devices/control_utils.py

+            observation[name] = observation[name].to(device)
            if "image" in name:
                observation[name] = observation[name].type(torch.float32) / 255
                observation[name] = observation[name].permute(2, 0, 1).contiguous()
            observation[name] = observation[name].unsqueeze(0)


You can combine device transfer, dtype conversion, and normalization into a single chained call to reduce intermediate allocations, e.g.: observation[name] = observation[name].to(device=device, dtype=torch.float32).div(255).permute(2,0,1).unsqueeze(0).

Suggested change

-            observation[name] = observation[name].to(device)
-            if "image" in name:
-                observation[name] = observation[name].type(torch.float32) / 255
-                observation[name] = observation[name].permute(2, 0, 1).contiguous()
-            observation[name] = observation[name].unsqueeze(0)
+            if "image" in name:
+                observation[name] = observation[name].to(device=device, dtype=torch.float32).div(255).permute(2, 0, 1).unsqueeze(0)
+            else:
+                observation[name] = observation[name].to(device).unsqueeze(0)
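
One difference between the two versions is easy to miss: the chained call has no `.contiguous()`, so its result is a strided view produced by `permute` rather than a tensor in channel-first memory order. The sketch below compares the two on a tiny CPU tensor (the image size is arbitrary). Whether the difference matters depends on the policy: many operations accept non-contiguous input, but some, such as `.view()`, can fail on it.

```python
import torch

img = torch.randint(0, 256, (4, 5, 3), dtype=torch.uint8)  # small HWC image

# Step-by-step version from the diff (keeps .contiguous()).
a = img.type(torch.float32) / 255
a = a.permute(2, 0, 1).contiguous().unsqueeze(0)

# Chained version from the suggestion (no .contiguous()).
b = img.to(device="cpu", dtype=torch.float32).div(255).permute(2, 0, 1).unsqueeze(0)

print("same values:", torch.equal(a, b))
print("a contiguous:", a.is_contiguous())
print("b contiguous:", b.is_contiguous())

# Appending .contiguous() to the chain restores the original memory layout.
c = b.contiguous()
print("c contiguous:", c.is_contiguous())
```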

Copilot · 2025-07-07T14:20:47Z

lerobot/common/robot_devices/control_utils.py

@@ -109,11 +109,11 @@ def predict_action(observation, policy, device, use_amp):
    ):
        # Convert to pytorch format: channel first and float32 in [0,1] with batch dimension
        for name in observation:
+            observation[name] = observation[name].to(device)
            if "image" in name:
                observation[name] = observation[name].type(torch.float32) / 255


[nitpick] Consider using the .float() alias instead of .type(torch.float32) for readability and consistency with common PyTorch code style.

Suggested change

-                observation[name] = observation[name].type(torch.float32) / 255
+                observation[name] = observation[name].float() / 255
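
A quick check that the two spellings agree, as a minimal sketch on a hand-picked uint8 tensor:

```python
import torch

x = torch.tensor([0, 128, 255], dtype=torch.uint8)

a = x.type(torch.float32) / 255  # spelling used in the diff
b = x.float() / 255              # spelling from the suggestion

print(a.dtype, b.dtype)
print(torch.equal(a, b))
```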

perform pyforch conversion using GPU

8ef1922

shantanuparab-tr requested a review from Copilot July 7, 2025 14:19

Copilot AI reviewed Jul 7, 2025

demobo-com changed the title ~~predicti_action function performs pyTorch conversion using GPU~~ predict_action function performs pyTorch conversion using GPU Jul 16, 2025
