You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, all GPU kernel execution calls are preceded by copying the data from the host to the GPU and followed by another set of copies in the opposite direction. This is done to ensure correctness; however, it is not an efficient approach.
To improve performance, we can infer if the data used by the GPU are read/written and avoid copies when they are node needed.