Skip to content

Remove redundant copies from/to GPU #112

@pthomadakis

Description

@pthomadakis

Currently, all GPU kernel execution calls are preceded by copying the data from the host to the GPU and followed by another set of copies in the opposite direction. This is done to ensure correctness; however, it is not an efficient approach.
To improve performance, we can infer if the data used by the GPU are read/written and avoid copies when they are node needed.

Metadata

Metadata

Assignees

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions