Open
Description
Is your feature request related to a problem? Please describe.
#3520 implemented multiplication operator with overflow checks.
Please refer to the overflow check algorithms for: int8, int16, int32, int64 in the PR.
And NVIDIA/cccl#3755 is developing the similar integral arithmetic with overflow checks APIs.
After cccl PR is done, we need to switch to the algorthim in cccl to get better perf.
Describe the solution you'd like
It's better to use the APIs in cccl repo when the development is done.
- Add
- subtract
- multiply, refer to Ansi multiply #3520
- divide