Argmax(cpp wrapper) #784
Conversation
ctests/test_triton_argmax.cpp (outdated)

#include "flag_gems/operators.h"
#include "torch/torch.h"

TEST(reduction_op_test, argmax) {
It's better to test with larger shapes and with different dtypes.
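As a rough sketch of what that could look like (the flag_gems::argmax entry point and its torch-like signature are assumptions based on this PR, not a confirmed API):

#include "flag_gems/operators.h"
#include "torch/torch.h"
#include <gtest/gtest.h>

// Sketch: exercise a larger shape across several dtypes and compare
// against the ATen reference. flag_gems::argmax(input) is assumed here.
TEST(reduction_op_test, argmax_large_shapes_and_dtypes) {
  const torch::Device device(torch::kCUDA, 0);
  for (const auto dtype : {torch::kFloat32, torch::kFloat16, torch::kBFloat16}) {
    torch::Tensor input =
        torch::randn({1024, 1024}, torch::dtype(dtype).device(device));
    torch::Tensor out_torch = torch::argmax(input);
    torch::Tensor out_triton = flag_gems::argmax(input);  // assumed entry point
    EXPECT_TRUE(torch::equal(out_torch, out_triton));
  }
}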
ctests/test_triton_argmax.cpp (outdated)

TEST(reduction_op_test, argmax_keepdim_option) {
  const torch::Device device(torch::kCUDA, 0);
  torch::Tensor input = torch::randn({2, 2, 2, 2}, device);
ditto, the test shape is too small
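A sketch of the same keepdim test with a non-trivial shape, again assuming flag_gems::argmax mirrors torch::argmax(self, dim, keepdim):

TEST(reduction_op_test, argmax_keepdim_option) {
  const torch::Device device(torch::kCUDA, 0);
  torch::Tensor input = torch::randn({64, 128, 256}, device);
  torch::Tensor out_torch = torch::argmax(input, /* dim = */ 1, /* keepdim = */ true);
  torch::Tensor out_triton = flag_gems::argmax(input, 1, true);  // assumed signature
  // keepdim should retain the reduced dim as size 1
  EXPECT_EQ(out_triton.sizes().vec(), std::vector<int64_t>({64, 1, 256}));
  EXPECT_TRUE(torch::equal(out_torch, out_triton));
}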
lib/argmax.cpp (outdated)

auto shape = self.sizes().vec();
for (auto &s : shape) {
  s = 1;
}
Suggested change:
-  auto shape = self.sizes().vec();
-  for (auto &s : shape) {
-    s = 1;
-  }
+  const auto shape = std::vector<int64_t>(self.dim(), 1);
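Constructing the all-ones shape in one expression states the intent directly (every dim of the reduced output collapses to 1) and drops the mutate-in-a-loop pattern.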
lib/argmax.cpp
Outdated
c10::DeviceGuard guard(self.device()); | ||
c10::cuda::CUDAStream stream = c10::cuda::getCurrentCUDAStream(); | ||
|
||
f1(stream, mid_size, 1, 1, 4 /*num_warps*/, 2 /*num_stages*/, self, mid_value, mid_index, M, block_size); |
Suggested change:
-  f1(stream, mid_size, 1, 1, 4 /*num_warps*/, 2 /*num_stages*/, self, mid_value, mid_index, M, block_size);
+  f1(stream, mid_size, 1, 1, /* num_warps = */ 4, /* num_stages = */ 2, self, mid_value, mid_index, M, block_size);
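Placing the comment before the value, as in /* num_warps = */ 4, follows the argument-comment style that tools such as clang-tidy's bugprone-argument-comment check can match against the actual parameter names; trailing comments like 4 /*num_warps*/ are invisible to such checks.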
lib/argmax.cpp (outdated)

f1(stream, mid_size, 1, 1, 4 /*num_warps*/, 2 /*num_stages*/, self, mid_value, mid_index, M, block_size);

f2(stream, 1, 1, 1, 4 /*num_warps*/, 2 /*num_stages*/, mid_value, mid_index, out, mid_size, block_mid);
ditto
lib/argmax.cpp (outdated)

int64_t dim_val = dim.value();
dim_val = at::maybe_wrap_dim(dim_val, self.dim());

auto shape = self.sizes();
Suggested change:
-  auto shape = self.sizes();
+  const auto& shape = self.sizes();
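Two notes on this block. Tensor::sizes() returns a c10::IntArrayRef, a non-owning view over the tensor's size array, so binding it as const auto& makes the borrowed, view-like nature explicit (plain auto would still only copy the view, not the data). And at::maybe_wrap_dim is what normalizes negative dims before indexing, e.g.:

// maybe_wrap_dim maps a negative dim into [0, ndim):
// at::maybe_wrap_dim(-1, /* dim_post_expr = */ 3) returns 2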
LGTM
PR Category
Operator

Type of Change
Refactor

Description
C++ wrapper for the argmax operator.

Issue
Progress
Performance